Cloudera
293 Case Studies
A Cloudera Case Study
PRGX Global, a leader in accounts-payable recovery audits serving top global retailers, was struggling to analyze 2.3 PB of structured and unstructured data (including 150M emails) with a legacy RDBMS. Long processing lead times and difficulty handling unstructured data limited auditors’ time for discovery and slowed the company’s ability to scale and innovate.
PRGX built a Hadoop-based HiPer platform with Cloudera and Talend—using Spark, Hive/Pig, Impala, Cloudera Search and Navigator—to ingest, prepare, search and analyze massive data sets. The new platform delivered on average 9–10x faster processing (45x in one case), cut storage footprint/costs substantially (about one-fourth footprint / ~25% reported decrease), reduced processes from 140 to 6 hours in some cases, increased auditor productivity and recoveries, and enabled new products and services.
Tushar Sachdev
Chief Information Officer and Senior Vice President, PRGX