Cloudera
A Cloudera Case Study
An American technology company builds large-scale ETL pipelines that ingest advertising and member data to create identity profiles and analytics for personalizing ads. It needed a more cost-effective, flexible way to access, store, and process massive volumes of data. Running CDH on-premises across East and West data centers made it difficult to combine workflows, adopt newer capabilities, and scale efficiently for variable demand.
The company migrated 100% of its data to Cloudera Data Platform (CDP) Public Cloud on Azure, deploying four production Data Hubs (Kafka for ingest; Hive and Spark for processing) with a high-availability (HA) data lake and SDX-based security. Using CDP autoscaling and automation, it now runs at about 25% capacity on weekdays and scales from 150 static nodes up to 250–500 nodes for weekend peaks. The results include faster Spark jobs, consolidated workflows, lower operational overhead, and stable East/West production environments.
American Technology Company