Case Study: Edmunds.com achieves 6x faster ad-hoc analytics and 60% faster reporting with Databricks

Improving Data Integrity and Customer Experience with Analytics

Edmunds.com, a leading car-shopping site serving about 20 million visitors a month, faced rapidly growing data volumes (tens to hundreds of TB) and widespread missing or inaccurate vehicle details on listing pages. Engineers spent large amounts of time maintaining ad hoc MapReduce/Oozie reporting jobs and could not easily quantify data-quality gaps or the ROI of various data sources used to decode VINs and enrich listings.

Edmunds adopted Apache Spark via the Databricks managed service to simplify cluster management, democratize data access, audit APIs, and build Spark SQL workflows for VIN decoding and reporting. The change sped ad hoc analysis six-fold, cut report-job processing time by about 60% (e.g., 30–60 min queries to 5–10 min), reduced weekly reporting effort from ~10–15 to 3–5 hours, and improved site data quality by roughly 35%, enabling better recommendations and faster, data-driven product decisions.

Open case study document...

Edmunds.com

Shaun Elliott

Technical Lead of Service Engineering

Databricks

460 Case Studies

Case Study: Edmunds.com achieves 6x faster ad-hoc analytics and 60% faster reporting with Databricks

Improving Data Integrity and Customer Experience with Analytics

Edmunds.com

Databricks

Was it helpful? Rate this case study:

Thank you for your feedback.