Databricks
398 Case Studies
A Databricks Case Study
SafeGraph is a geospatial data company that aggregates points of interest, foot traffic, and other non‑PII datasets to help retailers, real estate firms, financial services, and government make location‑based decisions. With petabytes of historical data and a lean engineering team, they faced slow, resource‑intensive processing, incomplete and mismatched datasets, and difficulty securely sharing large datasets with customers.
SafeGraph adopted the Databricks Lakehouse Platform—using Delta Lake to unify storage, Delta Sharing for secure real‑time data exchange, and MLflow for model deployment—alongside AWS Redshift, EKS, and Elasticsearch. This streamlined pipelines, improved data quality, and democratized access, delivering 2–10x faster spatial queries and indexing, a 50% reduction in peak memory consumption, and cutting data access times from months to minutes.
Felix Cheung
VP of Engineering