Baseten
A Baseten Case Study
Latent Health uses Baseten to power AI-native medical search and clinical question answering for major U.S. health systems. As its multimodal workflows grew across notes, labs, medications, documents, and media, Latent faced growing infrastructure complexity, difficulty managing many models, reliability risks, performance pressure, and slower experimentation, all while trying to serve millions of documents daily.
Baseten helped Latent deploy and orchestrate its compound AI system with Baseten Chains and improved runtime performance through the Baseten Inference Stack. By splitting workflows into independently scalable Chainlets and optimizing inference, Baseten enabled 99.999% uptime, 600 ms P90 end-to-end latency, and 6x higher GPU utilization, while simplifying deployment, observability, and experimentation.
Allan Bishop
Head of Engineering