IBM StreamSets
16 Case Studies
A IBM StreamSets Case Study
GSK, the global healthcare company, needed a way to give more than 10,000 scientists self-service access to millions of diverse data elements across over 1,000 data sources to accelerate drug discovery and R&D. To support this, GSK worked with IBM StreamSets and its StreamSets platform to build a Data Center of Excellence for faster, cleaner data delivery.
With IBM StreamSets, GSK automated pipeline creation and drift handling, enabling self-service data flow without disrupting critical research operations. The result was the ability to deploy a million pipelines for thousands of data sources, helping GSK scale analytics across 10,000+ scientists and 6 Pb of stored data while improving speed to market for new healthcare solutions.
Mark Ramsey
Former Chief Data & Analytics Officer