IBM StreamSets
16 Case Studies
A IBM StreamSets Case Study
Availity, a healthcare IT company that connects health plans and providers in real time, needed a new data strategy to enable real-time analytics, self-service data access and a DataOps culture while keeping total cost of ownership low and ensuring performance and reliability at scale for hundreds of terabytes of data. To meet those goals Availity adopted IBM StreamSets, using StreamSets Data Collector and StreamSets Dataflow Performance Manager as core components of a stream‑first real‑time data repository.
IBM StreamSets implemented streaming ingestion and microservice-style data pipelines on Availity’s Cloudera platform, integrating sources and sinks such as Kafka, Oracle, Kudu, HBase, ElasticSearch, Solr and HDFS. The solution syncs systems in real time at throughput of thousands of records per second (avoiding out‑of‑sync cyclical loads), enables self‑service DataOps so general data engineers build pipelines, and has reduced TCO while accelerating innovation and improving reliability and customer insights.
Jeff Currier
Senior Manager, Data Management & Analytics