Case Study: Reserved.ai catches incidents and finds root cause faster with Zebrium

A Zebrium Case Study

Preview of the Reserved.ai Case Study

Automatically Catching Incidents and Root Cause in a Cloud Native Stack

Reserved.ai is a cloud cost optimization platform that runs a cloud-native stack on AWS with Kubernetes, Flask, Celery, Redis, PostgreSQL, Airflow, and React. As the company grew, its teams were overwhelmed by huge volumes of logs and relied on manual tools like vi, grep, awk, and custom scripts, making incident troubleshooting slow and painful. They turned to Zebrium to improve log analysis and incident detection.

Zebrium’s log and metrics collectors were deployed quickly, and within minutes Reserved.ai was able to use machine-learning-based event categorization, timelines, filters, and automatic incident correlation to pinpoint problems faster. Zebrium caught an AWS API change that could have caused a service disruption and helped the team identify the root cause quickly. Since adopting Zebrium, Reserved.ai says it has saved countless hours on debugging and is spending far more time building features instead of hunting through logs.


Open case study document...

Reserved.ai

Aran Khanna

CEO & Co-founder


Zebrium

4 Case Studies