Case Study: Karolinska Institutet achieves secure, cost-effective genomic data processing with Hopsworks

A Hopsworks Case Study

Preview of the Karolinska Institutet Case Study

Data preparation, cataloging, and feature management for a massive genomic dataset containing sensitive information

Karolinska Institutet, one of the world’s leading medical universities, needed a way to securely manage and process massive genomic and other omics datasets containing sensitive information. Its researchers were working with hundreds of terabytes of sequencing data and required a platform that could support notebooks, Apache Spark, PySpark, TensorFlow, and GPUs without the complexity and cost of running separate infrastructure for each study.

Hopsworks provided a GDPR-compliant, multi-tenant data science platform built around projects, enabling secure collaboration on a shared cluster while keeping each study's data isolated. With Hopsworks, Karolinska Institutet achieved easier collaboration, faster large-scale data processing, and an integrated environment for data preparation, cataloging, and feature management, while cutting storage and compute costs by 90%.



Karolinska Institutet

Luciano Dani

Head of IT Archiving

