KNIME
53 Case Studies
A KNIME Case Study
Soluzioni Informatiche needed a way to quickly gather and harmonize hazard information from large volumes of Safety Data Sheets (SDS) instead of relying on slow, error-prone manual review. The company wanted to extract risk, safety, hazard, and precautionary phrases from PDFs to support chemical risk management and compliance.
Using KNIME Analytics Platform, KNIME built a workflow that reads SDS PDFs, parses text with Tika, and uses text mining, string manipulation, regex, and try/catch logic to identify CAS numbers, product names, and required phrases. The solution automated extraction across thousands of SDS files in less than an hour, cut repetitive work from about two minutes to a few seconds per SDS, and reduced human error by allowing phrase lists to be updated through Excel.