Centogene | Codino

Data management platform for biomarker discovery and research

Industry

Biotechnology & Healthcare

Country

Germany


Client Overview

Centogene is a biotechnology company specializing in genetic diagnostics for rare diseases, biomarker discovery, and clinical trial support. It provides genetic testing services, identifies and validates biomarkers, and helps recruit patients with specific rare genetic conditions for clinical trials. The platform we developed for Centogene helps researchers manage mass spectrometry-based metabolomics experiments. It organizes experimental data hierarchically, supports quality control measures, and applies data processing techniques such as drift correction, peak mapping, and statistical analysis to support biomarker discovery.

Client Needs

Clear Data Visualization

Quality Control Integration

Custom Machine Learning Algorithms

Statistical Analysis Tools

Centogene needed a system to visualize, manage, and analyze mass spectrometry-based metabolomics data. It had to handle complex experimental data structures, support quality control, and run the data processing techniques that biomarker discovery requires.

Services Provided

Data Processing Pipeline: Implemented algorithms for drift correction, peak mapping, and data normalization.

Quality Control Integration: Enabled tracking and management of QC samples and flagged data anomalies.

Custom Machine Learning Algorithms: Developed specialized ML algorithms for clustering and feature selection in metabolomics data.

Statistical Analysis Tools: Integrated tools for calculating relative standard deviation (RSD), detection rates, and other key metrics.
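
To make the statistical metrics above concrete, here is a minimal Python sketch of how RSD and a detection rate could be computed for a single metabolite feature. It is an illustration only, not the code delivered to Centogene: the function names, the use of the sample standard deviation, and the convention that a missing value is stored as None are all assumptions.

```python
from statistics import mean, stdev
from typing import Optional, Sequence


def rsd_percent(intensities: Sequence[float]) -> float:
    """Relative standard deviation (sample SD divided by mean), in percent."""
    if len(intensities) < 2:
        raise ValueError("RSD needs at least two values")
    avg = mean(intensities)
    if avg == 0:
        raise ValueError("RSD is undefined when the mean is zero")
    return stdev(intensities) / avg * 100.0


def detection_rate(
    intensities: Sequence[Optional[float]], threshold: float = 0.0
) -> float:
    """Fraction of samples in which the feature exceeds the threshold.

    Assumption: a value of None means the feature was not measured.
    """
    if not intensities:
        raise ValueError("detection rate needs at least one sample")
    detected = sum(1 for v in intensities if v is not None and v > threshold)
    return detected / len(intensities)
```

In metabolomics workflows, RSD is often computed across repeated QC injections, where a low value suggests the feature is measured reproducibly.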

Scope of Work

Our collaboration with Centogene focused on developing a specialized platform for managing and analyzing mass spectrometry data in the context of rare disease research.

Designed a hierarchical data model to represent experimental structures including batches, measurements, and samples.
Implemented quality control mechanisms to track and manage QC samples and data anomalies.
Developed data processing algorithms for drift correction, peak mapping, and normalization.
Developed custom machine learning algorithms for clustering and feature selection on metabolomics data.
Implemented statistical analysis tools to calculate key metrics such as RSD and detection rates.
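
As an illustration of the drift correction step listed above, the following sketch fits a straight line to the QC sample intensities over injection order and rescales every measurement so that the fitted QC trend becomes flat. The case study does not say which drift model the platform uses; the linear fit, the function names, and the parameter layout here are assumptions chosen for brevity, and smoother models such as LOESS are a common alternative.

```python
from typing import List, Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two QC points to fit a trend")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("QC points must have distinct injection orders")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def correct_drift(
    order: Sequence[float],
    intensities: Sequence[float],
    is_qc: Sequence[bool],
) -> List[float]:
    """Divide each intensity by the QC trend and rescale to the QC mean."""
    qc_x = [o for o, q in zip(order, is_qc) if q]
    qc_y = [v for v, q in zip(intensities, is_qc) if q]
    slope, intercept = fit_line(qc_x, qc_y)
    reference = sum(qc_y) / len(qc_y)  # level the corrected QCs should sit at
    corrected = []
    for o, v in zip(order, intensities):
        trend = slope * o + intercept
        if trend <= 0:
            raise ValueError("fitted trend is not positive; model is unsuitable")
        corrected.append(v * reference / trend)
    return corrected
```

With this approach, QC samples that lie exactly on the fitted line all end up at the same corrected intensity, which makes residual drift easy to spot.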

Technologies Used

React

Python

PostgreSQL

Docker

Redis

Development Process

The project commenced with an in-depth analysis of Centogene's requirements for managing mass spectrometry data. We designed a hierarchical data model to represent batches, measurements, and samples. The development focused on integrating quality control processes, implementing data processing algorithms, and ensuring secure user access through API tokens. Custom machine learning algorithms were developed to enhance biomarker discovery capabilities.
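
The hierarchical data model described above could be sketched roughly as follows. The nesting (a batch holds measurements, a measurement holds samples), the field names, and the `is_qc` flag are assumptions made for illustration; the actual schema of the platform is not described in this case study.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Sample:
    sample_id: str
    is_qc: bool = False  # hypothetical flag marking quality control samples


@dataclass
class Measurement:
    measurement_id: str
    injection_order: int
    samples: List[Sample] = field(default_factory=list)


@dataclass
class Batch:
    batch_id: str
    measurements: List[Measurement] = field(default_factory=list)

    def qc_samples(self) -> List[Sample]:
        """Collect every QC sample across all measurements in this batch."""
        return [s for m in self.measurements for s in m.samples if s.is_qc]


# Usage: one batch with two measurements, one of which carries a QC sample.
batch = Batch(
    batch_id="B1",
    measurements=[
        Measurement("M1", 1, [Sample("S1"), Sample("QC1", is_qc=True)]),
        Measurement("M2", 2, [Sample("S2")]),
    ],
)
```

Keeping the QC flag on the sample itself lets quality control queries walk the same hierarchy as regular analysis code.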
