Skip to content
View EttoreRocchi's full-sized avatar
💻
4 8 15 16 23 42 EXECUTE
💻
4 8 15 16 23 42 EXECUTE

Block or report EttoreRocchi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
EttoreRocchi/README.md

Ettore Rocchi

About me

Physics background, biomedical mission.

I'm a PhD researcher at the University of Bologna, where I develop computational methods to predict antimicrobial resistance, discover patient phenotypes, and make sense of high-dimensional omics data. My work spans MALDI-TOF mass spectrometry, multi-omics integration, and metagenomics, always with a focus on interpretability and clinical impact. Part of the Physics4MedicineLab group and the Multi-Omics and Health-Care Data Analytics Unit at Sant'Orsola Hospital.

Website LinkedIn Google Scholar Scopus University Email

GitHub Stats

GitHub followers GitHub stars


MaldiSuite - a Python ecosystem for MALDI-TOF spectral processing and analysis in antimicrobial resistance research. Visit the MaldiSuite website.

Three sklearn-compatible packages that chain into an end-to-end clinical AMR pipeline: preprocess with MaldiAMRKit, harmonise across batches/sites with MaldiBatchKit, classify with MaldiDeepKit.


What I Work On

Research focus Description
Antimicrobial resistance prediction
MALDI-TOF mass spectrometry · supervised learning · deep neural networks
Machine learning on mass spectra and clinical data to anticipate resistance phenotypes prior to culture-based diagnostics.
MaldiSuite, ResPredAI
Multi-centre data harmonisation
Batch-effect correction · ComBat · batch-mixing diagnostics
Batch-effect correction methods for machine learning models on high-throughput data across instruments and clinical sites.
MaldiBatchKit, combatlearn
Computational patient phenotyping
unsupervised clustering · survival analysis · multi-state modelling
Discovery of clinically meaningful subgroups from heterogeneous patient cohorts, with prognostic and trajectory modelling.
phenocluster
Genomics and metagenomics analyses
microbial community networks · pathogen detection · structural and somatic variants · mutational signatures · CRISPR-Cas9 editing
Network-based modelling of microbial communities for pathogen detection, characterisation of structural and somatic variants and of mutational signatures, with broader interests in computational tools for CRISPR-Cas9 genome editing.
CATS, APOBECSeeker, CAMISIM-BrokenStick

Tech Stack

Python scikit-learn PyTorch Bash Snakemake Nextflow


Other Projects

Project Description
combatlearn Scikit-learn compatible ComBat batch-effect correction
ResPredAI AI model to predict resistances in Gram-negative bloodstream infections
phenocluster Unsupervised clinical phenotype discovery with survival and multistate modeling
CATS Automated Cas9 nuclease comparison with ClinVar integration
CAMISIM-BrokenStick Broken stick model extension for metagenomic simulation
APOBECSeeker APOBEC-style mutation identification from multiple sequence alignment
nestkit Nested cross-validation with calibration, threshold optimization, and statistical tests

Selected Publications

For a complete list, see my Google Scholar profile.

Pinned Loading

  1. MaldiSuite MaldiSuite Public

    MaldiSuite - a Python ecosystem for MALDI-TOF spectral processing and analysis in antimicrobial resistance research

    CSS

  2. MaldiAMRKit MaldiAMRKit Public

    Comprehensive toolkit for MALDI-TOF mass spectrometry data preprocessing for antimicrobial resistance (AMR) prediction purposes

    Jupyter Notebook 3 1

  3. combatlearn combatlearn Public

    The ComBat algorithm for a learning framework (scikit-learn compatible)

    Python 6 1

  4. ResPredAI ResPredAI Public

    Implementation of the pipeline described in the work "Artificial intelligence model to predict resistances in Gram-negative bloodstream infections" by Bonazzetti et al., npj Digit. Med. 8, 319 (2025)

    Python 5

  5. phenocluster phenocluster Public

    PhenoCluster: a flexible data-driven framework for identifying clinical phenotypes using latent class and profile analysis

    Python 1

  6. nestkit nestkit Public

    nestkit: a nested cross-validation toolkit for scikit-learn

    Jupyter Notebook 1