This repository contains the code for the ML analyses performed in Chapter 4 of my PhD thesis, "Interpretable Machine Learning on omics data for biomarker discovery in Parkinson's disease". The project performs Parkinson's disease (PD) case-control classification on two data sources: blood plasma metabolomics measurements taken at the baseline clinical visit in the LuxPARK cohort, and whole blood transcriptomics data from the PPMI cohort, using both baseline measurements and dynamic features engineered from a short time series of 4 timepoints. The study evaluates different feature selection strategies. The goal was to build and test a collection of ML models and, most interestingly, to identify molecular and higher-level functional representations associated with PD diagnosis.