About.
DPhil candidate at the Oxford Big Data Institute, working at the intersection of machine learning and health data.
Bio
I am a final-year DPhil student at the Oxford Big Data Institute, University of Oxford. My research focuses on machine learning for clinical decision-making, with a thesis on antimicrobial stewardship and treatment optimisation in hospital settings.
Before Oxford, I worked as a student research assistant at the Chair of Experimental Bioinformatics at TUM, contributing to projects in systems biology and systems medicine. Outside research, I enjoy cooking, building and breaking things.
Research Interests
- Machine Learning for Healthcare
- Natural Language Processing
- Reinforcement Learning
- Electronic Health Records
- Antimicrobial Resistance
- Bioinformatics
- Systems Medicine
Education
- 2020 – Present DPhil (PhD) in Health Data Science University of Oxford thesis · Machine Learning for Antimicrobial Stewardship and Treatment Optimisation in Hospital Settings
- 2017 – 2020 BSc in Bioinformatics Joint LMU München & Technical University of Munich (TUM) thesis · Linking gene expression data to transcription factor binding information
Publications
- "Machine learning and clinician predictions of antibiotic resistance in Enterobacterales bloodstream infections" Journal of Infection, 2025. doi:10.1016/j.jinf.2024.106388
- "Community-acquired pneumonia identification from electronic health records in the absence of a gold standard: A Bayesian latent class analysis" PLOS Digital Health, 2025. doi:10.1371/journal.pdig.0000936
- "Interplay between C-reactive protein responses and antibiotic prescribing in people with suspected infection" BMC Infectious Diseases, 2025. doi:10.1186/s12879-025-11381-9
- "Transformers and large language models are efficient feature extractors for electronic health record studies" Communications Medicine, 2025. doi:10.1038/s43856-025-00790-1
- "Changes in the investigation and management of suspected myocardial infarction and injury during COVID-19: a multi-centre study using routinely collected healthcare data" Frontiers in Cardiovascular Medicine, 2024. doi:10.3389/fcvm.2024.1406608
- "Distinct patterns of vital sign and inflammatory marker responses in adults with suspected bloodstream infection" Journal of Infection, 2024. doi:10.1016/j.jinf.2024.106156
- "Predicting individual patient and hospital-level discharge using machine learning" Communications Medicine, 2024. doi:10.1038/s43856-024-00673-x
- "TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors" GigaScience, 2023. doi:10.1093/gigascience/giad026
- "BiCoN: Network-constrained biclustering of patients and omics data" Bioinformatics, 2021. doi:10.1093/bioinformatics/btaa1076
- "DIGGER: exploring the functional role of alternative splicing in protein interactions" Nucleic Acids Research, 2021. doi:10.1093/nar/gkaa768
- "Exploring the SARS-CoV-2 virus-host-drug interactome for drug repurposing" Nature Communications, 2020. doi:10.1038/s41467-020-17189-2
Skills
- Python
- R
- SQL
- Rust
- Machine Learning
- NLP / LLMs
- Reinforcement Learning
- Statistical Modelling
- PyTorch
- Transformers
- FastAPI
- Docker
- AWS
- Terraform
- CI/CD
- Electronic Health Records
- Bioinformatics