About us
We are the Pathogen Informatics and Modelling group at EMBL-EBI (also known as the Lees Group). On this page you can find information about current and past group members, who we work with, and some nice pictures.
Lab members
Samuel Horsfield is a postdoctoral fellow

My PhD focused on pangenome evolution in pathogenic bacteria. I designed and developed ggcaller, a new annotation tool for bacterial pangenomes that works on pangenome graphs. In my Postdoc, I’m working on graph methods to improve Nanopore sequencing for enrichment of the accessory genome, using large population datasets to understand pangenome evolution, and developing large language models to simulate bacterial populations.
Languages and tools: C++, Rython, R
Organisms: Bacteria (S. pneumoniae, E. coli and many more) and viruses (polio).
shorsfield at ebi dot ac dot uk
John Lees is a research group leader

My previous work has attempted to understand the pathogenesis and transmission of infectious diseases. I primarily work on statistical genetics, genomic epidemiology and genome evolution of pathogenic bacteria. I have also worked on infectious disease modelling, GPU algorithms, and visualisation. I write methods and software to scale these analyses and make them more accessible to others in attempts to democratise bioinformatics.
Languages and tools: C++, CUDA, python, R, rust, stan
Organisms: Any microbe, but especially Streptococcal bacteria.
My personal website is www.johnlees.me and you can find my papers here.
jlees at ebi dot ac dot uk
Leonie Lorenz is a PhD student

In my PhD, I will focus on jointly using epidemiological and genomic data for developing mathematical models of pathogen transmission. While the spreading and evolution of pathogens are often analysed separately, exploiting both data streams together as input for epidemiological models could increase the models’ predictive power and result in new insights.
I studied biomathematics at the University of Greifswald, Germany. For my master thesis, I developed a Boolean model for the regulation of apoptosis at the Max Delbrück Center for Molecular Medicine in the Helmholtz Association.
Languages and tools: python, R
llorenz at ebi dot ac dot uk
Neil MacAlasdair is an EIPOD fellow

My fellowship project studies speciation in microbes – particularly in two genera of upper respiratory tract bacteria, Neisseria and Haemophilus – using large collections of whole-genome sequence data. These are opportune genera for this research because each contains well-sampled disease causing species, they inhabit the same niche (the human nasopharynx) and have similar life histories, but are distantly related. To advance this work, I use and develop pangenomic methods to cope with the scale and diversity of genus-wide data.
Previously, I was a Scientia Fellow with Jukka Corander at the University of Oslo, where I primarily studied the global population structure of Haemophilus influenzae and developed methods to infer recombination-free phylogenies from core genome alignments derived from pangenome methods such as panaroo. Prior to that I was a PhD student at the Wellcome Sanger Institute with Stephen Bentley, during which I studied the global population strucure and pangenome of Neisseria meningitidis.
macalasdair at ebi dot ac dot uk
Anlei Peng is an intern

I am working on automating quality control and model fitting in PopPUNK for clustering large genome datasets at the sub-species level.
I did my Bachelor’s degree and integrated master’s degree at the University of Cambridge with a focus on Genetics and Systems Biology.
Languages: Python, R
Organisms: I worked on Orientia tsutsugamushi in my master’s project in Dr Jeanne Salje’s group at CIMR.
apeng at ebi dot ac dot uk
Víctor Rodríguez Bouza is a Postdoctoral Fellow.

After high school, I started a BSc in Bioinformatics, but I decided to change after a year to Mathematics and Physics, both in the Universidad de Oviedo (Asturias, Spain). After a MSc in High Energy Physics (in the UAB in Barcelona, Spain), I did my PhD thesis in the same field in the Universidad de Oviedo, as a member of the CMS Collaboration at CERN. Now I have returned to the bio-sciences and started as a postdoctoral fellow in the Lees Group.
Languages, tools: Python, C++, R, ROOT.
Organisms Quarks: I have worked mainly in top quark physics.
vrbouza at ebi dot ac dot uk
Matt Russell is a postdoctoral fellow.

Currently, I’m working on a project to catalogue and characterise bacterial capsules, which are important for both infection and as vaccination targets, across all sequenced bacteria. In addition to this, I am improving the GPU backend of the dust
stochastic simulation package. The idea is that we can exploit the dependency structure of the underlying model to automatically produce a more efficient set of GPU kernels.
My background is in applied mathematics, mathematical biology, and computer science. My PhD was supervised by Oliver Jensen and Tobias Galla at the University of Manchester and focussed on mathematical models of transport processes in spatially disordered media. After that, I was a postdoc at the University of Nottingham working with Bindi Brook on models of chemokine gradient formation and immune cell migration.
Languages and tools: C++, CUDA, Mathematica, R, Rust, Stan
mjrussell at ebi dot ac dot uk
Jacqueline Toussaint is a PhD student

I’m broadly focused on developing more efficient and accurate methods for performing genome-wide association studies (GWAS) in bacteria, with a particular focus on antibiotic resistance and the possibility of expanding to real-time analysis. I’m also working on developing methods to improve the speed and consistency of bacterial genome annotation.
I previously studied biochemistry and microbiology at Montana State University, USA, with my Bachelor’s thesis research focused on generating and analyzing multi-omics datasets. I’m excited to further my education in the lab of Dr. John Lees at EMBL-EBI and the University of Cambridge, UK.
Languages and tools: python, R, Rust, Stan
Organisms: I work with a diverse set of microbes, including Streptococcus, Klebsiella, and Neisseria.
lilyjacqueline at ebi dot ac dot uk
Johanna von Wachsmann is a PhD student (based in Rob Finn’s lab).

I did my master’s degree in bioinformatics with a focus on advanced algorithms at the Freie Universität Berlin. I wrote my master’s thesis under the supervision of Dr. John Lees.
I am now working on developing tools and algorithms for the analysis of large metagenomic datasets.
Languages: C++, Python, R
wachsmannj at ebi dot ac dot uk
Ouli Xie is an ESPOD fellow

I am an infectious diseases physician-researcher with an interest in applied bacterial genomics to understand how pathogens evolve and spread. I completed my PhD at the Doherty Institute, Australia, with Steven Tong and Mark Davies. In my PhD I examined the clinical and evolutionary intersection between pathogenic streptococci integrating large-scale pathogen genomics, clinical, and surveillance data.
In my fellowship, I work across the Lees Group, Bateman Group and Harrison Group at EMBL-EBI and the Wellcome Sanger Institute to study determinants of Staphylococcus aureus carriage. In particular, I examine co-evolution and competition between Staphylococcus aureus and the nasal microbiome during carriage integrating large-scale pathogen, microbiome, and host data.
Languages and tools: Python, R
Organisms: Staphylococcus aureus, streptococci, microbiome
oxie at ebi dot ac dot uk
Alumni
Daniel Anderson (2024-2025) was a rehomed PhD student, supervised by Zam Iqbal. Now at Basecamp Research.
Raymond Cheng (2022-2025) was a visiting PhD student, co-supervised with Stephen Bentley at the Wellcome Sanger Institute. Now in Nick Croucher’s group at Imperial College London.
Joel Hellewell (2022-2025) was a postdoc, then EIPOD fellow, and founding group member. He’s now at MSF and the Jean Golding Institute at the University of Bristol.
Alireza Tajmirriahi (2025) was a research intern from Aalto University.
Aya Ben Taghaline (2025) was a master’s student from Université Paris-Saclay.
Ines Sanchez-Hombria (2025) was an intern.
Andrea Sanchez Serrano (2025) was a visiting PhD student from the University of Valencia.
Oluwaremilekun Grace Ajakaye (2024) was a visiting scientist (based at Adekunle Ajasin University).
Lale Maouloud (2024) was a master’s student from the University of Rouen Normandy.
Antoine Andréoletti (2024) was a master’s student from the Ecole Polytechnique in Paris.
Berk Gonenc (2023) was an intern.
Oladipo Omotosho (2023) was a visiting scientist on an EMBL training fellowship (based at the University of Ibidan).
Tendai Washaya (2023) was a visiting PhD student (based at the University of Zimbabwe).
Tommi Mäklin (2023) was a visiting postdoc on the Theory@EMBL visitor scheme (based at the University of Helsinki).
Timothy Russell (2023) was a visiting scientist (based at London School of Hygiene and Tropical Medicine).
Gherard Batisti Biffignandi (2022) was a visiting PhD student (based at the University of Pavia). Now at the University of Cambridge (Kate Baker’s lab) and still a visitor at PIM.
Bruhad Dave (2022) was an intern, now at the Wellcome Sanger Institute.
Social and photos
2025
Group retreat, Center Parcs. October 2025




Collage of destruction
EMBO course: Genomics and Modelling. Sept 2025

Group away day, Cambridge/Bury St Edmunds. February 2025



Gracious winners
2024
Winter 2024


Brought to you by Chouffre
Public engagement. Oct 2024


Group retreat, Colchester. August 2024



Cultural exchange dinner (jackets)

Minigolf with Augustus I

pints ~ years as postdoc


Temporary group member Hargreaves the cat
Microbiology society conference
Sam presenting his work on GNASTy

Sam after presenting his work on GNASTy
Group away day, Cambridge. April 2024


Lane 1 (high achievers)

Lane 2 (remedial bowling class)
2023
Xmas dinner 2023


Group retreat, Suffolk. July 2023





The finest minds

The finest minds (part 2)

Pub bike May 2023 – the hailstorm




2022

Infection Biology Transversal Theme Retreat

Group retreat at London Zoo (Gorilla sadly not in our group)
