I’m a permanent researcher / INRIA Starting Faculty at INRIA, the French National Research Institute for Computer Science and Automation. I’m based at the Rennes Research Center, where I’m part of the broader GenScale project team.
I work in the area of computational genomics, with a particular focus on pathogens and antibiotic resistance, and the goal of achieving their rapid diagnosis and real-time surveillance. To do so, I develop novel algorithms, data structures, software tools, and genomic databases, which are then provided to the scientific community as building blocks for larger efforts. I’m particularly interested in non-traditional applications of portable genomic technologies, such as nanopore sequencing and CRISPR-based tests, as well as in moving computation from large computational clusters to ordinary laptops and developing comprehensive sequence data search engines.
Download my CV.
Postdoctoral Fellow, Research Associate, 2017–2021
Harvard Medical School (Department of Biomedical Informatics) & Harvard TH Chan School of Public Health (Center for Communicable Disease Dynamics)
PhD in Computer Science, 2013–2016
Université Paris-Est (Gaspard Monge Institute), France
BSc, MSc in Math. Computer Science, 2018–2013
Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Czech Republic
For a full publication list, see my Google Scholar page.
A rapid, efficient, and exact metagenomic classifier based on k-mer propagation. Written in Python. See http://prophyle.github.io.
Rapid prediction of antibiotic resistance. Tool, pipeline, library and databases (S. pneumoniae and N. gonorrhoeae) for rapid inference of antibiotic resistance and susceptibility by genomic neighbor typing using nanopore sequencing. Written in Python/Snakemake/Make. See https://github.com/c2-d2/rase.
Tool for rapid and memory-efficient computation of simplitigs. Written in C++. See http://github.com/prophyle/prophasm..
An online variant and consensus caller. Call genomic consensus directly from an unsorted SAM/BAM stream. Written in C++. See http://github.com/karel-brinda/ococo.
An exact k-mer index based on the Burrows-Wheeler Transform. Co-developed with Kamil Salikhov. Written in C. See http://github.com/prophyle/prophex.
A format for simulating sequencing reads evaluating read mappers and an associated framework. Written in Snakemake/Python. See http://rnftools.github.io.
A simulator of dynamic read mapping. Written in Python/Snakemake. See http://github.com/karel-brinda/dymas.
Advanced filtering and tagging of SAM/BAM alignments using Python expressions. See http://github.com/karel-brinda/samsift.
Tool for computing a distance matrix from a core genome alignment. Written in C++. See http://github.com/c2-d2/disty.
Simulator of nanopore reads (a fork of the NanoSim package). See http://github.com/karel-brinda/nanosim-h.
Snakemake bioinformatics library (retired). See http://github.com/karel-brinda/smbl.
I’m hiring! If you are interested in working with me as a student (M1, M2, or PhD) or a postdoc on a topic related to rapid diagnostics of antibiotic resistance resistance, sequence data search engines, or computational metagenomics, please contact me at email@example.com.
Concepts in Genome Analysis (BMIF 201) (Fall 2019; TA)
Instructors: Profs. Shamil R. Sunyaev, Michael Baym, Cheng-Zhong Zhang, and Heng Li
The course focused on quantitative aspects of genetics and genomics, including computational and statistical methods of genomic analysis.
Assistive Technology (01ASTE) (Falls 2010–2012 ; Instructor)
Software Project (01SWP1, 01SWP2) (Falls and springs 2010–2012 ; Supervisor)
BBC World Service – Science in Action – 13 Feb 2020
Our paper about rapid diagnostics of antibiotic resistance was covered by BBC World Service in the show Science in Action (13 Feb 2020; starts at 8.10 minutes). 2020-BBC-ScienceInAction-GNT.mp3
The Bioinformatics Chat – Spectrum-preserving string sets and simplitigs – 28 Feb 2020
Our paper about simplitigs for an efficient and scalable representation of de Bruijn graph was covered by the The Bioinformatics Chat podcast series (#42, 28 Feb 2020). 2020-TheBioinformaticsChat-Simplitigs.mp3