I am seeking to apply my knowledge in computer science and statistics to understand real world data. I have interdisciplinary background spanning complex systems, Big Data, machine learning, biostatistics and genomics. I have completed a PhD in which I applied clustering and PCA to epigenomics data and discovered new insights on the coupling between replication and epigenetics. I studied the human microbiota during two years at MetaGenoPolis (MGP), an innovative research center. We aim at improving human health by developing strategies (eg. nutritional, therapeutical, preventive…) to restore dysbiosed microbiota with our industrial and academical partners. I currently work in the statistical genetics group at the Pasteur Institut where I apply my software development and data science skills to quantify the impact of the human genome variation on diverse health parameters. For example, I developed an imputation method tailored for GWAS Summary statistics (https://gitlab.pasteur.fr/statistical-genetics/raiss). I am mainly interested to understand the link between pleiotropy and biological function. I am implicated in the JASS project (http://jass.pasteur.fr/index.html) that aim at analysing jointly numerous GWAS datasets. By doing so, we aim at increasing statistical power and to understand the molecular pathways underlying pleiotropy.
Bioinformatics program for PhD students 2020-2021
This mandatory training program, required for all Institut Pasteur PhD students, begins with a group of common core courses including a knowledge base in reproducible research, R Programming and Statistics. Each student then can […]
(FR) Consortium EpiGenCOV (Epidemiologie Génétique de la COVID19)
Sorry, this entry is only available in French.
JOBIM 2021 pilot project – Gender Speaking Differences in Academia
In the JOBIM 2021 conference, we are launching a pilot project on gender speaking differences in academia. Through this evidence-based and mixed-method study, we intend to answer the following question: how to create the conditions for gender-equal expression in scientific conferences? Objective […]
MGMM : Missingness Aware Gaussian Mixture Models
Although missing data are prevalent in applications, existing implementations of Gaussian mixture models (GMMs) require complete data. Standard practice is to perform complete case analysis or imputation prior to model fitting. Both approaches have […]
Multi-trait analyses using public summary statistics from genome-wide association studies (GWASs) are becoming increasingly popular. A constraint of multi-trait methods is that they require complete summary data for all traits. Although methods for the […]
JASS – Joint Analysis of (GWAS) Summary Statistics
JASS is a python package that handles the computation of the joint statistics over sets of selected GWAS results, and the interactive exploration of the results through a web interface. The generation of joint […]
2021Multitrait GWAS to connect disease variants and biological mechanisms., PLoS Genet 2021 Aug; 17(8): e1009713.
2020JASS: command line and web interface for the joint analysis of GWAS results., NAR Genom Bioinform 2020 Mar; 2(1): lqaa003.
2019RAISS: robust and accurate imputation from summary statistics, Bioinformatics 2019 Nov;35(22):4837-4839.
2019A comprehensive study of metabolite genetics reveals strong pleiotropy and heterogeneity across time and context, Nat Commun 2019 10;10(1):4788.
2015Embryonic stem cell specific “master” replication origins at the heart of the loss of pluripotency, PLoS Comput. Biol. 2015 Feb;11(2):e1003969.
2014From the chromatin interaction network to the organization of the human genome into replication N/U-domains, New Journal of Physics.
2013Human genome replication proceeds through four chromatin states, PLoS Comput. Biol. 2013;9(10):e1003233.
2013Epigenetic regulation of the human genome: coherence between promoter activity and large-scale chromatin environment, Frontiers in Life Science.
2013Modelling adipocytes size distribution, J. Theor. Biol. 2013 Sep;332:89-95.
2012Multiscale analysis of genome-wide replication timing profiles using a wavelet-based signal-processing algorithm, Nat Protoc 2013 Jan;8(1):98-110.
+View full list of publications