Virus discovery and epidemic tracing from high throughput metagenomic sequencing
VIROGENESIS is a research project under the EC H2020 framework (2015-2018), that will develop bioinformatics software applications to fully exploit the potential of high-throughput Next-Generation-Sequencing (NGS) technologies in the field of virology.
NGS analysis pipelines are rapidly becoming part of the routine repertoire of research, clinical and public health laboratories in the public sector and private industry. Middle East Respiratory Syndrome-coronavirus (MERS-CoV) is only one example of the many recent virus discoveries made through analysis of next-generation sequencing (NGS) data. Yet, only a small part of the several millions of short-length sequence fragments generated by NGS machineries, many of which are expected to be of viral origin, can be analysed with current methods in bioinformatics. Even for well-known (pathogenic) viruses, proper epidemiological analyses are becoming more and more difficult due to the lack of bioinformatics tools that can handle the large and growing size of datasets.
The VIROGENESIS consortium will overcome the most pressing bioinformatics obstacles to making full use of NGS by developing a software platform for end-users with tools underpinned by novel algorithms, models and bioinformatics methods. The speed and flexibility of the tools will make it possible to run analyses on a daily basis for a variety of subjects, including diagnostics, phylogeography, phylodynamics and transmission of drug resistance.