Parallel computing for phylogenetic inference
Parallel computing for phylogenetic inference
DACH: Österreich - Deutschland - Schweiz
Disciplines
Biology (35%); Computer Sciences (65%)
Keywords
-
Bioinformatic Tools,
High Performance Computing,
Algorithms,
Modeling,
Parallel Computer Architectures,
Statistical Inference
Bioinformatics is currently facing two challenges. Firstly, significant advances in sequencing techniques (454, Solexa) are generating an unprecedented amount of molecular data. Hence, data acquisition is no longer a problem but rather data analysis, especially in molecular evolution. Secondly, the field of parallel computing is facing the multi-core revolution on general purpose CPUs and a plethora of novel accelerator technologies such as GPUs (Graphics Processing Units). Therefore, parallel computing is becoming feasible at the level of personal computers. Nevertheless, biological data stored in public databases (e.g., GenBank) increases at a significantly higher rate than computational power. Thus, we need to substantially improve the respective models, data structures, and algorithms for data analysis. Here, we propose to tackle these challenges for the two closely related and intertwined fields of Statistical Multiple Sequence Alignment (sMSA) and Phylogenetic Inference (PI) via an integrated approach. We will develop a highly optimized, portable, parallelized, and versatile library for sMSA and PI. We will also improve statistical models for sMSA, search heuristics for PI, and integrate them in a next-generation bioinformatics tool for evolutionary biology. The underlying idea is to develop models and methods in such a way that they will be scalable on all modern multi-core, accelerator, and supercomputer architectures.
During the funding phase of the project we have attained three main goals. First, we developed and made available as open-source code a highly optimized and parallelized phylogenetic likelihood library (PLL) that is already being adopted by the phylogenetics community and has been integrated into respective tools. This was joined work with Alexandros Stamatakis (Heidelberg). Secondly, we introduced IQ-TREE, a fast and effective stochastic algorithm to infer ML trees and an ultrafast approximation for the phylogenetic bootstrap (UFBoot). Both tools were developed by the Austrian sub-project. Third, the sub project headed by Dirk Metzler (Munich) implemented functions for sampling multiple sequence alignments from their joint posterior probability distribution under an explicit model of sequence evolution.
- Universität Wien - 100%
Research Output
- 24159 Citations
- 10 Publications
-
2013
Title Decisive Data Sets in Phylogenomics: Lessons from Studies on the Phylogenetic Relationships of Primarily Wingless Insects DOI 10.1093/molbev/mst196 Type Journal Article Author Dell’Ampio E Journal Molecular Biology and Evolution Pages 239-249 Link Publication -
2015
Title Whole genome analysis of a Vietnamese trio DOI 10.1007/s12038-015-9501-0 Type Journal Article Author Hai D Journal Journal of Biosciences Pages 113-124 -
0
Title Terrace aware phylogenomic inference from supermatrices. Type Other Author Chernomor O -
2013
Title Ultrafast Approximation for Phylogenetic Bootstrap DOI 10.1093/molbev/mst024 Type Journal Article Author Minh B Journal Molecular Biology and Evolution Pages 1188-1195 Link Publication -
2014
Title Terrace Aware Phylogenomic Inference from Supermatrices DOI 10.48550/arxiv.1411.3480 Type Preprint Author Chernomor O -
2014
Title IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies DOI 10.1093/molbev/msu300 Type Journal Article Author Nguyen L Journal Molecular Biology and Evolution Pages 268-274 Link Publication -
2014
Title Split diversity in constrained conservation prioritization using integer linear programming DOI 10.1111/2041-210x.12299 Type Journal Article Author Chernomor O Journal Methods in Ecology and Evolution Pages 83-91 Link Publication -
2014
Title Discovery of the first light-dependent protochlorophyllide oxidoreductase in anoxygenic phototrophic bacteria DOI 10.1111/mmi.12719 Type Journal Article Author Kaschner M Journal Molecular Microbiology Pages 1066-1078 Link Publication -
2014
Title A novel Fibroblast Growth Factor Receptor family member promotes neuronal outgrowth and synaptic plasticity in Aplysia DOI 10.1007/s00726-014-1803-2 Type Journal Article Author Pollak D Journal Amino Acids Pages 2477-2488 Link Publication -
2014
Title The Phylogenetic Likelihood Library DOI 10.1093/sysbio/syu084 Type Journal Article Author Flouri T Journal Systematic Biology Pages 356-362 Link Publication