News
7/25/24: BAli-Phy 4.0-beta15 is available for download (See NEWS).
BAli-Phy 4.0 will likely be released around Mar 2025.
08/24/21: BAli-Phy 3.6.1 released - Download
Bug fixes for BES (See release notes)
03/02/21: Paper - BAli-Phy version 3: Model-based co-estimation of Alignment and Phylogeny
Introduction
BAli-Phy is software by Ben Redelings and Marc Suchard that estimates multiple sequence alignments and evolutionary trees from DNA, amino acid, or codon sequences. It uses likelihood-based evolutionary models of substitutions and insertions and deletions to place gaps.
High alignment accuracy: Redelings (2014) showed that BAli-Phy had 3.5 times fewer alignment errors than MUSCLEv4 and MAFFT on simulated data:
Eliminate bias: Fletcher and Yang (2010) showed that relying on a ClustalW alignment estimate could lead to a 99% false-positive rate in detecting positive selection. Evolutionary trees and branch lengths that are inferred from a single alignment can also be biased if the alignment is ambiguous. BAli-Phy solves the problem of alignment bias by using MCMC to estimate evolutionary trees, positive selection, and branch lengths while simultaneously averaging over alternative alignments in a Bayesian paradigm.
uncertain certain |
. | . | . | . | 3 | 1 | 0 | . | . | . | . | . | . | . | 3 | 2 | 0 | . | . | . | . | . | . | . | 3 | 3 | 0 | . | . | . | . | . | . | . | 3 | 4 | 0 | . | . | . | . | . | . | . | 3 | 5 | 0 | . | . | . | . | . | . | . | 3 | 6 | 0 | . | . | . | . | . | . | . | 3 | 7 | 0 | . | . | . | . | . | . | . | |
Thermotoga | D | E | V | E | I | I | G | L | S | Y | E | I | K | K | T | V | - | - | - | V | T | S | V | E | M | F | R | K | E | L | D | E | G | I | A | G | D | N | V | G | C | L | L | R | G | I | D | K | D | E | V | E | R | G | Q | V | L | A | - | - | - | - | - | A | P | G | S | I | K | P | H | K | R | F |
Anacystis | E | T | I | E | I | V | G | L | R | - | D | T | R | S | T | T | - | - | - | V | T | G | V | E | M | F | Q | K | T | L | D | E | G | L | A | G | D | N | V | G | L | L | L | R | G | I | Q | K | T | D | I | E | R | G | M | V | L | A | - | - | - | - | - | K | P | G | S | I | T | P | H | T | K | F |
Escheria | E | E | V | E | I | V | G | I | K | - | E | T | Q | K | S | T | - | - | - | C | T | G | V | E | M | F | R | K | L | L | D | E | G | R | A | G | E | N | V | G | V | L | L | R | G | I | K | R | E | E | I | E | R | G | Q | V | L | A | - | - | - | - | - | K | P | G | T | I | K | P | H | T | K | F |
Pyrococcus | E | V | V | I | F | E | P | A | S | T | I | F | H | K | P | I | Q | G | E | V | K | S | I | E | M | H | H | E | P | L | E | E | A | L | P | G | D | N | I | G | F | N | V | R | G | V | S | K | N | D | I | K | R | G | D | V | A | G | H | T | T | N | - | P | P | T | V | V | R | T | K | D | T | F |
Halobacterium | D | N | V | S | F | Q | P | S | D | V | G | - | - | - | - | - | - | G | E | V | K | T | I | E | M | H | H | E | E | V | P | N | A | E | P | G | D | N | V | G | F | N | V | R | G | I | G | K | D | D | I | R | R | G | D | V | C | G | P | A | D | D | - | P | P | S | V | A | - | - | - | D | T | F |
Methanococcus | D | K | V | V | F | E | P | A | G | A | I | - | - | - | - | - | - | G | E | I | K | T | V | E | M | H | H | E | Q | L | P | S | A | E | P | G | D | N | I | G | F | N | V | R | G | V | G | K | K | D | I | K | R | G | D | V | L | G | H | T | T | N | - | P | P | T | V | A | - | - | - | T | D | F |
Aeropyrum | D | K | V | V | F | M | P | P | G | V | V | - | - | - | - | - | - | G | E | V | R | S | I | E | M | H | Y | Q | Q | L | Q | Q | A | E | P | G | D | N | I | G | F | A | V | R | G | V | S | K | S | D | I | K | R | G | D | V | A | G | H | L | D | K | - | P | P | T | V | A | - | - | - | E | E | F |
Sulfolobus | D | K | I | V | F | M | P | V | G | K | I | - | - | - | - | - | - | G | E | V | R | S | I | E | T | H | H | T | K | I | D | K | A | E | P | G | D | N | I | G | F | N | V | R | G | V | E | K | K | D | V | K | R | G | D | V | A | G | S | V | Q | N | - | P | P | T | V | A | - | - | - | D | E | F |
Giardia | M | K | V | V | F | A | P | T | S | Q | V | - | - | - | - | - | - | S | E | V | K | S | V | E | M | H | H | E | E | L | K | K | A | G | P | G | D | N | V | G | F | N | V | R | G | L | A | V | K | D | L | K | K | G | Y | V | V | G | D | V | T | N | D | P | P | V | G | C | - | - | - | K | S | F |
Homo | M | V | V | T | F | A | P | V | N | V | T | - | - | - | - | - | - | T | E | V | K | S | V | E | M | H | H | E | A | L | S | E | A | L | P | G | D | N | V | G | F | N | V | K | N | V | S | V | K | D | V | R | R | G | N | V | A | G | D | S | K | N | D | P | P | M | E | A | - | - | - | A | G | F |
Euglena | D | V | V | T | F | A | P | N | N | L | T | - | - | - | - | - | - | T | E | V | K | S | V | E | M | H | H | E | A | L | T | E | A | V | P | G | D | N | V | G | F | N | V | K | N | V | S | V | K | D | I | R | R | G | Y | V | A | S | N | A | K | N | D | P | A | K | E | A | - | - | - | A | D | F |
Nicotiana | M | V | V | T | F | G | P | T | G | L | T | - | - | - | - | - | - | T | E | V | K | S | V | E | M | H | H | E | A | L | Q | E | A | L | P | G | D | N | V | G | F | N | V | K | N | V | A | V | K | D | L | K | R | G | F | V | A | S | N | S | K | D | D | P | A | K | G | A | - | - | - | A | S | F |
This ambiguity can be displayed graphically in an alignment uncertainty (AU) plot.
Model-based alignment: BAli-Phy can make use of complex substitution models while estimating alignments (and trees). These include the free-rates and Gamma+INV models, codon models such as the M3 and M8 models, and covarion models such as Tuffley-Steel.
Fixed-alignment: BAli-Phy can also estimate phylogenies from a fixed alignment (like MrBayes and BEAST) using complex substitution models like GTR+gamma.
Multi-gene: BAli-Phy automatically estimates relative rates for each gene, as described in the Manual and the tutorial.
Ancestral sequence reconstruction: BAli-Phy automatically reconstructs ancestral sequences (with gaps) for each gene, while averaging over both topological and alignment uncertainty, as described in the Manual.
References
- Redelings BD BAli-Phy version 3: model-based co-estimation of alignment and phylogeny.
(2021) Bioinformatics 37(18). [doi:10.1093/bioinformatics/btab129] - Redelings BD Erasing Errors Due to Alignment Ambiguity When Estimating Positive Selection.
(2014) Mol. Biol. Evo. 31(8). [doi:10.1093/molbev/msu174] - Redelings BD and Suchard MA Incorporating indel information into phylogeny estimation for rapidly emerging pathogens.
(2007) BMC Evolutionary Biology, 7:40. [doi: 10.1186/1471-2148-7-40] - Suchard MA and Redelings BD BAli-Phy: simultaneous Bayesian inference of alignment and phylogeny.
(2006) Bioinformatics, 22:2047-2048. [doi:10.1093/bioinformatics/btl175] - Redelings BD and Suchard MA Joint Bayesian Estimation of Alignment and Phylogeny.
(2005) Systematic Biology, 54(3):401-418 [doi:10.1080/10635150590947041]
comments and suggestions: benjamin . redelings * gmail + com