Phylogenetic Mixtures: Concentration of Measure in the Large-Tree Limit

Loading...
Thumbnail Image
Penn collection
Statistics Papers
Degree type
Discipline
Subject
phylogenetic reconstruction
random trees
concentration of measure
Statistics and Probability
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Mossel, Elchanan
Roch, Sébastien
Contributor
Abstract

The reconstruction of phylogenies from DNA or protein sequences is a major task of computational evolutionary biology. Common phenomena, notably variations in mutation rates across genomes and incongruences between gene lineage histories, often make it necessary to model molecular data as originating from a mixture of phylogenies. Such mixed models play an increasingly important role in practice. Using concentration of measure techniques, we show that mixtures of large trees are typically identifiable. We also derive sequence-length requirements for high-probability reconstruction.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2012-01-01
Journal title
The Annals of Applied Probability
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation
Collection