
Statistics Papers
Document Type
Journal Article
Date of this Version
2004
Publication Source
Transactions of the American Mathematical Society
Volume
356
Start Page
2379
Last Page
2404
Abstract
We apply the theory of markov random fields on trees to derive a phase transition in the number of samples needed in order to reconstruct phylogenies.
We consider the Cavender-Farris-Neyman model of evolution on trees, where all the inner nodes have degree at least 3, and the net transition on each edge is bounded by ∈. Motivated by a conjecture by M. Steel, we show that if 2(1 − 2∈)2 > 1, then for balanced trees, the topology of the underlying tree, having n leaves, can be reconstructed from O(log n) samples (characters) at the leaves. On the other hand, we show that if 2(1 − 2∈)2 < 1, then there exist topologies which require at least nΩ(1) samples for reconstruction.
Our results are the first rigorous results to establish the role of phase transitions for markov random fields on trees as studied in probability, statistical physics and information theory to the study of phylogenies in mathematical biology.
Keywords
phylogeny, phase transition, Ising model
Recommended Citation
Mossel, E. (2004). Phase Transitions in Phylogeny. Transactions of the American Mathematical Society, 356 2379-2404. Retrieved from https://repository.upenn.edu/statistics_papers/186
Date Posted: 27 November 2017
This document has been peer reviewed.