63281.pdf (807.27 kB)
Download filePhylogenetic Tree Reconstruction Accuracy and Model Fit when Proportions of Variable Sites Change across the Tree
journal contribution
posted on 2023-05-17, 02:26 authored by Grievink, LS, Penny, D, Hendy, MD, Barbara HollandBarbara HollandCommonly used phylogenetic models assume a homogeneous process through time in all parts of the tree. However, it is known that these models can be too simplistic as they do not account for nonhomogeneous lineage-specific properties. In particular, it is now widely recognized that as constraints on sequences evolve, the proportion and positions of variable sites can vary between lineages causing heterotachy. The extent to which this model misspecification affects tree reconstruction is still unknown. Here, we evaluate the effect of changes in the proportions and positions of variable sites on model fit and tree estimation. We consider 5 current models of nucleotide sequence evolution in a Bayesian Markov chain Monte Carlo framework as well as maximum parsimony (MP). We show that for a tree with 4 lineages where 2 nonsister taxa undergo a change in the proportion of variable sites tree reconstruction under the best-fitting model, which is chosen using a relative test, often results in the wrong tree. In this case, we found that an absolute test of model fit is a better predictor of tree estimation accuracy. We also found further evidence that MP is not immune to heterotachy. In addition, we show that increased sampling of taxa that have undergone a change in proportion and positions of variable sites is critical for accurate tree reconstruction.
History
Publication title
Systematic BiologyVolume
59Pagination
288-297ISSN
1063-5157Department/School
School of Natural SciencesPublisher
Oxford University PressPlace of publication
Oxford, EnglandRights statement
© The Author(s) 2010. Published by Oxford University Press on behalf of Society of Systematic Biologists.Repository Status
- Open