Steenwyk, Jacob L. and Buida, Thomas J. and Li, Yuanning and Shen, Xing-Xing and Rokas, Antonis and Hejnol, Andreas (2020) ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference. PLOS Biology, 18 (12). e3001007. ISSN 1545-7885
Abstract
Highly divergent sites in multiple sequence alignments (MSAs), which can stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Thus, several different trimming strategies have been developed for identifying and removing these sites prior to phylogenetic inference. However, a recent study reported that doing so can worsen inference, underscoring the need for alternative alignment trimming strategies. Here, we introduce ClipKIT, an alignment trimming software that, rather than identifying and removing putatively phylogenetically uninformative sites, instead aims to identify and retain parsimony-informative sites, which are known to be phylogenetically informative. To test the efficacy of ClipKIT, we examined the accuracy and support of phylogenies inferred from 14 different alignment trimming strategies, including those implemented in ClipKIT, across nearly 140,000 alignments from a broad sampling of evolutionary histories. Phylogenies inferred from ClipKIT-trimmed alignments are accurate, robust, and time saving. Furthermore, ClipKIT consistently outperformed other trimming methods across diverse datasets, suggesting that strategies based on identifying and retaining parsimony-informative sites provide a robust framework for alignment trimming.
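To make the notion of a parsimony-informative site concrete, here is a minimal Python sketch. It uses the standard textbook definition: a site is parsimony-informative when at least two different character states each occur in at least two sequences. This is an illustration only, not ClipKIT's implementation. The function names, the set of characters treated as gaps, and the toy alignment are all assumptions made for the example.

```python
from collections import Counter

# Assumption: only these characters are treated as gaps or missing data.
GAP_CHARS = {"-", "?"}


def is_parsimony_informative(column):
    """Return True if at least two character states each occur at least twice."""
    counts = Counter(c.upper() for c in column if c not in GAP_CHARS)
    states_seen_twice = sum(1 for n in counts.values() if n >= 2)
    return states_seen_twice >= 2


def keep_parsimony_informative(alignment):
    """Keep only parsimony-informative columns of a {name: sequence} alignment."""
    names = list(alignment)
    seqs = [alignment[name] for name in names]
    if len({len(s) for s in seqs}) > 1:
        raise ValueError("all sequences in an alignment must have equal length")
    kept = [i for i, col in enumerate(zip(*seqs)) if is_parsimony_informative(col)]
    return {name: "".join(alignment[name][i] for i in kept) for name in names}


# Hypothetical toy alignment: columns 1, 3 and 4 (0-based) are informative.
toy = {
    "taxonA": "ACGTA-",
    "taxonB": "ACGAAT",
    "taxonC": "ATGAC-",
    "taxonD": "ATGTCT",
}
trimmed = keep_parsimony_informative(toy)
print(trimmed)
```

In this toy case the constant columns and the gap-rich final column are dropped, leaving three sites per sequence. The abstract refers to several ClipKIT strategies; this sketch covers only the basic idea of retaining parsimony-informative sites.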
Item Type: | Article |
---|---|
Subjects: | Pacific Library > Biological Science |
Depositing User: | Unnamed user with email support@pacificlibrary.org |
Date Deposited: | 09 Feb 2023 07:59 |
Last Modified: | 30 May 2024 13:33 |
URI: | http://editor.classicopenlibrary.com/id/eprint/42 |