Sensitivity of the polyDetect computational pipeline for phylogenetic analyses

Jessica M. Storer, Louisiana State University
Jerilyn A. Walker, Louisiana State University
Vallmer E. Jordan, Louisiana State University
Mark A. Batzer, Louisiana State University

Abstract

© 2020 Alu elements are powerful phylogenetic markers. The combination of a recently-developed computational pipeline, polyDetect, with high copy number Alu insertions has previously been utilized to help resolve the Papio baboon phylogeny with high statistical support. Here, the polyDetect method was applied to the highly contentious Cebidae phylogeny within New World monkeys (NWM). The polyDetect method relies on conserved homology/identity of short read sequence data among the species being compared to accurately map predicted shared Alu insertions to each unique flanking sequence. The results of this comprehensive assessment indicate that there were insufficient sequence homology/identity stretches in non-repeated DNA sequences among the four Cebidae genera analyzed in this study to make this strategy phylogenetically viable. The ~20 million years of evolutionary divergence of the Cebidae genera has resulted in random sequence decay within the short read data, obscuring potentially orthologous elements in the species tested. These analyses suggest that the polyDetect pipeline is best suited to resolving phylogenies of more recently diverged lineages when high-quality assembled genomes are not available for the taxa of interest.