Statistical assignment of DNA sequences using Bayesian phylogenetics

Research output: Contribution to journalJournal articleResearchpeer-review

We provide a new automated statistical method for DNA barcoding based on a Bayesian phylogenetic analysis. The method is based on automated database sequence retrieval, alignment, and phylogenetic analysis using a custom-built program for Bayesian phylogenetic analysis. We show on real data that the method outperforms Blast searches as a measure of confidence and can help eliminate 80% of all false assignment based on best Blast hit. However, the most important advance of the method is that it provides statistically meaningful measures of confidence. We apply the method to a re-analysis of previously published ancient DNA data and show that, with high statistical confidence, most of the published sequences are in fact of Neanderthal origin. However, there are several cases of chimeric sequences that are comprised of a combination of both Neanderthal and modern human DNA.
Original languageEnglish
JournalSystematic Biology
Volume57
Issue number5
Pages (from-to)750-757
Number of pages8
ISSN1063-5157
DOIs
Publication statusPublished - 2008

Bibliographical note

Keywords: Animals; Base Sequence; Bayes Theorem; Computer Simulation; DNA; Insects; Models, Genetic; Models, Statistical; Phylogeny; Plants

ID: 9542024