Phylogenomics using low-depth whole genome sequencing: A case study with the olive tribe
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
Phylogenomics using low-depth whole genome sequencing : A case study with the olive tribe. / Olofsson, Jill K; Cantera, Isabel; Van de Paer, Céline; Hong-Wa, Cynthia; Zedane, Loubab; Dunning, Luke T; Alberti, Adriana; Christin, Pascal-Antoine; Besnard, Guillaume.
In: Molecular Ecology Resources, Vol. 19, No. 4, 07.2019, p. 877-892.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Phylogenomics using low-depth whole genome sequencing
T2 - A case study with the olive tribe
AU - Olofsson, Jill K
AU - Cantera, Isabel
AU - Van de Paer, Céline
AU - Hong-Wa, Cynthia
AU - Zedane, Loubab
AU - Dunning, Luke T
AU - Alberti, Adriana
AU - Christin, Pascal-Antoine
AU - Besnard, Guillaume
N1 - © 2019 John Wiley & Sons Ltd.
PY - 2019/7
Y1 - 2019/7
N2 - Species trees have traditionally been inferred from a few selected markers, and genome-wide investigations remain largely restricted to model organisms or small groups of species for which sampling of fresh material is available, leaving out most of the existing and historical species diversity. The genomes of an increasing number of species, including specimens extracted from natural history collections, are being sequenced at low depth. While these data sets are widely used to analyse organelle genomes, the nuclear fraction is generally ignored. Here we evaluate different reference-based methods to infer phylogenies of large taxonomic groups from such data sets. Using the example of the Oleeae tribe, a worldwide-distributed group, we build phylogenies based on single nucleotide polymorphisms (SNPs) obtained using two reference genomes (the olive and ash trees). The inferred phylogenies are overall congruent, yet present differences that might reflect the effect of distance to the reference on the amount of missing data. To limit this issue, genome complexity was reduced by using pairs of orthologous coding sequences as the reference, thus allowing us to combine SNPs obtained using two distinct references. Concatenated and coalescence trees based on these combined SNPs suggest events of incomplete lineage sorting and/or hybridization during the diversification of this large phylogenetic group. Our results show that genome-wide phylogenetic trees can be inferred from low-depth sequence data sets for eukaryote groups with complex genomes, and histories of reticulate evolution. This opens new avenues for large-scale phylogenomics and biogeographical analyses covering both the extant and the historical diversity stored in museum collections.
AB - Species trees have traditionally been inferred from a few selected markers, and genome-wide investigations remain largely restricted to model organisms or small groups of species for which sampling of fresh material is available, leaving out most of the existing and historical species diversity. The genomes of an increasing number of species, including specimens extracted from natural history collections, are being sequenced at low depth. While these data sets are widely used to analyse organelle genomes, the nuclear fraction is generally ignored. Here we evaluate different reference-based methods to infer phylogenies of large taxonomic groups from such data sets. Using the example of the Oleeae tribe, a worldwide-distributed group, we build phylogenies based on single nucleotide polymorphisms (SNPs) obtained using two reference genomes (the olive and ash trees). The inferred phylogenies are overall congruent, yet present differences that might reflect the effect of distance to the reference on the amount of missing data. To limit this issue, genome complexity was reduced by using pairs of orthologous coding sequences as the reference, thus allowing us to combine SNPs obtained using two distinct references. Concatenated and coalescence trees based on these combined SNPs suggest events of incomplete lineage sorting and/or hybridization during the diversification of this large phylogenetic group. Our results show that genome-wide phylogenetic trees can be inferred from low-depth sequence data sets for eukaryote groups with complex genomes, and histories of reticulate evolution. This opens new avenues for large-scale phylogenomics and biogeographical analyses covering both the extant and the historical diversity stored in museum collections.
KW - Fraxinus/classification
KW - Olea/classification
KW - Phylogeny
KW - Polymorphism, Single Nucleotide
KW - Whole Genome Sequencing/methods
U2 - 10.1111/1755-0998.13016
DO - 10.1111/1755-0998.13016
M3 - Journal article
C2 - 30934146
VL - 19
SP - 877
EP - 892
JO - Molecular Ecology
JF - Molecular Ecology
SN - 0962-1083
IS - 4
ER -
ID: 235066470