Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. / Meadows, Jennifer R. S.; Kidd, Jeffrey M.; Wang, Guo Dong; Parker, Heidi G.; Schall, Peter Z.; Bianchi, Matteo; Christmas, Matthew J.; Bougiouri, Katia; Buckley, Reuben M.; Hitte, Christophe; Nguyen, Anthony K.; Wang, Chao; Jagannathan, Vidhya; Niskanen, Julia E.; Frantz, Laurent A. F.; Arumilli, Meharji; Hundi, Sruthi; Lindblad-Toh, Kerstin; Ginja, Catarina; Agustina, Kadek Karang; André, Catherine; Boyko, Adam R.; Davis, Brian W.; Drögemüller, Michaela; Feng, Xin-Yao; Gkagkavouzis, Konstantinos; Iliopoulos, Giorgos; Harris, Alexander C.; Hytönen, Marjo K.; Kalthoff, Daniela C.; Liu, Yan-Hu; Lymberakis, Petros; Poulakakis, Nikolaos; Pires, Ana Elisabete; Racimo, Fernando; Ramos-Almodovar, Fabian; Savolainen, Peter; Venetsani, Semina; Tammen, Imke; Triantafyllidis, Alexandros; vonHoldt, Bridgett; Wayne, Robert K.; Larson, Greger; Nicholas, Frank W.; Lohi, Hannes; Leeb, Tosso; Zhang, Ya-Ping; Ostrander, Elaine A.

In: Genome Biology, Vol. 24, 187, 2023.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Meadows, JRS, Kidd, JM, Wang, GD, Parker, HG, Schall, PZ, Bianchi, M, Christmas, MJ, Bougiouri, K, Buckley, RM, Hitte, C, Nguyen, AK, Wang, C, Jagannathan, V, Niskanen, JE, Frantz, LAF, Arumilli, M, Hundi, S, Lindblad-Toh, K, Ginja, C, Agustina, KK, André, C, Boyko, AR, Davis, BW, Drögemüller, M, Feng, X-Y, Gkagkavouzis, K, Iliopoulos, G, Harris, AC, Hytönen, MK, Kalthoff, DC, Liu, Y-H, Lymberakis, P, Poulakakis, N, Pires, AE, Racimo, F, Ramos-Almodovar, F, Savolainen, P, Venetsani, S, Tammen, I, Triantafyllidis, A, vonHoldt, B, Wayne, RK, Larson, G, Nicholas, FW, Lohi, H, Leeb, T, Zhang, Y-P & Ostrander, EA 2023, 'Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture', Genome Biology, vol. 24, 187. https://doi.org/10.1186/s13059-023-03023-7

APA

Meadows, J. R. S., Kidd, J. M., Wang, G. D., Parker, H. G., Schall, P. Z., Bianchi, M., Christmas, M. J., Bougiouri, K., Buckley, R. M., Hitte, C., Nguyen, A. K., Wang, C., Jagannathan, V., Niskanen, J. E., Frantz, L. A. F., Arumilli, M., Hundi, S., Lindblad-Toh, K., Ginja, C., ... Ostrander, E. A. (2023). Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome Biology, 24, [187]. https://doi.org/10.1186/s13059-023-03023-7

Vancouver

Meadows JRS, Kidd JM, Wang GD, Parker HG, Schall PZ, Bianchi M et al. Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome Biology. 2023;24. 187. https://doi.org/10.1186/s13059-023-03023-7

Author

Meadows, Jennifer R. S. ; Kidd, Jeffrey M. ; Wang, Guo Dong ; Parker, Heidi G. ; Schall, Peter Z. ; Bianchi, Matteo ; Christmas, Matthew J. ; Bougiouri, Katia ; Buckley, Reuben M. ; Hitte, Christophe ; Nguyen, Anthony K. ; Wang, Chao ; Jagannathan, Vidhya ; Niskanen, Julia E. ; Frantz, Laurent A. F. ; Arumilli, Meharji ; Hundi, Sruthi ; Lindblad-Toh, Kerstin ; Ginja, Catarina ; Agustina, Kadek Karang ; André, Catherine ; Boyko, Adam R. ; Davis, Brian W. ; Drögemüller, Michaela ; Feng, Xin-Yao ; Gkagkavouzis, Konstantinos ; Iliopoulos, Giorgos ; Harris, Alexander C. ; Hytönen, Marjo K. ; Kalthoff, Daniela C. ; Liu, Yan-Hu ; Lymberakis, Petros ; Poulakakis, Nikolaos ; Pires, Ana Elisabete ; Racimo, Fernando ; Ramos-Almodovar, Fabian ; Savolainen, Peter ; Venetsani, Semina ; Tammen, Imke ; Triantafyllidis, Alexandros ; vonHoldt, Bridgett ; Wayne, Robert K. ; Larson, Greger ; Nicholas, Frank W. ; Lohi, Hannes ; Leeb, Tosso ; Zhang, Ya-Ping ; Ostrander, Elaine A. / Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. In: Genome Biology. 2023 ; Vol. 24.

Bibtex

@article{ceb4f830b4e24138972364ad6040c5f6,
title = "Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture",
abstract = "BACKGROUND: The international Dog10K project aims to sequence and analyze several thousand canine genomes. Incorporating 20 × data from 1987 individuals, including 1611 dogs (321 breeds), 309 village dogs, 63 wolves, and four coyotes, we identify genomic variation across the canid family, setting the stage for detailed studies of domestication, behavior, morphology, disease susceptibility, and genome architecture and function. RESULTS: We report the analysis of > 48 M single-nucleotide, indel, and structural variants spanning the autosomes, X chromosome, and mitochondria. We discover more than 75% of variation for 239 sampled breeds. Allele sharing analysis indicates that 94.9% of breeds form monophyletic clusters and 25 major clades. German Shepherd Dogs and related breeds show the highest allele sharing with independent breeds from multiple clades. On average, each breed dog differs from the UU_Cfam_GSD_1.0 reference at 26,960 deletions and 14,034 insertions greater than 50 bp, with wolves having 14% more variants. Discovered variants include retrogene insertions from 926 parent genes. To aid functional prioritization, single-nucleotide variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Constrained positions were negatively correlated with allele frequency. Finally, the utility of the Dog10K data as an imputation reference panel is assessed, generating high-confidence calls across varied genotyping platform densities including for breeds not included in the Dog10K collection. CONCLUSIONS: We have developed a dense dataset of 1987 sequenced canids that reveals patterns of allele sharing, identifies likely functional variants, informs breed structure, and enables accurate imputation. Dog10K data are publicly available.",
keywords = "Canine, Demographic history, Dog, Genetic diversity, Genomics, Mitochondrial DNA, Variation",
author = "Meadows, {Jennifer R. S.} and Kidd, {Jeffrey M.} and Wang, {Guo Dong} and Parker, {Heidi G.} and Schall, {Peter Z.} and Matteo Bianchi and Christmas, {Matthew J.} and Katia Bougiouri and Buckley, {Reuben M.} and Christophe Hitte and Nguyen, {Anthony K.} and Chao Wang and Vidhya Jagannathan and Niskanen, {Julia E.} and Frantz, {Laurent A. F.} and Meharji Arumilli and Sruthi Hundi and Kerstin Lindblad-Toh and Catarina Ginja and Agustina, {Kadek Karang} and Catherine Andr{\'e} and Boyko, {Adam R.} and Davis, {Brian W.} and Michaela Dr{\"o}gem{\"u}ller and Xin-Yao Feng and Konstantinos Gkagkavouzis and Giorgos Iliopoulos and Harris, {Alexander C.} and Hyt{\"o}nen, {Marjo K.} and Kalthoff, {Daniela C.} and Yan-Hu Liu and Petros Lymberakis and Nikolaos Poulakakis and Pires, {Ana Elisabete} and Fernando Racimo and Fabian Ramos-Almodovar and Peter Savolainen and Semina Venetsani and Imke Tammen and Alexandros Triantafyllidis and Bridgett vonHoldt and Wayne, {Robert K.} and Greger Larson and Nicholas, {Frank W.} and Hannes Lohi and Tosso Leeb and Ya-Ping Zhang and Ostrander, {Elaine A.}",
note = "Publisher Copyright: {\textcopyright} 2023. BioMed Central Ltd., part of Springer Nature.",
year = "2023",
doi = "10.1186/s13059-023-03023-7",
language = "English",
volume = "24",
journal = "Genome Biology (Online Edition)",
issn = "1474-7596",
publisher = "BioMed Central Ltd.",

}

RIS

TY - JOUR

T1 - Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture

AU - Meadows, Jennifer R. S.

AU - Kidd, Jeffrey M.

AU - Wang, Guo Dong

AU - Parker, Heidi G.

AU - Schall, Peter Z.

AU - Bianchi, Matteo

AU - Christmas, Matthew J.

AU - Bougiouri, Katia

AU - Buckley, Reuben M.

AU - Hitte, Christophe

AU - Nguyen, Anthony K.

AU - Wang, Chao

AU - Jagannathan, Vidhya

AU - Niskanen, Julia E.

AU - Frantz, Laurent A. F.

AU - Arumilli, Meharji

AU - Hundi, Sruthi

AU - Lindblad-Toh, Kerstin

AU - Ginja, Catarina

AU - Agustina, Kadek Karang

AU - André, Catherine

AU - Boyko, Adam R.

AU - Davis, Brian W.

AU - Drögemüller, Michaela

AU - Feng, Xin-Yao

AU - Gkagkavouzis, Konstantinos

AU - Iliopoulos, Giorgos

AU - Harris, Alexander C.

AU - Hytönen, Marjo K.

AU - Kalthoff, Daniela C.

AU - Liu, Yan-Hu

AU - Lymberakis, Petros

AU - Poulakakis, Nikolaos

AU - Pires, Ana Elisabete

AU - Racimo, Fernando

AU - Ramos-Almodovar, Fabian

AU - Savolainen, Peter

AU - Venetsani, Semina

AU - Tammen, Imke

AU - Triantafyllidis, Alexandros

AU - vonHoldt, Bridgett

AU - Wayne, Robert K.

AU - Larson, Greger

AU - Nicholas, Frank W.

AU - Lohi, Hannes

AU - Leeb, Tosso

AU - Zhang, Ya-Ping

AU - Ostrander, Elaine A.

N1 - Publisher Copyright: © 2023. BioMed Central Ltd., part of Springer Nature.

PY - 2023

Y1 - 2023

N2 - BACKGROUND: The international Dog10K project aims to sequence and analyze several thousand canine genomes. Incorporating 20 × data from 1987 individuals, including 1611 dogs (321 breeds), 309 village dogs, 63 wolves, and four coyotes, we identify genomic variation across the canid family, setting the stage for detailed studies of domestication, behavior, morphology, disease susceptibility, and genome architecture and function. RESULTS: We report the analysis of > 48 M single-nucleotide, indel, and structural variants spanning the autosomes, X chromosome, and mitochondria. We discover more than 75% of variation for 239 sampled breeds. Allele sharing analysis indicates that 94.9% of breeds form monophyletic clusters and 25 major clades. German Shepherd Dogs and related breeds show the highest allele sharing with independent breeds from multiple clades. On average, each breed dog differs from the UU_Cfam_GSD_1.0 reference at 26,960 deletions and 14,034 insertions greater than 50 bp, with wolves having 14% more variants. Discovered variants include retrogene insertions from 926 parent genes. To aid functional prioritization, single-nucleotide variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Constrained positions were negatively correlated with allele frequency. Finally, the utility of the Dog10K data as an imputation reference panel is assessed, generating high-confidence calls across varied genotyping platform densities including for breeds not included in the Dog10K collection. CONCLUSIONS: We have developed a dense dataset of 1987 sequenced canids that reveals patterns of allele sharing, identifies likely functional variants, informs breed structure, and enables accurate imputation. Dog10K data are publicly available.

AB - BACKGROUND: The international Dog10K project aims to sequence and analyze several thousand canine genomes. Incorporating 20 × data from 1987 individuals, including 1611 dogs (321 breeds), 309 village dogs, 63 wolves, and four coyotes, we identify genomic variation across the canid family, setting the stage for detailed studies of domestication, behavior, morphology, disease susceptibility, and genome architecture and function. RESULTS: We report the analysis of > 48 M single-nucleotide, indel, and structural variants spanning the autosomes, X chromosome, and mitochondria. We discover more than 75% of variation for 239 sampled breeds. Allele sharing analysis indicates that 94.9% of breeds form monophyletic clusters and 25 major clades. German Shepherd Dogs and related breeds show the highest allele sharing with independent breeds from multiple clades. On average, each breed dog differs from the UU_Cfam_GSD_1.0 reference at 26,960 deletions and 14,034 insertions greater than 50 bp, with wolves having 14% more variants. Discovered variants include retrogene insertions from 926 parent genes. To aid functional prioritization, single-nucleotide variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Constrained positions were negatively correlated with allele frequency. Finally, the utility of the Dog10K data as an imputation reference panel is assessed, generating high-confidence calls across varied genotyping platform densities including for breeds not included in the Dog10K collection. CONCLUSIONS: We have developed a dense dataset of 1987 sequenced canids that reveals patterns of allele sharing, identifies likely functional variants, informs breed structure, and enables accurate imputation. Dog10K data are publicly available.

KW - Canine

KW - Demographic history

KW - Dog

KW - Genetic diversity

KW - Genomics

KW - Mitochondrial DNA

KW - Variation

U2 - 10.1186/s13059-023-03023-7

DO - 10.1186/s13059-023-03023-7

M3 - Journal article

C2 - 37582787

AN - SCOPUS:85168060188

VL - 24

JO - Genome Biology (Online Edition)

JF - Genome Biology (Online Edition)

SN - 1474-7596

M1 - 187

ER -

ID: 362888077