Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components. / Cheng, Jade Yu; Stern, Aaron J.; Racimo, Fernando; Nielsen, Rasmus.

In: Molecular Biology and Evolution, Vol. 39, No. 1, msab294, 2022.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Cheng, JY, Stern, AJ, Racimo, F & Nielsen, R 2022, 'Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components', Molecular Biology and Evolution, vol. 39, no. 1, msab294. https://doi.org/10.1093/molbev/msab294

APA

Cheng, J. Y., Stern, A. J., Racimo, F., & Nielsen, R. (2022). Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components. Molecular Biology and Evolution, 39(1), [msab294]. https://doi.org/10.1093/molbev/msab294

Vancouver

Cheng JY, Stern AJ, Racimo F, Nielsen R. Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components. Molecular Biology and Evolution. 2022;39(1). msab294. https://doi.org/10.1093/molbev/msab294

Author

Cheng, Jade Yu ; Stern, Aaron J. ; Racimo, Fernando ; Nielsen, Rasmus. / Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components. In: Molecular Biology and Evolution. 2022 ; Vol. 39, No. 1.

Bibtex

@article{b4c6c3b9aa89413f926b6ca44e1422bf,
title = "Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components",
abstract = "One of the most powerful and commonly used approaches for detecting local adaptation in the genome is the identification of extreme allele frequency differences between populations. In this article, we present a new maximum likelihood method for finding regions under positive selection. It is based on a Gaussian approximation to allele frequency changes and it incorporates admixture between populations. The method can analyze multiple populations simultaneously and retains power to detect selection signatures specific to ancestry components that are not representative of any extant populations. Using simulated data, we compare our method to related approaches, and show that it is orders of magnitude faster than the state-of-the-art, while retaining similar or higher power for most simulation scenarios. We also apply it to human genomic data and identify loci with extreme genetic differentiation between major geographic groups. Many of the genes identified are previously known selected loci relating to hair pigmentation and morphology, skin, and eye pigmentation. We also identify new candidate regions, including various selected loci in the Native American component of admixed Mexican-Americans. These involve diverse biological functions, such as immunity, fat distribution, food intake, vision, and hair development.",
keywords = "positive selection, admixture, population structure, human evolution, selective sweeps, GENOME-WIDE ASSOCIATION, POSITIVE SELECTION, NATURAL-SELECTION, LINKAGE DISEQUILIBRIUM, GENETIC SIGNATURES, MUTATION, LOCI, SNP, ADAPTATION, DOMINANT",
author = "Cheng, {Jade Yu} and Stern, {Aaron J.} and Fernando Racimo and Rasmus Nielsen",
year = "2022",
doi = "10.1093/molbev/msab294",
language = "English",
volume = "39",
journal = "Molecular Biology and Evolution",
issn = "0737-4038",
publisher = "Oxford University Press",
number = "1",

}

RIS

TY - JOUR

T1 - Detecting Selection in Multiple Populations by Modeling Ancestral Admixture Components

AU - Cheng, Jade Yu

AU - Stern, Aaron J.

AU - Racimo, Fernando

AU - Nielsen, Rasmus

PY - 2022

Y1 - 2022

N2 - One of the most powerful and commonly used approaches for detecting local adaptation in the genome is the identification of extreme allele frequency differences between populations. In this article, we present a new maximum likelihood method for finding regions under positive selection. It is based on a Gaussian approximation to allele frequency changes and it incorporates admixture between populations. The method can analyze multiple populations simultaneously and retains power to detect selection signatures specific to ancestry components that are not representative of any extant populations. Using simulated data, we compare our method to related approaches, and show that it is orders of magnitude faster than the state-of-the-art, while retaining similar or higher power for most simulation scenarios. We also apply it to human genomic data and identify loci with extreme genetic differentiation between major geographic groups. Many of the genes identified are previously known selected loci relating to hair pigmentation and morphology, skin, and eye pigmentation. We also identify new candidate regions, including various selected loci in the Native American component of admixed Mexican-Americans. These involve diverse biological functions, such as immunity, fat distribution, food intake, vision, and hair development.

AB - One of the most powerful and commonly used approaches for detecting local adaptation in the genome is the identification of extreme allele frequency differences between populations. In this article, we present a new maximum likelihood method for finding regions under positive selection. It is based on a Gaussian approximation to allele frequency changes and it incorporates admixture between populations. The method can analyze multiple populations simultaneously and retains power to detect selection signatures specific to ancestry components that are not representative of any extant populations. Using simulated data, we compare our method to related approaches, and show that it is orders of magnitude faster than the state-of-the-art, while retaining similar or higher power for most simulation scenarios. We also apply it to human genomic data and identify loci with extreme genetic differentiation between major geographic groups. Many of the genes identified are previously known selected loci relating to hair pigmentation and morphology, skin, and eye pigmentation. We also identify new candidate regions, including various selected loci in the Native American component of admixed Mexican-Americans. These involve diverse biological functions, such as immunity, fat distribution, food intake, vision, and hair development.

KW - positive selection

KW - admixture

KW - population structure

KW - human evolution

KW - selective sweeps

KW - GENOME-WIDE ASSOCIATION

KW - POSITIVE SELECTION

KW - NATURAL-SELECTION

KW - LINKAGE DISEQUILIBRIUM

KW - GENETIC SIGNATURES

KW - MUTATION

KW - LOCI

KW - SNP

KW - ADAPTATION

KW - DOMINANT

U2 - 10.1093/molbev/msab294

DO - 10.1093/molbev/msab294

M3 - Journal article

C2 - 34626111

VL - 39

JO - Molecular Biology and Evolution

JF - Molecular Biology and Evolution

SN - 0737-4038

IS - 1

M1 - msab294

ER -

ID: 291295429