Quantifying population genetic differentiation from next-generation sequencing data

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Quantifying population genetic differentiation from next-generation sequencing data. / Fumagalli, Matteo; Garrett Vieira, Filipe Jorge; Korneliussen, Thorfinn Sand; Linderoth, Tyler; Huerta-Sánchez, Emilia; Albrechtsen, Anders; Nielsen, Rasmus.

In: Genetics, Vol. 195, No. 3, 11.2013, p. 979-992.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Fumagalli, M, Garrett Vieira, FJ, Korneliussen, TS, Linderoth, T, Huerta-Sánchez, E, Albrechtsen, A & Nielsen, R 2013, 'Quantifying population genetic differentiation from next-generation sequencing data', Genetics, vol. 195, no. 3, pp. 979-992. https://doi.org/10.1534/genetics.113.154740

APA

Fumagalli, M., Garrett Vieira, F. J., Korneliussen, T. S., Linderoth, T., Huerta-Sánchez, E., Albrechtsen, A., & Nielsen, R. (2013). Quantifying population genetic differentiation from next-generation sequencing data. Genetics, 195(3), 979-992. https://doi.org/10.1534/genetics.113.154740

Vancouver

Fumagalli M, Garrett Vieira FJ, Korneliussen TS, Linderoth T, Huerta-Sánchez E, Albrechtsen A et al. Quantifying population genetic differentiation from next-generation sequencing data. Genetics. 2013 Nov;195(3):979-992. https://doi.org/10.1534/genetics.113.154740

Author

Fumagalli, Matteo ; Garrett Vieira, Filipe Jorge ; Korneliussen, Thorfinn Sand ; Linderoth, Tyler ; Huerta-Sánchez, Emilia ; Albrechtsen, Anders ; Nielsen, Rasmus. / Quantifying population genetic differentiation from next-generation sequencing data. In: Genetics. 2013 ; Vol. 195, No. 3. pp. 979-992.

Bibtex

@article{65856f0d36fa4ef78fc3d915fa990ba1,
title = "Quantifying population genetic differentiation from next-generation sequencing data",
abstract = "Over the last few years, new high-throughput DNA sequencing technologies have dramatically increased speed and reduced sequencing costs. However, the use of these sequencing technologies is often challenged by errors and biases associated with the bioinformatical methods used for analyzing the data. In particular, the use of na{\~A}¯ve methods to identify polymorphic sites and infer genotypes can inflate downstream analyses. Recently, explicit modeling of genotype probability distributions has been proposed as a method for taking genotype call uncertainty into account. Based on this idea, we propose a novel method for quantifying population genetic differentiation from next-generation sequencing data. In addition, we present a strategy to investigate population structure via Principal Components Analysis. Through extensive simulations, we compare the new method herein proposed to approaches based on genotype calling and demonstrate a marked improvement in estimation accuracy for a wide range of conditions. We apply the method to a large-scale genomic data set of domesticated and wild silkworms sequenced at low coverage. We find that we can infer the fine-scale genetic structure of the sampled individuals, suggesting that employing this new method is useful for investigating the genetic relationships of populations sampled at low coverage.",
author = "Matteo Fumagalli and {Garrett Vieira}, {Filipe Jorge} and Korneliussen, {Thorfinn Sand} and Tyler Linderoth and Emilia Huerta-S{\'a}nchez and Anders Albrechtsen and Rasmus Nielsen",
year = "2013",
month = nov,
doi = "10.1534/genetics.113.154740",
language = "English",
volume = "195",
pages = "979--992",
journal = "Genetics",
issn = "1943-2631",
publisher = "The Genetics Society of America (GSA)",
number = "3",

}

RIS

TY - JOUR

T1 - Quantifying population genetic differentiation from next-generation sequencing data

AU - Fumagalli, Matteo

AU - Garrett Vieira, Filipe Jorge

AU - Korneliussen, Thorfinn Sand

AU - Linderoth, Tyler

AU - Huerta-Sánchez, Emilia

AU - Albrechtsen, Anders

AU - Nielsen, Rasmus

PY - 2013/11

Y1 - 2013/11

N2 - Over the last few years, new high-throughput DNA sequencing technologies have dramatically increased speed and reduced sequencing costs. However, the use of these sequencing technologies is often challenged by errors and biases associated with the bioinformatical methods used for analyzing the data. In particular, the use of naïve methods to identify polymorphic sites and infer genotypes can inflate downstream analyses. Recently, explicit modeling of genotype probability distributions has been proposed as a method for taking genotype call uncertainty into account. Based on this idea, we propose a novel method for quantifying population genetic differentiation from next-generation sequencing data. In addition, we present a strategy to investigate population structure via Principal Components Analysis. Through extensive simulations, we compare the new method herein proposed to approaches based on genotype calling and demonstrate a marked improvement in estimation accuracy for a wide range of conditions. We apply the method to a large-scale genomic data set of domesticated and wild silkworms sequenced at low coverage. We find that we can infer the fine-scale genetic structure of the sampled individuals, suggesting that employing this new method is useful for investigating the genetic relationships of populations sampled at low coverage.

AB - Over the last few years, new high-throughput DNA sequencing technologies have dramatically increased speed and reduced sequencing costs. However, the use of these sequencing technologies is often challenged by errors and biases associated with the bioinformatical methods used for analyzing the data. In particular, the use of naïve methods to identify polymorphic sites and infer genotypes can inflate downstream analyses. Recently, explicit modeling of genotype probability distributions has been proposed as a method for taking genotype call uncertainty into account. Based on this idea, we propose a novel method for quantifying population genetic differentiation from next-generation sequencing data. In addition, we present a strategy to investigate population structure via Principal Components Analysis. Through extensive simulations, we compare the new method herein proposed to approaches based on genotype calling and demonstrate a marked improvement in estimation accuracy for a wide range of conditions. We apply the method to a large-scale genomic data set of domesticated and wild silkworms sequenced at low coverage. We find that we can infer the fine-scale genetic structure of the sampled individuals, suggesting that employing this new method is useful for investigating the genetic relationships of populations sampled at low coverage.

U2 - 10.1534/genetics.113.154740

DO - 10.1534/genetics.113.154740

M3 - Journal article

VL - 195

SP - 979

EP - 992

JO - Genetics

JF - Genetics

SN - 1943-2631

IS - 3

ER -

ID: 51121152