Error-prone polymerase activity causes multinucleotide mutations in humans

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Error-prone polymerase activity causes multinucleotide mutations in humans. / Harris, Kelley; Nielsen, Rasmus.

In: Genome Research, Vol. 24, No. 9, 01.09.2014, p. 1445-1454.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Harris, K & Nielsen, R 2014, 'Error-prone polymerase activity causes multinucleotide mutations in humans', Genome Research, vol. 24, no. 9, pp. 1445-1454. https://doi.org/10.1101/gr.170696.113

APA

Harris, K., & Nielsen, R. (2014). Error-prone polymerase activity causes multinucleotide mutations in humans. Genome Research, 24(9), 1445-1454. https://doi.org/10.1101/gr.170696.113

Vancouver

Harris K, Nielsen R. Error-prone polymerase activity causes multinucleotide mutations in humans. Genome Research. 2014 Sep 1;24(9):1445-1454. https://doi.org/10.1101/gr.170696.113

Author

Harris, Kelley ; Nielsen, Rasmus. / Error-prone polymerase activity causes multinucleotide mutations in humans. In: Genome Research. 2014 ; Vol. 24, No. 9. pp. 1445-1454.

Bibtex

@article{765d6f42083a4b67a2138bae30134250,
title = "Error-prone polymerase activity causes multinucleotide mutations in humans",
abstract = "About 2% of human genetic polymorphisms have been hypothesized to arise via multinucleotide mutations (MNMs), complex events that generate SNPs at multiple sites in a single generation. MNMs have the potential to accelerate the pace at which single genes evolve and to confound studies of demography and selection that assume all SNPs arise independently. In this paper, we examine clustered mutations that are segregating in a set of 1092 human genomes, demonstrating that the signature ofMNMbecomes enriched as large numbers of individuals are sampled. We estimate the percentage of linked SNP pairs that were generated by simultaneous mutation as a function of the distance between affected sites and show that MNMs exhibit a high percentage of transversions relative to transitions, findings that are reproducible in data from multiple sequencing platforms and cannot be attributed to sequencing error. Among tandem mutations that occur simultaneously at adjacent sites, we find an especially skewed distribution of ancestral and derived alleles, with GC→AA, GA→TT, and their reverse complements making up 27% of the total. These mutations have been previously shown to dominate the spectrum of the error-prone polymerase Pol ζ, suggesting that low-fidelity DNA replication by Pol ζ is at least partly responsible for the MNMs that are segregating in the human population. We develop statistical estimates of MNM prevalence that can be used to correct phylogenetic and population genetic inferences for the presence of complex mutations.",
author = "Kelley Harris and Rasmus Nielsen",
year = "2014",
month = sep,
day = "1",
doi = "10.1101/gr.170696.113",
language = "English",
volume = "24",
pages = "1445--1454",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "9",

}

RIS

TY - JOUR

T1 - Error-prone polymerase activity causes multinucleotide mutations in humans

AU - Harris, Kelley

AU - Nielsen, Rasmus

PY - 2014/9/1

Y1 - 2014/9/1

N2 - About 2% of human genetic polymorphisms have been hypothesized to arise via multinucleotide mutations (MNMs), complex events that generate SNPs at multiple sites in a single generation. MNMs have the potential to accelerate the pace at which single genes evolve and to confound studies of demography and selection that assume all SNPs arise independently. In this paper, we examine clustered mutations that are segregating in a set of 1092 human genomes, demonstrating that the signature ofMNMbecomes enriched as large numbers of individuals are sampled. We estimate the percentage of linked SNP pairs that were generated by simultaneous mutation as a function of the distance between affected sites and show that MNMs exhibit a high percentage of transversions relative to transitions, findings that are reproducible in data from multiple sequencing platforms and cannot be attributed to sequencing error. Among tandem mutations that occur simultaneously at adjacent sites, we find an especially skewed distribution of ancestral and derived alleles, with GC→AA, GA→TT, and their reverse complements making up 27% of the total. These mutations have been previously shown to dominate the spectrum of the error-prone polymerase Pol ζ, suggesting that low-fidelity DNA replication by Pol ζ is at least partly responsible for the MNMs that are segregating in the human population. We develop statistical estimates of MNM prevalence that can be used to correct phylogenetic and population genetic inferences for the presence of complex mutations.

AB - About 2% of human genetic polymorphisms have been hypothesized to arise via multinucleotide mutations (MNMs), complex events that generate SNPs at multiple sites in a single generation. MNMs have the potential to accelerate the pace at which single genes evolve and to confound studies of demography and selection that assume all SNPs arise independently. In this paper, we examine clustered mutations that are segregating in a set of 1092 human genomes, demonstrating that the signature ofMNMbecomes enriched as large numbers of individuals are sampled. We estimate the percentage of linked SNP pairs that were generated by simultaneous mutation as a function of the distance between affected sites and show that MNMs exhibit a high percentage of transversions relative to transitions, findings that are reproducible in data from multiple sequencing platforms and cannot be attributed to sequencing error. Among tandem mutations that occur simultaneously at adjacent sites, we find an especially skewed distribution of ancestral and derived alleles, with GC→AA, GA→TT, and their reverse complements making up 27% of the total. These mutations have been previously shown to dominate the spectrum of the error-prone polymerase Pol ζ, suggesting that low-fidelity DNA replication by Pol ζ is at least partly responsible for the MNMs that are segregating in the human population. We develop statistical estimates of MNM prevalence that can be used to correct phylogenetic and population genetic inferences for the presence of complex mutations.

UR - http://www.scopus.com/inward/record.url?scp=84907225774&partnerID=8YFLogxK

U2 - 10.1101/gr.170696.113

DO - 10.1101/gr.170696.113

M3 - Journal article

C2 - 25079859

AN - SCOPUS:84907225774

VL - 24

SP - 1445

EP - 1454

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 9

ER -

ID: 222642600