Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals

Research output: Contribution to journalJournal articleResearchpeer-review

  • Ines Hellmann
  • Yuan Mang
  • Zhiping Gu
  • Peter Li
  • Francisco M de la Vega
  • Andrew G Clark
  • Nielsen, Rasmus
We introduce a simple, broadly applicable method for obtaining estimates of nucleotide diversity from genomic shotgun sequencing data. The method takes into account the special nature of these data: random sampling of genomic segments from one or more individuals and a relatively high error rate for individual reads. Applying this method to data from the Celera human genome sequencing and SNP discovery project, we obtain estimates of nucleotide diversity in windows spanning the human genome and show that the diversity to divergence ratio is reduced in regions of low recombination. Furthermore, we show that the elevated diversity in telomeric regions is mainly due to elevated mutation rates and not due to decreased levels of background selection. However, we find indications that telomeres as well as centromeres experience greater impact from natural selection than intrachromosomal regions. Finally, we identify a number of genomic regions with increased or reduced diversity compared with the local level of human-chimpanzee divergence and the local recombination rate.
Original languageEnglish
JournalGenome Research
Volume18
Issue number7
Pages (from-to)1020-9
Number of pages9
ISSN1088-9051
DOIs
Publication statusPublished - 2008

Bibliographical note

Keywords: Animals; Genetic Variation; Genetics, Population; Genome, Human; Humans; Likelihood Functions; Models, Genetic; Pan troglodytes; Polymorphism, Single Nucleotide; Sequence Analysis, DNA

ID: 9855418