Digesting the data: Proper validation in ancient metagenomic studies is essential

Research output: Working paperPreprintResearch

Standard

Digesting the data : Proper validation in ancient metagenomic studies is essential. / Pach, Aleksandra Laura; Lanigan, Liam T.; Niemann, Jonas; Pedersen, Mikkel Winther; Schroeder, Hannes.

bioRxiv, 2024.

Research output: Working paperPreprintResearch

Harvard

Pach, AL, Lanigan, LT, Niemann, J, Pedersen, MW & Schroeder, H 2024 'Digesting the data: Proper validation in ancient metagenomic studies is essential' bioRxiv. https://doi.org/10.1101/2024.02.27.581519

APA

Pach, A. L., Lanigan, L. T., Niemann, J., Pedersen, M. W., & Schroeder, H. (2024). Digesting the data: Proper validation in ancient metagenomic studies is essential. bioRxiv. https://doi.org/10.1101/2024.02.27.581519

Vancouver

Pach AL, Lanigan LT, Niemann J, Pedersen MW, Schroeder H. Digesting the data: Proper validation in ancient metagenomic studies is essential. bioRxiv. 2024. https://doi.org/10.1101/2024.02.27.581519

Author

Pach, Aleksandra Laura ; Lanigan, Liam T. ; Niemann, Jonas ; Pedersen, Mikkel Winther ; Schroeder, Hannes. / Digesting the data : Proper validation in ancient metagenomic studies is essential. bioRxiv, 2024.

Bibtex

@techreport{4ab30e0b372d4e07a3271e67092adb53,
title = "Digesting the data: Proper validation in ancient metagenomic studies is essential",
abstract = "In a recent publication in this journal, Reynoso-Garc{\'i}a et al. [1] used shotgun sequencing to analyze human coprolites (paleofeces) from two pre-Columbian contexts in Puerto Rico to reconstruct the diet of the island{\textquoteright}s Indigenous population before the arrival of Europeans. Based on the results, the team claim to have identified various edible New World plant species, including maize (Zea mays), sweet potato (Ipomoea batatas), chili pepper (Capsicum annuum), peanut (Arachis spp.), papaya (Carica papaya), and tomato (Solanum lycopersicum), as well as other cultivars such as cotton (Gossypium barbadense) and tobacco (Nicotiana sylvestris) [1]. Reynoso-Garc{\'i}a et al. [1] also claim to have identified edible fungi, including Ustilago spp., which according to the authors, further supports their findings and points to the consumption of huitlacoche or corn smut, a known delicacy in Mexico today that is believed to have originated in Aztec times [2].Shotgun DNA sequencing of archaeological samples, such as dental calculus or coprolites, provides a powerful tool to reconstruct ancient microbial communities and to study the evolution of the human microbiome [e.g. 3,4–6]. In some instances, shotgun sequencing results can also provide insights into the diet and subsistence strategies of past communities [e.g. 7,8]. However, identifying DNA from dietary sources in complex shotgun metagenomic datasets is far from straightforward. As has been discussed previously [9,10], and as we demonstrate below, one of the main challenges is the risk of false positives. Any potential dietary signals should, therefore, be carefully assessed. Unfortunately, we feel that Reynoso-Garc{\'i}a et al.{\textquoteright}s study [1] falls short in that regard and we conclude that while it is entirely possible (and even likely, based on other evidence [e.g. 11,12]) that the Indigenous inhabitants of Puerto Rico subsisted on a diet that included some, or even all of the edible plant taxa the team identified, the DNA results they present do not, in and of themselves, support that claim.",
author = "Pach, {Aleksandra Laura} and Lanigan, {Liam T.} and Jonas Niemann and Pedersen, {Mikkel Winther} and Hannes Schroeder",
year = "2024",
doi = "10.1101/2024.02.27.581519",
language = "English",
publisher = "bioRxiv",
type = "WorkingPaper",
institution = "bioRxiv",

}

RIS

TY - UNPB

T1 - Digesting the data

T2 - Proper validation in ancient metagenomic studies is essential

AU - Pach, Aleksandra Laura

AU - Lanigan, Liam T.

AU - Niemann, Jonas

AU - Pedersen, Mikkel Winther

AU - Schroeder, Hannes

PY - 2024

Y1 - 2024

N2 - In a recent publication in this journal, Reynoso-García et al. [1] used shotgun sequencing to analyze human coprolites (paleofeces) from two pre-Columbian contexts in Puerto Rico to reconstruct the diet of the island’s Indigenous population before the arrival of Europeans. Based on the results, the team claim to have identified various edible New World plant species, including maize (Zea mays), sweet potato (Ipomoea batatas), chili pepper (Capsicum annuum), peanut (Arachis spp.), papaya (Carica papaya), and tomato (Solanum lycopersicum), as well as other cultivars such as cotton (Gossypium barbadense) and tobacco (Nicotiana sylvestris) [1]. Reynoso-García et al. [1] also claim to have identified edible fungi, including Ustilago spp., which according to the authors, further supports their findings and points to the consumption of huitlacoche or corn smut, a known delicacy in Mexico today that is believed to have originated in Aztec times [2].Shotgun DNA sequencing of archaeological samples, such as dental calculus or coprolites, provides a powerful tool to reconstruct ancient microbial communities and to study the evolution of the human microbiome [e.g. 3,4–6]. In some instances, shotgun sequencing results can also provide insights into the diet and subsistence strategies of past communities [e.g. 7,8]. However, identifying DNA from dietary sources in complex shotgun metagenomic datasets is far from straightforward. As has been discussed previously [9,10], and as we demonstrate below, one of the main challenges is the risk of false positives. Any potential dietary signals should, therefore, be carefully assessed. Unfortunately, we feel that Reynoso-García et al.’s study [1] falls short in that regard and we conclude that while it is entirely possible (and even likely, based on other evidence [e.g. 11,12]) that the Indigenous inhabitants of Puerto Rico subsisted on a diet that included some, or even all of the edible plant taxa the team identified, the DNA results they present do not, in and of themselves, support that claim.

AB - In a recent publication in this journal, Reynoso-García et al. [1] used shotgun sequencing to analyze human coprolites (paleofeces) from two pre-Columbian contexts in Puerto Rico to reconstruct the diet of the island’s Indigenous population before the arrival of Europeans. Based on the results, the team claim to have identified various edible New World plant species, including maize (Zea mays), sweet potato (Ipomoea batatas), chili pepper (Capsicum annuum), peanut (Arachis spp.), papaya (Carica papaya), and tomato (Solanum lycopersicum), as well as other cultivars such as cotton (Gossypium barbadense) and tobacco (Nicotiana sylvestris) [1]. Reynoso-García et al. [1] also claim to have identified edible fungi, including Ustilago spp., which according to the authors, further supports their findings and points to the consumption of huitlacoche or corn smut, a known delicacy in Mexico today that is believed to have originated in Aztec times [2].Shotgun DNA sequencing of archaeological samples, such as dental calculus or coprolites, provides a powerful tool to reconstruct ancient microbial communities and to study the evolution of the human microbiome [e.g. 3,4–6]. In some instances, shotgun sequencing results can also provide insights into the diet and subsistence strategies of past communities [e.g. 7,8]. However, identifying DNA from dietary sources in complex shotgun metagenomic datasets is far from straightforward. As has been discussed previously [9,10], and as we demonstrate below, one of the main challenges is the risk of false positives. Any potential dietary signals should, therefore, be carefully assessed. Unfortunately, we feel that Reynoso-García et al.’s study [1] falls short in that regard and we conclude that while it is entirely possible (and even likely, based on other evidence [e.g. 11,12]) that the Indigenous inhabitants of Puerto Rico subsisted on a diet that included some, or even all of the edible plant taxa the team identified, the DNA results they present do not, in and of themselves, support that claim.

U2 - 10.1101/2024.02.27.581519

DO - 10.1101/2024.02.27.581519

M3 - Preprint

BT - Digesting the data

PB - bioRxiv

ER -

ID: 384576339