Mining metagenomes for natural product biosynthetic gene clusters: unlocking new potential with ultrafast techniques

Research output: Working paperPreprintResearch

Standard

Mining metagenomes for natural product biosynthetic gene clusters : unlocking new potential with ultrafast techniques. / Pereira-Flores, Emiliano; Medema, Marnix; Buttigieg, Pier Luigi; Meinicke, Peter; Glöckner, Frank Oliver; Fernández-Guerra, Antonio.

2021.

Research output: Working paperPreprintResearch

Harvard

Pereira-Flores, E, Medema, M, Buttigieg, PL, Meinicke, P, Glöckner, FO & Fernández-Guerra, A 2021 'Mining metagenomes for natural product biosynthetic gene clusters: unlocking new potential with ultrafast techniques'. https://doi.org/10.1101/2021.01.20.427441

APA

Pereira-Flores, E., Medema, M., Buttigieg, P. L., Meinicke, P., Glöckner, F. O., & Fernández-Guerra, A. (2021). Mining metagenomes for natural product biosynthetic gene clusters: unlocking new potential with ultrafast techniques. bioRxiv https://doi.org/10.1101/2021.01.20.427441

Vancouver

Pereira-Flores E, Medema M, Buttigieg PL, Meinicke P, Glöckner FO, Fernández-Guerra A. Mining metagenomes for natural product biosynthetic gene clusters: unlocking new potential with ultrafast techniques. 2021. https://doi.org/10.1101/2021.01.20.427441

Author

Pereira-Flores, Emiliano ; Medema, Marnix ; Buttigieg, Pier Luigi ; Meinicke, Peter ; Glöckner, Frank Oliver ; Fernández-Guerra, Antonio. / Mining metagenomes for natural product biosynthetic gene clusters : unlocking new potential with ultrafast techniques. 2021. (bioRxiv).

Bibtex

@techreport{814a56ce89194082bb12c6e0699edb48,
title = "Mining metagenomes for natural product biosynthetic gene clusters: unlocking new potential with ultrafast techniques",
abstract = "Microorganisms produce an immense variety of natural products through the expression of Biosynthetic Gene Clusters (BGCs): physically clustered genes that encode the enzymes of a specialized metabolic pathway. These natural products cover a wide range of chemical classes (e.g., aminoglycosides, lantibiotics, nonribosomal peptides, oligosaccharides, polyketides, terpenes) that are highly valuable for industrial and medical applications1. Metagenomics, as a culture-independent approach, has greatly enhanced our ability to survey the functional potential of microorganisms and is growing in popularity for the mining of BGCs. However, to effectively exploit metagenomic data to this end, it will be crucial to more efficiently identify these genomic elements in highly complex and ever-increasing volumes of data2. Here, we address this challenge by developing the ultrafast Biosynthetic Gene cluster MEtagenomic eXploration toolbox (BiG-MEx). BiG-MEx rapidly identifies a broad range of BGC protein domains, assess their diversity and novelty, and predicts the abundance profile of natural product BGC classes in metagenomic data. We show the advantages of BiG-MEx compared to standard BGC-mining approaches, and use it to explore the BGC domain and class composition of samples in the TARA Oceans3 and Human Microbiome Project datasets4. In these analyses, we demonstrate BiG-MEx{\textquoteright}s applicability to study the distribution, diversity, and ecological roles of BGCs in metagenomic data, and guide the exploration of natural products with clinical applications.Competing Interest StatementThe authors have declared no competing interest.",
author = "Emiliano Pereira-Flores and Marnix Medema and Buttigieg, {Pier Luigi} and Peter Meinicke and Gl{\"o}ckner, {Frank Oliver} and Antonio Fern{\'a}ndez-Guerra",
year = "2021",
doi = "10.1101/2021.01.20.427441",
language = "English",
series = "bioRxiv",
publisher = "Cold Spring Harbor Laboratory",
type = "WorkingPaper",
institution = "Cold Spring Harbor Laboratory",

}

RIS

TY - UNPB

T1 - Mining metagenomes for natural product biosynthetic gene clusters

T2 - unlocking new potential with ultrafast techniques

AU - Pereira-Flores, Emiliano

AU - Medema, Marnix

AU - Buttigieg, Pier Luigi

AU - Meinicke, Peter

AU - Glöckner, Frank Oliver

AU - Fernández-Guerra, Antonio

PY - 2021

Y1 - 2021

N2 - Microorganisms produce an immense variety of natural products through the expression of Biosynthetic Gene Clusters (BGCs): physically clustered genes that encode the enzymes of a specialized metabolic pathway. These natural products cover a wide range of chemical classes (e.g., aminoglycosides, lantibiotics, nonribosomal peptides, oligosaccharides, polyketides, terpenes) that are highly valuable for industrial and medical applications1. Metagenomics, as a culture-independent approach, has greatly enhanced our ability to survey the functional potential of microorganisms and is growing in popularity for the mining of BGCs. However, to effectively exploit metagenomic data to this end, it will be crucial to more efficiently identify these genomic elements in highly complex and ever-increasing volumes of data2. Here, we address this challenge by developing the ultrafast Biosynthetic Gene cluster MEtagenomic eXploration toolbox (BiG-MEx). BiG-MEx rapidly identifies a broad range of BGC protein domains, assess their diversity and novelty, and predicts the abundance profile of natural product BGC classes in metagenomic data. We show the advantages of BiG-MEx compared to standard BGC-mining approaches, and use it to explore the BGC domain and class composition of samples in the TARA Oceans3 and Human Microbiome Project datasets4. In these analyses, we demonstrate BiG-MEx’s applicability to study the distribution, diversity, and ecological roles of BGCs in metagenomic data, and guide the exploration of natural products with clinical applications.Competing Interest StatementThe authors have declared no competing interest.

AB - Microorganisms produce an immense variety of natural products through the expression of Biosynthetic Gene Clusters (BGCs): physically clustered genes that encode the enzymes of a specialized metabolic pathway. These natural products cover a wide range of chemical classes (e.g., aminoglycosides, lantibiotics, nonribosomal peptides, oligosaccharides, polyketides, terpenes) that are highly valuable for industrial and medical applications1. Metagenomics, as a culture-independent approach, has greatly enhanced our ability to survey the functional potential of microorganisms and is growing in popularity for the mining of BGCs. However, to effectively exploit metagenomic data to this end, it will be crucial to more efficiently identify these genomic elements in highly complex and ever-increasing volumes of data2. Here, we address this challenge by developing the ultrafast Biosynthetic Gene cluster MEtagenomic eXploration toolbox (BiG-MEx). BiG-MEx rapidly identifies a broad range of BGC protein domains, assess their diversity and novelty, and predicts the abundance profile of natural product BGC classes in metagenomic data. We show the advantages of BiG-MEx compared to standard BGC-mining approaches, and use it to explore the BGC domain and class composition of samples in the TARA Oceans3 and Human Microbiome Project datasets4. In these analyses, we demonstrate BiG-MEx’s applicability to study the distribution, diversity, and ecological roles of BGCs in metagenomic data, and guide the exploration of natural products with clinical applications.Competing Interest StatementThe authors have declared no competing interest.

U2 - 10.1101/2021.01.20.427441

DO - 10.1101/2021.01.20.427441

M3 - Preprint

T3 - bioRxiv

BT - Mining metagenomes for natural product biosynthetic gene clusters

ER -

ID: 306183036