A computational framework to explore large-scale biosynthetic diversity

Research output: Contribution to journal, Journal article, Research, peer-review

Standard

A computational framework to explore large-scale biosynthetic diversity. / Navarro-Muñoz, Jorge C; Selem-Mojica, Nelly; Mullowney, Michael W; Kautsar, Satria A; Tryon, James H; Parkinson, Elizabeth I; De Los Santos, Emmanuel L C; Yeong, Marley; Cruz-Morales, Pablo; Abubucker, Sahar; Roeters, Arne; Lokhorst, Wouter; Fernandez-Guerra, Antonio; Cappelini, Luciana Teresa Dias; Goering, Anthony W; Thomson, Regan J; Metcalf, William W; Kelleher, Neil L; Barona-Gomez, Francisco; Medema, Marnix H.

In: Nature Chemical Biology, Vol. 16, No. 1, 01.2020, p. 60-68.

Harvard

Navarro-Muñoz, JC, Selem-Mojica, N, Mullowney, MW, Kautsar, SA, Tryon, JH, Parkinson, EI, De Los Santos, ELC, Yeong, M, Cruz-Morales, P, Abubucker, S, Roeters, A, Lokhorst, W, Fernandez-Guerra, A, Cappelini, LTD, Goering, AW, Thomson, RJ, Metcalf, WW, Kelleher, NL, Barona-Gomez, F & Medema, MH 2020, 'A computational framework to explore large-scale biosynthetic diversity', Nature Chemical Biology, vol. 16, no. 1, pp. 60-68. https://doi.org/10.1038/s41589-019-0400-9

APA

Navarro-Muñoz, J. C., Selem-Mojica, N., Mullowney, M. W., Kautsar, S. A., Tryon, J. H., Parkinson, E. I., De Los Santos, E. L. C., Yeong, M., Cruz-Morales, P., Abubucker, S., Roeters, A., Lokhorst, W., Fernandez-Guerra, A., Cappelini, L. T. D., Goering, A. W., Thomson, R. J., Metcalf, W. W., Kelleher, N. L., Barona-Gomez, F., & Medema, M. H. (2020). A computational framework to explore large-scale biosynthetic diversity. Nature Chemical Biology, 16(1), 60-68. https://doi.org/10.1038/s41589-019-0400-9

Vancouver

Navarro-Muñoz JC, Selem-Mojica N, Mullowney MW, Kautsar SA, Tryon JH, Parkinson EI et al. A computational framework to explore large-scale biosynthetic diversity. Nature Chemical Biology. 2020 Jan;16(1):60-68. https://doi.org/10.1038/s41589-019-0400-9

Author

Navarro-Muñoz, Jorge C ; Selem-Mojica, Nelly ; Mullowney, Michael W ; Kautsar, Satria A ; Tryon, James H ; Parkinson, Elizabeth I ; De Los Santos, Emmanuel L C ; Yeong, Marley ; Cruz-Morales, Pablo ; Abubucker, Sahar ; Roeters, Arne ; Lokhorst, Wouter ; Fernandez-Guerra, Antonio ; Cappelini, Luciana Teresa Dias ; Goering, Anthony W ; Thomson, Regan J ; Metcalf, William W ; Kelleher, Neil L ; Barona-Gomez, Francisco ; Medema, Marnix H. / A computational framework to explore large-scale biosynthetic diversity. In: Nature Chemical Biology. 2020 ; Vol. 16, No. 1. pp. 60-68.

Bibtex

@article{9a48ebc69b37453d8fa77fdf33012ebb,
title = "A computational framework to explore large-scale biosynthetic diversity",
abstract = "Genome mining has become a key technology to exploit natural product diversity. Although initially performed on a single-genome basis, the process is now being scaled up to mine entire genera, strain collections and microbiomes. However, no bioinformatic framework is currently available for effectively analyzing datasets of this size and complexity. In the present study, a streamlined computational workflow is provided, consisting of two new software tools: the 'biosynthetic gene similarity clustering and prospecting engine' (BiG-SCAPE), which facilitates fast and interactive sequence similarity network analysis of biosynthetic gene clusters and gene cluster families; and the 'core analysis of syntenic orthologues to prioritize natural product gene clusters' (CORASON), which elucidates phylogenetic relationships within and across these families. BiG-SCAPE is validated by correlating its output to metabolomic data across 363 actinobacterial strains and the discovery potential of CORASON is demonstrated by comprehensively mapping biosynthetic diversity across a range of detoxin/rimosamide-related gene cluster families, culminating in the characterization of seven detoxin analogues.",
author = "Navarro-Mu{\~n}oz, {Jorge C} and Nelly Selem-Mojica and Mullowney, {Michael W} and Kautsar, {Satria A} and Tryon, {James H} and Parkinson, {Elizabeth I} and {De Los Santos}, {Emmanuel L C} and Marley Yeong and Pablo Cruz-Morales and Sahar Abubucker and Arne Roeters and Wouter Lokhorst and Antonio Fernandez-Guerra and Cappelini, {Luciana Teresa Dias} and Goering, {Anthony W} and Thomson, {Regan J} and Metcalf, {William W} and Kelleher, {Neil L} and Francisco Barona-Gomez and Medema, {Marnix H}",
year = "2020",
month = jan,
doi = "10.1038/s41589-019-0400-9",
language = "English",
volume = "16",
pages = "60--68",
journal = "Nature Chemical Biology",
issn = "1552-4450",
publisher = "Nature Publishing Group",
number = "1",
}

RIS

TY - JOUR

T1 - A computational framework to explore large-scale biosynthetic diversity

AU - Navarro-Muñoz, Jorge C

AU - Selem-Mojica, Nelly

AU - Mullowney, Michael W

AU - Kautsar, Satria A

AU - Tryon, James H

AU - Parkinson, Elizabeth I

AU - De Los Santos, Emmanuel L C

AU - Yeong, Marley

AU - Cruz-Morales, Pablo

AU - Abubucker, Sahar

AU - Roeters, Arne

AU - Lokhorst, Wouter

AU - Fernandez-Guerra, Antonio

AU - Cappelini, Luciana Teresa Dias

AU - Goering, Anthony W

AU - Thomson, Regan J

AU - Metcalf, William W

AU - Kelleher, Neil L

AU - Barona-Gomez, Francisco

AU - Medema, Marnix H

PY - 2020/1

Y1 - 2020/1

N2 - Genome mining has become a key technology to exploit natural product diversity. Although initially performed on a single-genome basis, the process is now being scaled up to mine entire genera, strain collections and microbiomes. However, no bioinformatic framework is currently available for effectively analyzing datasets of this size and complexity. In the present study, a streamlined computational workflow is provided, consisting of two new software tools: the 'biosynthetic gene similarity clustering and prospecting engine' (BiG-SCAPE), which facilitates fast and interactive sequence similarity network analysis of biosynthetic gene clusters and gene cluster families; and the 'core analysis of syntenic orthologues to prioritize natural product gene clusters' (CORASON), which elucidates phylogenetic relationships within and across these families. BiG-SCAPE is validated by correlating its output to metabolomic data across 363 actinobacterial strains and the discovery potential of CORASON is demonstrated by comprehensively mapping biosynthetic diversity across a range of detoxin/rimosamide-related gene cluster families, culminating in the characterization of seven detoxin analogues.

AB - Genome mining has become a key technology to exploit natural product diversity. Although initially performed on a single-genome basis, the process is now being scaled up to mine entire genera, strain collections and microbiomes. However, no bioinformatic framework is currently available for effectively analyzing datasets of this size and complexity. In the present study, a streamlined computational workflow is provided, consisting of two new software tools: the 'biosynthetic gene similarity clustering and prospecting engine' (BiG-SCAPE), which facilitates fast and interactive sequence similarity network analysis of biosynthetic gene clusters and gene cluster families; and the 'core analysis of syntenic orthologues to prioritize natural product gene clusters' (CORASON), which elucidates phylogenetic relationships within and across these families. BiG-SCAPE is validated by correlating its output to metabolomic data across 363 actinobacterial strains and the discovery potential of CORASON is demonstrated by comprehensively mapping biosynthetic diversity across a range of detoxin/rimosamide-related gene cluster families, culminating in the characterization of seven detoxin analogues.

U2 - 10.1038/s41589-019-0400-9

DO - 10.1038/s41589-019-0400-9

M3 - Journal article

C2 - 31768033

VL - 16

SP - 60

EP - 68

JO - Nature Chemical Biology

JF - Nature Chemical Biology

SN - 1552-4450

IS - 1

ER -
