A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base

Research output: Contribution to journalReviewResearchpeer-review

Standard

A review of the heterogeneous landscape of biodiversity databases : Opportunities and challenges for a synthesized biodiversity knowledge base. / Feng, Xiao; Enquist, Brian J.; Park, Daniel S.; Boyle, Brad; Breshears, David D.; Gallagher, Rachael V.; Lien, Aaron; Newman, Erica A.; Burger, Joseph R.; Maitner, Brian S.; Merow, Cory; Li, Yaoqi; Huynh, Kimberly M.; Ernst, Kacey; Baldwin, Elizabeth; Foden, Wendy; Hannah, Lee; Jørgensen, Peter M.; Kraft, Nathan J. B.; Lovett, Jon C.; Marquet, Pablo A.; McGill, Brian J.; Morueta-Holme, Naia; Neves, Danilo M.; Núñez-Regueiro, Mauricio M.; Oliveira-Filho, Ary T.; Peet, Robert K.; Pillet, Michiel; Roehrdanz, Patrick R.; Sandel, Brody; Serra-Diaz, Josep M.; Šímová, Irena; Svenning, Jens-Christian; Violle, Cyrille; Weitemier, Trang D.; Wiser, Susan; López-Hoffman, Laura.

In: Global Ecology and Biogeography, Vol. 31, No. 7, 2022, p. 1242-1260.

Research output: Contribution to journalReviewResearchpeer-review

Harvard

Feng, X, Enquist, BJ, Park, DS, Boyle, B, Breshears, DD, Gallagher, RV, Lien, A, Newman, EA, Burger, JR, Maitner, BS, Merow, C, Li, Y, Huynh, KM, Ernst, K, Baldwin, E, Foden, W, Hannah, L, Jørgensen, PM, Kraft, NJB, Lovett, JC, Marquet, PA, McGill, BJ, Morueta-Holme, N, Neves, DM, Núñez-Regueiro, MM, Oliveira-Filho, AT, Peet, RK, Pillet, M, Roehrdanz, PR, Sandel, B, Serra-Diaz, JM, Šímová, I, Svenning, J-C, Violle, C, Weitemier, TD, Wiser, S & López-Hoffman, L 2022, 'A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base', Global Ecology and Biogeography, vol. 31, no. 7, pp. 1242-1260. https://doi.org/10.1111/geb.13497

APA

Feng, X., Enquist, B. J., Park, D. S., Boyle, B., Breshears, D. D., Gallagher, R. V., Lien, A., Newman, E. A., Burger, J. R., Maitner, B. S., Merow, C., Li, Y., Huynh, K. M., Ernst, K., Baldwin, E., Foden, W., Hannah, L., Jørgensen, P. M., Kraft, N. J. B., ... López-Hoffman, L. (2022). A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base. Global Ecology and Biogeography, 31(7), 1242-1260. https://doi.org/10.1111/geb.13497

Vancouver

Feng X, Enquist BJ, Park DS, Boyle B, Breshears DD, Gallagher RV et al. A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base. Global Ecology and Biogeography. 2022;31(7):1242-1260. https://doi.org/10.1111/geb.13497

Author

Feng, Xiao ; Enquist, Brian J. ; Park, Daniel S. ; Boyle, Brad ; Breshears, David D. ; Gallagher, Rachael V. ; Lien, Aaron ; Newman, Erica A. ; Burger, Joseph R. ; Maitner, Brian S. ; Merow, Cory ; Li, Yaoqi ; Huynh, Kimberly M. ; Ernst, Kacey ; Baldwin, Elizabeth ; Foden, Wendy ; Hannah, Lee ; Jørgensen, Peter M. ; Kraft, Nathan J. B. ; Lovett, Jon C. ; Marquet, Pablo A. ; McGill, Brian J. ; Morueta-Holme, Naia ; Neves, Danilo M. ; Núñez-Regueiro, Mauricio M. ; Oliveira-Filho, Ary T. ; Peet, Robert K. ; Pillet, Michiel ; Roehrdanz, Patrick R. ; Sandel, Brody ; Serra-Diaz, Josep M. ; Šímová, Irena ; Svenning, Jens-Christian ; Violle, Cyrille ; Weitemier, Trang D. ; Wiser, Susan ; López-Hoffman, Laura. / A review of the heterogeneous landscape of biodiversity databases : Opportunities and challenges for a synthesized biodiversity knowledge base. In: Global Ecology and Biogeography. 2022 ; Vol. 31, No. 7. pp. 1242-1260.

Bibtex

@article{470d14c81fd24971b169ae009add63a3,
title = "A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base",
abstract = "Aim: Addressing global environmental challenges requires access to biodiversity data across wide spatial, temporal and taxonomic scales. Availability of such data has increased exponentially recently with the proliferation of biodiversity databases. However, heterogeneous coverage, protocols, and standards have hampered integration among these databases. To stimulate the next stage of data integration, here we present a synthesis of major databases, and investigate (a) how the coverage of databases varies across taxonomy, space, and record type; (b) what degree of integration is present among databases; (c) how integration of databases can increase biodiversity knowledge; and (d) the barriers to database integration. Location: Global. Time period: Contemporary. Major taxa studied: Plants and vertebrates. Methods: We reviewed 12 established biodiversity databases that mainly focus on geographic distributions and functional traits at global scale. We synthesized information from these databases to assess the status of their integration and major knowledge gaps and barriers to full integration. We estimated how improved integration can increase the data coverage for terrestrial plants and vertebrates. Results: Every database reviewed had a unique focus of data coverage. Exchanges of biodiversity information were common among databases, although not always clearly documented. Functional trait databases were more isolated than those pertaining to species distributions. Variation and potential incompatibility of taxonomic systems used by different databases posed a major barrier to data integration. We found that integration of distribution databases could lead to increased taxonomic coverage that corresponds to 23 years{\textquoteright} advancement in data accumulation, and improvement in taxonomic coverage could be as high as 22.4% for trait databases. Main conclusions: Rapid increases in biodiversity knowledge can be achieved through the integration of databases, providing the data necessary to address critical environmental challenges. Full integration across databases will require tackling the major impediments to data integration: taxonomic incompatibility, lags in data exchange, barriers to effective data synchronization, and isolation of individual initiatives.",
keywords = "big data, biodiversity informatics, biogeography, database integration, functional trait, taxonomic system",
author = "Xiao Feng and Enquist, {Brian J.} and Park, {Daniel S.} and Brad Boyle and Breshears, {David D.} and Gallagher, {Rachael V.} and Aaron Lien and Newman, {Erica A.} and Burger, {Joseph R.} and Maitner, {Brian S.} and Cory Merow and Yaoqi Li and Huynh, {Kimberly M.} and Kacey Ernst and Elizabeth Baldwin and Wendy Foden and Lee Hannah and J{\o}rgensen, {Peter M.} and Kraft, {Nathan J. B.} and Lovett, {Jon C.} and Marquet, {Pablo A.} and McGill, {Brian J.} and Naia Morueta-Holme and Neves, {Danilo M.} and N{\'u}{\~n}ez-Regueiro, {Mauricio M.} and Oliveira-Filho, {Ary T.} and Peet, {Robert K.} and Michiel Pillet and Roehrdanz, {Patrick R.} and Brody Sandel and Serra-Diaz, {Josep M.} and Irena {\v S}{\'i}mov{\'a} and Jens-Christian Svenning and Cyrille Violle and Weitemier, {Trang D.} and Susan Wiser and Laura L{\'o}pez-Hoffman",
note = "Publisher Copyright: {\textcopyright} 2022 John Wiley & Sons Ltd.",
year = "2022",
doi = "10.1111/geb.13497",
language = "English",
volume = "31",
pages = "1242--1260",
journal = "Global Ecology and Biogeography",
issn = "1466-822X",
publisher = "Wiley-Blackwell",
number = "7",

}

RIS

TY - JOUR

T1 - A review of the heterogeneous landscape of biodiversity databases

T2 - Opportunities and challenges for a synthesized biodiversity knowledge base

AU - Feng, Xiao

AU - Enquist, Brian J.

AU - Park, Daniel S.

AU - Boyle, Brad

AU - Breshears, David D.

AU - Gallagher, Rachael V.

AU - Lien, Aaron

AU - Newman, Erica A.

AU - Burger, Joseph R.

AU - Maitner, Brian S.

AU - Merow, Cory

AU - Li, Yaoqi

AU - Huynh, Kimberly M.

AU - Ernst, Kacey

AU - Baldwin, Elizabeth

AU - Foden, Wendy

AU - Hannah, Lee

AU - Jørgensen, Peter M.

AU - Kraft, Nathan J. B.

AU - Lovett, Jon C.

AU - Marquet, Pablo A.

AU - McGill, Brian J.

AU - Morueta-Holme, Naia

AU - Neves, Danilo M.

AU - Núñez-Regueiro, Mauricio M.

AU - Oliveira-Filho, Ary T.

AU - Peet, Robert K.

AU - Pillet, Michiel

AU - Roehrdanz, Patrick R.

AU - Sandel, Brody

AU - Serra-Diaz, Josep M.

AU - Šímová, Irena

AU - Svenning, Jens-Christian

AU - Violle, Cyrille

AU - Weitemier, Trang D.

AU - Wiser, Susan

AU - López-Hoffman, Laura

N1 - Publisher Copyright: © 2022 John Wiley & Sons Ltd.

PY - 2022

Y1 - 2022

N2 - Aim: Addressing global environmental challenges requires access to biodiversity data across wide spatial, temporal and taxonomic scales. Availability of such data has increased exponentially recently with the proliferation of biodiversity databases. However, heterogeneous coverage, protocols, and standards have hampered integration among these databases. To stimulate the next stage of data integration, here we present a synthesis of major databases, and investigate (a) how the coverage of databases varies across taxonomy, space, and record type; (b) what degree of integration is present among databases; (c) how integration of databases can increase biodiversity knowledge; and (d) the barriers to database integration. Location: Global. Time period: Contemporary. Major taxa studied: Plants and vertebrates. Methods: We reviewed 12 established biodiversity databases that mainly focus on geographic distributions and functional traits at global scale. We synthesized information from these databases to assess the status of their integration and major knowledge gaps and barriers to full integration. We estimated how improved integration can increase the data coverage for terrestrial plants and vertebrates. Results: Every database reviewed had a unique focus of data coverage. Exchanges of biodiversity information were common among databases, although not always clearly documented. Functional trait databases were more isolated than those pertaining to species distributions. Variation and potential incompatibility of taxonomic systems used by different databases posed a major barrier to data integration. We found that integration of distribution databases could lead to increased taxonomic coverage that corresponds to 23 years’ advancement in data accumulation, and improvement in taxonomic coverage could be as high as 22.4% for trait databases. Main conclusions: Rapid increases in biodiversity knowledge can be achieved through the integration of databases, providing the data necessary to address critical environmental challenges. Full integration across databases will require tackling the major impediments to data integration: taxonomic incompatibility, lags in data exchange, barriers to effective data synchronization, and isolation of individual initiatives.

AB - Aim: Addressing global environmental challenges requires access to biodiversity data across wide spatial, temporal and taxonomic scales. Availability of such data has increased exponentially recently with the proliferation of biodiversity databases. However, heterogeneous coverage, protocols, and standards have hampered integration among these databases. To stimulate the next stage of data integration, here we present a synthesis of major databases, and investigate (a) how the coverage of databases varies across taxonomy, space, and record type; (b) what degree of integration is present among databases; (c) how integration of databases can increase biodiversity knowledge; and (d) the barriers to database integration. Location: Global. Time period: Contemporary. Major taxa studied: Plants and vertebrates. Methods: We reviewed 12 established biodiversity databases that mainly focus on geographic distributions and functional traits at global scale. We synthesized information from these databases to assess the status of their integration and major knowledge gaps and barriers to full integration. We estimated how improved integration can increase the data coverage for terrestrial plants and vertebrates. Results: Every database reviewed had a unique focus of data coverage. Exchanges of biodiversity information were common among databases, although not always clearly documented. Functional trait databases were more isolated than those pertaining to species distributions. Variation and potential incompatibility of taxonomic systems used by different databases posed a major barrier to data integration. We found that integration of distribution databases could lead to increased taxonomic coverage that corresponds to 23 years’ advancement in data accumulation, and improvement in taxonomic coverage could be as high as 22.4% for trait databases. Main conclusions: Rapid increases in biodiversity knowledge can be achieved through the integration of databases, providing the data necessary to address critical environmental challenges. Full integration across databases will require tackling the major impediments to data integration: taxonomic incompatibility, lags in data exchange, barriers to effective data synchronization, and isolation of individual initiatives.

KW - big data

KW - biodiversity informatics

KW - biogeography

KW - database integration

KW - functional trait

KW - taxonomic system

U2 - 10.1111/geb.13497

DO - 10.1111/geb.13497

M3 - Review

AN - SCOPUS:85128256664

VL - 31

SP - 1242

EP - 1260

JO - Global Ecology and Biogeography

JF - Global Ecology and Biogeography

SN - 1466-822X

IS - 7

ER -

ID: 305111185