Biodiversity Information Science and Standards :
Conference Abstract
|
Corresponding author: Pieter Huybrechts (pieter.huybrechts@plantentuinmeise.be)
Received: 31 Aug 2022 | Published: 07 Sep 2022
© 2022 Pieter Huybrechts, Maarten Trekels, Quentin Groom
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Huybrechts P, Trekels M, Groom Q (2022) How Much of Biodiversity is Represented in Collections: A big data workflow of aggregated occurrence data. Biodiversity Information Science and Standards 6: e94279. https://doi.org/10.3897/biss.6.94279
|
Natural history collections play a pivotal role in taxonomy, which in turn supports all of biology, but particularly conservation and biodiversity policy. However, to provide this role, it is necessary to know what specimens are stored where, and how complete the collection is. The biodiversity held within collections globally remains uncertain, with an estimated 1.2 to 2.1 billion (109) specimens (
Data aggregators such as GBIF provide an ever-changing window into the contents of collections. We use non-parametric estimators that allow for the approximation of the number of classes in an incomplete set, such as the number of species within a collection, but also the proportion of biodiversity preserved on a national or continental level (for example within a taxonomic group, compared to the world or to a continent). Because the contents of data aggregators such as GBIF, are in constant flux, our workflow is made to be repeatable on the monthly snapshots that GBIF provides.
The results of the workflow expose data gaps in GBIF, namely that collections from some large geographical regions, such as Asia, are poorly represented, but also taxonomic gaps exist, such as several Coleoptera families where many more species are accepted in the backbone than are represented on GBIF. As more data are published to GBIF the estimates for these taxon groups and geographical regions will improve. The detection of data gaps within data aggregators such as GBIF, and the subsequent mobilisation of missing information remains a priority for both aggregators and researchers (
natural history collections, GBIF, extrapolation, data gap
Pieter Huybrechts
TDWG 2022
This work was facilitated by the Research Foundation – Flanders (FWO) as part of the Flemish contribution to the DiSSCo Research Infrastructure under grant n° I001721N