Biodiversity Information Science and Standards : Conference Abstract
PDF
Conference Abstract
Want to Describe and Share Biodiversity Inventory and Monitoring Data? The Humboldt Extension for Ecological Inventories Can Help!
expand article infoYanina V Sica, Wesley M Hochachka§, Yi-Ming Gan|, Kate Ingenloff, Dmitry Schigel, Robert D. Stevenson#, Steven J. Baskauf¤, Peter Brenton«, Anahita J. N. Kazem», John Wieczorek˄
‡ Yale University, New Haven, United States of America
§ Lab of Ornithology, Cornell University, Ithaca, New York, United States of America
| Royal Belgian Institute of Natural Sciences, Brussels, Belgium
¶ GBIF, Copenhagen, Denmark
# University of Massachusetts, Boston, United States of America
¤ Vanderbilt University Libraries, Nashville, United States of America
« Atlas of Living Australia, Canberra, Australia
» German Centre for Integrative Biodiversity Research, Leipzig, Germany
˄ VertNet, Bariloche, Argentina
Open Access

Abstract

Access to high-quality ecological data is critical to assessing and modeling biodiversity and its changes through space and time. The Darwin Core standard has proven to be immensely helpful in sharing species occurrence data (see Wieczorek et al. 2012, Global Biodiversity Information Facility, GBIF) and promoting biodiversity research following the FAIR principles of findability, accessibility, interoperability and reusability (Wilkinson et al. 2016). However, it is limited in its ability to fully accommodate inventory data (i.e., linked records of multiple taxa at a specific place and time). Information about the inventory processes is often either unreported or described in an unstructured manner, limiting its potential re-use for larger-scale analyses. Two key aspects that are not captured in a structured manner yet are: i) information about the species that were not detected during an inventory, and ii) ancillary information about sampling effort and completeness.

Non-detections (i.e., reported counts of zero) potentially enable more accurate and precise estimates of distribution, abundance, and changes in abundance. This becomes possible when variation in effort is used to estimate the likelihood that a non-detection represents a true absence of that taxon during the inventory. Currently, ecological inventory data, when shared at all, are typically discoverable through dataset catalogs (e.g., governmental data repositories) and supplementary materials to publications. With few exceptions, indexing of such data with the detail and structure needed has not been attempted at broad temporal and spatial scales, despite the potentially high value resulting from making inventory data more readily accessible.

To address these limitations in documenting inventory data using the Darwin CoreGuralnick et al. (2018) proposed the Humboldt Core. Subsequent discussions within the biodiversity standards community made it clear that greater integration could be achieved by creating an extension of the Darwin Core, rather than developing a new standard in isolation. Extension design work began in 2021 and progress has been reported by Brenton (2021) and Sica et al. (2022)

Over the last year the Humboldt Extension Task Group has sought advice from data providers and aggregators and updated its vocabulary terms. A challenging aspect has been creating terminology for the parent-child relationships (see Properties of Hierarchical Events) needed to describe surveys that may be as simple as a collection of checklists (one level of hierarchy) or as complex as species records from traps within plots along transects across habitats over multiple years (at least four levels of hierarchy). The Task Group has committed to completing a User Guide for the Humboldt Extension. Group members who contributed to the Darwin Core (Darwin Core Task Group 2009) and the Vocabulary Maintenance Specification (Vocabulary Maintenance Specification Task Group 2017) have provided valuable expertise on term refinement and process.

Through ratification of the Humboldt Extension as a Darwin Core Event extension, we expect to provide the community with a usable solution, tied to well-established data publication mechanisms, for sharing and using inventory data. This effort promises to overcome a key bottleneck in the sharing of critically important ecological data, enhancing data discoverability, interoperability and re-use while lowering reporting burden and data and metadata heterogeneity. Global data aggregation initiatives, such as GBIF, will benefit from this development as they develop their data models and the range of standards and extensions they support. We anticipate that the Humboldt Extension will be attractive both to data publishers and data users, by facilitating the representation and indexing of data in richer, more meaningful ways. Despite the data-intensive nature of fundamental ecological research and applied monitoring for management and policy, ecological data have remained as one of the FAIR data frontiers. We anticipate that the Humboldt Extension will address most data exchange needs of all professional communities involved.

Keywords

Darwin Core, species surveys, sampling, standards

Presenting author

Yanina V. Sica

Presented at

TDWG 2023

Acknowledgements

The advances in the Humboldt Extension for ecological inventories would not have been possible without the collaborative work of all members of the TDWG Humboldt Extension Task Group.

Conflicts of interest

The authors have declared that no competing interests exist.

References

login to comment