Biodiversity Information Science and Standards : Conference Abstract
Print
Conference Abstract
Making LTER Data FAIR: A workbench using DEIMS datasets and GBIF Tools
expand article infoFrancisco Pando, Francisco Bonet§
‡ Real Jardin Botanico -CSIC, Madrid, Spain
§ Universidad de Córdoba, Córdoba, Spain
Open Access

Abstract

DEIMS-SDR (Dynamic Ecological Information Management System - Site and dataset registry, Wohner et al. 2019) is one of the largest repositories of long-term ecological research (LTER) datasets. It provides sophisticated searching tools by metadata elements and identifiers for all the 930 contained datasets, most of them from European sites. Whereas datasets' metadata are highly structured and searchable, datasets themselves have little standardization in terms of content, identifiers or license, making data integration difficult or cumbersome. Adopting the data FAIR guiding principles(Wilkinson et al. 2016) for LTER data would result in better data integration and reutilization to support knowledge discovery and innovation in ecological research.

The Global Biodiversity Information Facility (GBIF 2019a). is the largest repository of species distribution data in the world, providing access to more than a billion records from over 43,000 datasets. GBIF is a good example of FAIR principles implementation: GBIF data is highly standardized, using Darwin Core (Wieczorek et al. 2012) for data and ecological metadata language (EML, Fegraus et al. 2005) for metadata, allowing record-level search; and has implemented globally unique and persistent identifiers for datasets and downloads. Relevant in this context is that GBIF has recently introduced a new data format intended for monitoring projects and sampling event protocols (GBIF 2019b). 

In this presentation, we explore the suitability of GBIF data formats and workflows to serve LTER datasets, and the work it may take to transform typical LTER datasets into these formats. For this exercise, we use some datasets available via the DEIMS platform, corresponding to the same territory, (Sierra Nevada, Spain (e.g. Bonet 2016Bonet 2018) and transform them into the GBIF's sample-based Event core publish them in the GBIF data network, and then perform an analysis to assess how the standardized datasets work in practice, both among themselves and also with typical “occurrence-based” GBIF datasets. Finally, we discuss our findings and make recommendations for the GBIF and LTER communities.

Keywords

Long-Term Ecological Research, LTER, environmental data, DEIMS-SDR, FAIR principles, GBIF

Presenting author

Francisco Pando

Presented at

Biodiversity_Next 2019

References