Biodiversity Information Science and Standards : Conference Abstract
PDF
Conference Abstract
The Future of Referencing Specimens Is Near: Cite the Digital Specimen DOI
expand article infoWouter Addink‡,§, Soulaine Theocharides§,
‡ Distributed System of Scientific Collections - DiSSCo, Leiden, Netherlands
§ Naturalis Biodiversity Center, Leiden, Netherlands
Open Access

Abstract

Specimens are often mentioned in scholarly publications or data infrastructures by referencing the local identifiers attached to the objects held in specimen collections. However, these are often only unique to the issuing institution and not resolvable. Transforming these into globally unique identifiers, such as the 'Darwin Core Triplet' constructed from codes that specify the institution, the collection and the accession or catalog number, is an imperfect solution. These cannot be unambiguously validated nor be dereferenced (Guralnick et al. 2015Groom et al. 2019), are vulnerable to human error, and rely on accession numbers that sometimes change over time. Earlier approaches to give specimens globally unique and resolvable identifiers had limited success (Page 2009). Attaching labels with new identifiers is too costly and time consuming, apart from digital infrastructure challenges.

The Digital Extended Specimen is a new object on the internet, containing all known information about the specimen, thus acting as a surrogate for the physical object. To distinguish it from its physical counterpart, it needs a new identifier, which also provides a solution to unambiguously and persistently refer to specimens. Digital specimens get Digital Object Identifiers (DOIs) and Persistent Identifiers, in the infrastructure developed in the BiCIKL project (see deliverable D7.1 and this blog post). Ties to these accession or catalog numbers of physical specimens, allow these local identifiers to change over time, while still persistently resolving to the digital object.

DOIs guarantee global uniqueness, persistence and reliable resolution through the oversight of the DOI Foundation and its registration agencies (RAs). DiSSCo, Distributed System of Scientific Collections, is creating a partnership with DataCite RA, where both partners invest to provide Digital Specimen DOIs and enhance their findability, leveraging from their metadata services. This aids the potential of Digital Specimen DOIs to reliably create links with data in other infrastructures, cite individual specimens, and improve the FAIR-ness (Findable, Accessible, Interoperable, Reusable) of specimen data worldwide. We will present how to use these new DOIs in practice.

The use of Digital Specimen DOIs in a publication to reference a specimen is already piloted (Deeleman-Reinhold et al. 2024). All specimen records cited in the publication have been assigned a DOI that points to a digital representation of the specimen, the Digital Specimen, which can evolve over time to become a digital extended specimen by digitally linking it to relevant ecological, environmental, and related data such as the publication itself. The DOIs redirect both to a machine readable and a human readable version and include metadata in the DOI record for machine actionability that is also findable in DataCite Commons. The Crosscite citation formatter offers guidance to users how to cite a Digital Specimen DOI, similar to DOIs that refer to other works.

Keywords

identifiers, Digital Extended Specimen, BiCIKL, PID, data infrastructure

Presenting author

Wouter Addink

Presented at

SPNHC-TDWG 2024

Funding program

Horizon 2020 Research and Innovation Action, Grant Agreement No. 101007492

Horizon 2020 INFRADEV-2019-2020 project, Grant Agreement No. 871043

Grant title

BiCIKL, Biodiversity Community Integrated Knowledge Library

DiSSCo Transition

Conflicts of interest

The authors have declared that no competing interests exist.

References

login to comment