Biodiversity Information Science and Standards :
Conference Abstract
|
Corresponding author: Quentin Groom (quentin.groom@plantentuinmeise.be)
Received: 12 Sep 2021 | Published: 13 Sep 2021
© 2021 Sofie Meeus, Wouter Addink, Donat Agosti, Christos Arvanitidis, Mariya Dimitrova, Juan Miguel González-Aranda, Jörg Holetschek, Sharif Islam, Thomas Jeppesen, Daniel Mietchen, Tim Robertson, Francisco Sanchez Cano, Maarten Trekels, Quentin Groom
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Meeus S, Addink W, Agosti D, Arvanitidis C, Dimitrova M, González-Aranda JM, Holetschek J, Islam S, Jeppesen TS, Mietchen D, Robertson T, Sanchez Cano FM, Trekels M, Groom Q (2021) Hacking Infrastructures Together: Towards better interoperability of infrastructures. Biodiversity Information Science and Standards 5: e74325. https://doi.org/10.3897/biss.5.74325
|
The BiCIKL Project is born from a vision that biodiversity data are most useful if they are viewed as a network of data that can be integrated and viewed from different starting points. BiCIKL’s goal is to realize that vision by linking biodiversity data infrastructures, particularly for literature, molecular sequences, specimens, nomenclature and analytics. BiCIKL is an Open Science project creating Open FAIR data and services for the whole research community. BiCIKL intends to inspire novel, innovative, research and build services that can produce new and valuable knowledge, necessary for the protection of biodiversity and of our environment. BiCIKL will develop methods and workflows to harvest, link and access data extracted from literature. Yet, as the project gets underway, we need to better understand the existing infrastructures, their limitations, the nature of the data they hold, the services they provide and particularly how they can interoperate. To do this we organised a week-long hackathon where small teams worked on a number of pilot projects (Table
Topics proposed for the BiCIKL Hackathon (September 2021) and the infrastructures and related organizations that will be linked. Details of each topic can be found on GitHub (https://github.com/pensoft/BiCIKL).
Title | Partners/infrastructures involved |
---|---|
Finding the lost parents | SIB, GBIF, CoL, Plazi, Wikidata |
How good are Triple IDs in ENA? | ENA, GBIF |
Enhance the GBIF clustering algorithms | GBIF |
Assigning Latin scientific names to operational taxonomic units based on sequence clusters | UNITE/PlutoF, CoL |
Registering biodiversity-related vocabulary as Wikidata lexemes and link their senses to Wikidata items | Wikidata, Plazi, CoL |
FAIR Digital Object design from multiple sources | TDWG, GBIF, ENA, OpenBiodiv, DiSSCo |
Enriching Wikidata with information from OpenBiodiv about type specimens in context from different literature sources | OpenBiodiv, Wikidata, GBIF, Plazi, Zoobank |
Linking specimen with material citation and vice versa | SIB, BFH, MBG, GBIF, BGBM |
Hidden women in science | Wikidata, Plazi, GBIF, CETAF, Science stories |
An IPFS-Blockchain Interface | GBIF, Plazi, CETAF, Species 2000 |
We will present our experience of running a hackathon and our evaluation of how successfully it achieved its aims. We will also give examples of the projects we conducted and how successful they were. Finally we will give our preliminary evaluation of what we learned about the interoperability of infrastructures and what recommendations we can give to improve their interoperability, whether that is improvements to the data standards used, the means to access the data and analyse them, or even the physical bandwidth and computational restrictions that limit the potential for research.
molecular sequence data, nomenclature, specimens, biodiversity informatics, FAIR Data, Wikidata, literature, linked data
Quentin Groom
TDWG 2021
The BiCIKL project receives funding from the European Union's Horizon 2020 Research and Innovation action under grant agreement No 101007492.
Biodiversity Community Integrated Knowledge Library (BiCIKL)
SM and QG hosted the hackathon, had the initial concept and are the primary organizers. All authors provided research topics for the teams, led teams in the hackathon and contributed to the outcomes.
The authors do not declare any conflict of interests.