Biodiversity Information Science and Standards :
Conference Abstract
|
Corresponding author: Laurence Benichou (laurence.benichou@mnhn.fr)
Received: 09 Aug 2023 | Published: 09 Aug 2023
© 2023 Laurence Benichou, Marianne Salaün, Iva Boyadzhieva, Seyhan Demirov, Teodor Georgiev, Lyubomir Penev
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Benichou L, Salaün M, Boyadzhieva I, Demirov S, Georgiev T, Penev L (2023) Pre-Publication Data Linking in Taxonomy and Biodiversity: The ARPHA and Metotaxa-Metostem Publishing Systems. Biodiversity Information Science and Standards 7: e110919. https://doi.org/10.3897/biss.7.110919
|
|
The traditional way of publishing in PDF makes it difficult to retrospectively convert the legacy literature into data. This presentation will discuss pre-publication tagging as an alternative solution for publishing FAIR (Findable, Accessible, Interoperable, Resuable) biodiversity data.
The Metotaxa-Metostem workflow
Тhe MetoTaxa project aims to create a new digital production chain for the European Journal of Taxonomy, which enables the pre-publication semantic structuring of text, automatic tagging and semantic enrichment (annotation).
The system is based on a single-source publishing model, where the development of an XML file enables technical editors to automatically enrich text and produce multiple digital outputs. This makes it possible to structure generic or domain-specific sections of articles (e.g., Introduction; Material and methods; Taxon names or Мaterial examined). Thanks to the GoldenGate API developed by Plazi, the Text Encoding Intiative (TEI) XML source file is automatically annotated with JATS TaxPub tags: taxon names are labeled and each authorship can be checked via Catalogue of Life, each element of the material examined is parsed thanks to the preformatting of the text (
We will also briefly present MetoStem, which offers a technical solution for the digital transformation of monographs, and particularly floras. The tools and methods developed by this project will enable advanced publication of interoperable structured text and data.
ARPHA Publishing Platform
Launched in 2010 by Pensoft, ARPHA (
The second development stage of ARPHA was marked by the launch of ARPHA Writing Tool (AWT)*
Currently, AWT is being redeveloped into a standalone, freely accessible installation (AWT 2.0), based on a micro-service architecture. It enables new semantic enhancements during the authoring process, which can be confirmed by the authors before manuscript submission. Such enhancements include the in-text citations context by CiTO ontology; automated tagging of taxon names and linking to their identifiers in authoritative sources; annotator tool; nanopublication module; automated search and import of references; treatment citation module; export/import to/from JATS TaxPub; and internal communication tool for contributors.
semantic publishing, data dissemination, data liberation
Laurence Bénichou
TDWG 2023
Part of the work was supported by the Biodiversity Community Integrated Knowledge Library (BiCIKL) project, funded by the European Union's Horizon 2020 under grant agreement No 101007492; MetoStem project funded by the Fond National pour la Science ouverte No.2, France.
BiCIKL - Biodiversity Community Integrated Knowledge Library