Biodiversity Information Science and Standards :
Conference Abstract
|
Corresponding author: Margret Steinthorsdottir (margret.steinthorsdottir@nrm.se)
Received: 08 Sep 2023 | Published: 11 Sep 2023
© 2023 Margret Steinthorsdottir, Veronika Johansson, Manash Shah
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Steinthorsdottir M, Johansson VA, Shah M (2023) Swedish Biodiversity Data Infrastructure (SBDI): Insights from the Swedish ALA installation. Biodiversity Information Science and Standards 7: e112429. https://doi.org/10.3897/biss.7.112429
|
The Swedish Biodiversity Data Infrastructure (SBDI) is a biodiversity informatics infrastructure and is the key national resource for data-driven biodiversity and ecosystems research. SBDI rests on three pillars:
SBDI includes tools for mobilising and working with a wide range of biodiversity data. The figure shows examples of data types and data mobilisation tools, as well as examples of web-based graphical front ends and software wrappers relying on SBDI APIs to provide data access, analysis and visualisation capabilities. Illustration by Johan Samuelsson for the Swedish Biodiversity Data Infrastructure.
SBDI was formed in early 2021 and represents the final step in an amalgamation of national infrastructures for biodiversity and ecosystems research. SBDI includes the Swedish node of the Global Biodiversity Information Facility (GBIF), the key international infrastructure for sharing biodiversity data.
SBDI's predecessor Biodiversity Atlas Sweden (BAS) was an early adopter of the Atlas of Living Australia (ALA) platform. SBDI pioneered the container-based deployment of the platform using Docker and Docker Swarm. This container-based approach helps simplify deployment of the platform, which is characterised by a microservice architecture with loosely coupled services. This enables scalability, modularity, integration of services, and new technology insertions.
SBDI has customised the BioCollect module to remove region-specific constraints so that it can be more readily improved for environmental monitoring in Sweden. To further support this, there are plans to develop services for the distribution of terrestrial map layers, which will provide important habitat information for artificial intelligence and machine learning research projects.
The Amplicon Sequence Variants (ASVs) portal, an interface to sequence-based observations, is an example of integration and new technology insertion. The portal developed in SBDI and seamlessly integrated with the ALA platform provides basic functionalities for searching ASVs and occurrence records using the Basic Local Alignment Search Tool (BLAST) or filters on sequencing details and taxonomy and for submitting metabarcoding dataset Fig.
Flow of metabarcoding data in SBDI: A sequencing service provider, such as the National Genomics Infrastructure (NGI), delivers sequencing data to the user. The user submits raw sequencing data and contextual data to the European Nucleotide Archive (ENA), denoises the data using e.g., nf-core/ampliseq, and submits the denoised data (ASVs and their counts in different samples, and contextual data) to the SBDI ASV portal. Metabarcoding data in the ASV database can be searched and downloaded using the portal or using the SBDI4R package (wrapped on the ALA galah package). Figure created by Manash Shah.
Future developments for SBDI include a continued focus on eDNA and monitoring data as well as the implementation of procedures for handling sensitive data.
SBDI, Docker, containers, microservice architecture
Margret Steinthorsdottir
TDWG 2023
Swedish Museum of Natural History / Naturhistoriska riksmuseet