Biodiversity Information Science and Standards: Conference Abstract
Corresponding author: Takeru Nakazato (nakazato@dbcls.rois.ac.jp)
Received: 01 Sep 2021 | Published: 01 Sep 2021
© 2021 Takeru Nakazato
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Nakazato T (2021) Knowledge Extraction from Specimen-Derived Data from GenBank to Enrich Biodiversity Information. Biodiversity Information Science and Standards 5: e73787. https://doi.org/10.3897/biss.5.73787
DNA barcoding and environmental DNA (eDNA) are increasing the need to use gene sequences in biodiversity research. GBIF (Global Biodiversity Information Facility) and GGBN (Global Genome Biodiversity Network) have both begun to address how gene sequences should be handled in this field.
In this study, as an example of linking gene sequence information with biodiversity information, I attempted to build an infrastructure for knowledge extraction from GenBank gene sequence entries that are derived from museum specimens.
In the future, I plan to map these extracted IDs to the collection IDs in the biodiversity information database. This will enable us to enrich the biodiversity information with GenBank descriptions, for example, by adding articles listed in GenBank as references to the specimen data.
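The planned mapping could be sketched roughly as follows. This is only an illustration under stated assumptions, not the pipeline used in this study: it assumes the specimen IDs come from the GenBank `/specimen_voucher` qualifier in its structured form `[institution-code:[collection-code:]]specimen_id`, and that collection records carry the Darwin Core terms `institutionCode` and `catalogNumber`. The record layout, the `references` field, and the function names are hypothetical.

```python
from typing import NamedTuple, Optional


class Voucher(NamedTuple):
    institution_code: Optional[str]
    collection_code: Optional[str]
    specimen_id: str


def parse_voucher(value: str) -> Voucher:
    """Split a voucher string into institution, collection, and specimen ID.

    Assumes the layout [institution-code:[collection-code:]]specimen_id.
    Strings without a colon are kept whole as a bare specimen ID.
    """
    parts = [p.strip() for p in value.strip().split(":", 2)]
    if len(parts) == 3:
        return Voucher(parts[0], parts[1], parts[2])
    if len(parts) == 2:
        return Voucher(parts[0], None, parts[1])
    return Voucher(None, None, parts[0])


def enrich(records: list, genbank_entries: list) -> list:
    """Attach references from GenBank entries to matching collection records.

    Matching key (an assumption): upper-cased institution code plus the
    specimen ID compared against the record's catalogNumber.
    """
    index = {
        (r["institutionCode"].upper(), r["catalogNumber"].strip()): r
        for r in records
    }
    for entry in genbank_entries:
        voucher = parse_voucher(entry["specimen_voucher"])
        if voucher.institution_code is None:
            continue  # free-text voucher: no reliable key to match on
        record = index.get((voucher.institution_code.upper(), voucher.specimen_id))
        if record is not None:
            record.setdefault("references", []).extend(entry["references"])
    return records
```

In practice, voucher strings in GenBank are often free text, so a real mapping would likely need additional normalisation (for example, of prefixes and separators in catalogue numbers) before exact matching like this becomes reliable.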
Keywords: RDF, linked open data, Wikidata, voucher specimen, natural language processing, taxonomic name
Takeru Nakazato
TDWG 2021