Biodiversity Information Science and Standards :
Conference Abstract
|
Corresponding author: Fabricio De Jesus Rios Montero (fabrirm15@gmail.com)
Received: 21 Nov 2024 | Published: 22 Nov 2024
© 2024 Fabricio Rios Montero, Ervin Rodríguez, Maria Mora Cross
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Rios Montero FJ, Rodríguez E, Mora Cross M (2024) Relation Extraction From Unstructured Species Descriptions Using TaxonNERD and LLaMA 2 7B. Biodiversity Information Science and Standards 8: e142382. https://doi.org/10.3897/biss.8.142382
|
Ontologies are essential tools for organizing information on taxonomy, ecology, and inter-species relationships, helping to standardize ecological data and facilitate integration of large datasets. Combining ontologies with advanced Natural Language Processing (NLP) techniques, such as Named Entity Recognition (NER) and Relation Extraction (RE), has greatly improved the discovery of insights from unstructured scientific texts, particularly in biodiversity (
This study combines ontologies and NLP to analyze complex trophic interactions among animal species (
Our results (Fig.
Distribution of BERTScore (Bidirectional Encoder Representations from Transformers) metrics (F1, Precision, Recall) with most scores between 0.6 and 0.9. Outliers in Precision suggest areas for improving accuracy and reducing false positives.
Moreover, this research highlights the potential of integrating AI-driven tools with ontological frameworks to manage and analyze biodiversity data at scale (
biodiversity, ontologies, Named Entity Recognition (NER), Relation Extraction (RE), LLaMA2-7b, feeding relationships
Fabricio Ríos Montero
SPNHC-TDWG 2024
This work was made possible through the support provided by the Instituto Nacional de Costa Rica (ITCR), the International Development Research Center (IDRC) through the Central American Higher University Council (CSUCA), and the Costa Rican Innovation and Research Promoter of the Ministry of Science, Innovation, Technology, and Telecommunications (MICITT) of Costa Rica.