Biodiversity Information Science and Standards : Conference Abstract
PDF
Conference Abstract
Specimods: A web-based tool for producing Genbank submission files for sequenced museum specimens
expand article infoRichard A. Levy, Jessie W. Berta-Thompson, Gary Olds, Andrew W. Wilson
‡ Denver Botanic Gardens, Denver, United States of America
Open Access

Abstract

The expectation of published sequence data on Genbank and other sequence databases is not only a requirement for journal publication, but is incredibly valuable for continuing research in organismal evolution and ecology. The process of formatting and submitting sequence data from vouchered specimens is burdensome and tedious. Improperly formatted submissions lead to lengthy delays and frustrations on behalf of the submitters and reviewers. Specimods*1 is a new tool that utilizes the Global Biodiversity Information Facility (GBIF) API to produce formatted source modifier files that will facilitate the batch upload of specimen sequences to Genbank, using the sequence submission platform. Once a template CSV is populated with the specimen catalog numbers, institution codes, SeqID, sequence descriptions, and the sequences, Specimods will pull specimen metadata from GBIF to produce two files: a FASTA file with a complete definition line (SeqID, organism name, sequence title) and the sequences as well as a source modifier file containing specimen metadata. Currently, Specimods supports Internal Transcribed Spacer (ITS) (1, 2, or both), SSUrRNA_18s, and LSUrRNA_28s sequences. The application*2 uses an Express server and Node.js with data linked to a user account stored in a MySQL relational database. This presentation will demonstrate the tool using data from fungal specimens and show how submitting dozens of sequences with metadata can be accomplished in a matter of minutes.

Keywords

web tool, GBIF, API

Presenting author

Richard A. Levy

Presented at

TDWG 2022

Endnotes
login to comment