Investment in the Long-Tail of Biodiversity Data: From local research to global knowledge

In business, the “long-tail economy” refers to a market strategy where the gravity center shifts from a few high-demand products to many, varied products focused on small niches. Commercialization of individually low-demand products can be profitable as long as their production cost is low and, all taken together, they aggregate into a big chunk of the market. Similarly, in the “business” of biodiversity data acquisition, we can find several mainstream products that produce zillions of bits of information every year and account for most of the budget allocated to increase our primary data-based knowledge about Earth’s biological diversity. These products play a crucial role in biodiversity research. However, along with these large global projects, there is a constellation of small-scale institutions that work locally, but whose contribution to our understanding of natural processes should not be dismissed. These information datasets can be collectively referred to as the “long-tail biodiversity data”. Here we present the case of the Museum of Sciences University of Navarra, which harbors the research activity of the Museum of Zoology (MZNA) and herbarium (PAMP) of the University of Navarra in Spain. For the last 40 years, its members have been involved in ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ ‡ © Galicia D et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. hundreds of research projects, from local to international level—but quantitatively, the vast majority of its biodiversity records come from Navarra, a smallish (10,000 sq. km) administrative region in the north of Spain. Despite its modest area, the available information about the region in the Museum database approaches one million records of thousands of species, including dozens of type series. Fifteen years ago, a series of national research grants enabled boosting digitization and public access to the database records through the Global Biodiversity Information Facility ( GBIF). Although those grants were never renewed, the Museum continued its digitizing and standardizing program on vouchered collections, as well as sampling additional raw biodiversity data through long term ecological sites using the Museum’s resources, which annually provide thousands of new records at the local level. Currently, the Museum has already published through GBIF’s IPT (Integrated Publishing Toolkit) 30 datasets, containing more than half-million records of about 5700 taxa. Its records have contributed to more than 60 peer-reviewed publications over the last five years. Institutions that basically harvest biodiversity data at a local scale usually show a high degree of specialization, gathering researchers with strong (albeit often narrow) expertise in the taxonomy and ecology of nearby ecosystems. They are thus an extremely valuable tool when dealing with processes resulting in diversity changes that can be identified rather precisely, especially when their work can be traced back many decades. Making all this fine-scale information accessible and actionable requires, in most of the cases, a rather modest investment in staff training on data management (e.g., standards, database interoperability) or museum curation procedures, and on informatics infrastructure. As in the case of business, it is not a matter of choosing between producing blockbusters or independent cinema--but of leveraging available resources and maximizing output.


Abstract
In business, the "long-tail economy" refers to a market strategy where the gravity center shifts from a few high-demand products to many, varied products focused on small niches. Commercialization of individually low-demand products can be profitable as long as their production cost is low and, all taken together, they aggregate into a big chunk of the market. Similarly, in the "business" of biodiversity data acquisition, we can find several mainstream products that produce zillions of bits of information every year and account for most of the budget allocated to increase our primary data-based knowledge about Earth's biological diversity. These products play a crucial role in biodiversity research. However, along with these large global projects, there is a constellation of small-scale institutions that work locally, but whose contribution to our understanding of natural processes should not be dismissed. These information datasets can be collectively referred to as the "long-tail biodiversity data".
Here we present the case of the Museum of Sciences University of Navarra, which harbors the research activity of the Museum of Zoology (MZNA) and herbarium (PAMP) of the University of Navarra in Spain. For the last 40 years, its members have been involved in ‡ hundreds of research projects, from local to international level-but quantitatively, the vast majority of its biodiversity records come from Navarra, a smallish (10,000 sq. km) administrative region in the north of Spain. Despite its modest area, the available information about the region in the Museum database approaches one million records of thousands of species, including dozens of type series.
Fifteen years ago, a series of national research grants enabled boosting digitization and public access to the database records through the Global Biodiversity Information Facility ( GBIF). Although those grants were never renewed, the Museum continued its digitizing and standardizing program on vouchered collections, as well as sampling additional raw biodiversity data through long term ecological sites using the Museum's resources, which annually provide thousands of new records at the local level. Currently, the Museum has already published through GBIF's IPT (Integrated Publishing Toolkit) 30 datasets, containing more than half-million records of about 5700 taxa. Its records have contributed to more than 60 peer-reviewed publications over the last five years.
Institutions that basically harvest biodiversity data at a local scale usually show a high degree of specialization, gathering researchers with strong (albeit often narrow) expertise in the taxonomy and ecology of nearby ecosystems. They are thus an extremely valuable tool when dealing with processes resulting in diversity changes that can be identified rather precisely, especially when their work can be traced back many decades. Making all this fine-scale information accessible and actionable requires, in most of the cases, a rather modest investment in staff training on data management (e.g., standards, database interoperability) or museum curation procedures, and on informatics infrastructure. As in the case of business, it is not a matter of choosing between producing blockbusters or independent cinema--but of leveraging available resources and maximizing output.
Keywords long-tail biodiversity, resource allocation, investment, local biodiversity, staff training

Presenting author
David Galicia