63urn:lsid:arphahub.com:pub:0E0032F4-55AE-5263-8B3C-F4DD637C30C2Biodiversity Information Science and StandardsBISS2535-0897Pensoft Publishers10.3897/tdwgproceedings.1.20214202147871Conference AbstractSymposium: Biological Interaction Data - towards data standardizationGlobal Biotic Interactions: A Catalyst For Integrating Existing Interaction Datasets, Connecting Data Curators And Developing Data Exchange MethodsPoelenJorrit Hjhpoelen@xs4all.nlhttps://orcid.org/0000-0003-3138-41181400 Perkins St Apt 104, Oakland, CA, United States of America400 Perkins St Apt 104Oakland, CAUnited States of America
Corresponding author: Jorrit H Poelen (jhpoelen@xs4all.nl).
Academic editor:
2017110820171e20214D33E7784-551E-5EF5-93FA-1FDC16323832114048411082017Jorrit H PoelenThis is an open access article distributed under the terms of the Creative Commons Attribution License 4.0 (CC-BY), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Since 2013, Global Biotic Interactions (GloBI, globalbioticinteractions.org, Poelen et al. 2014) has taken an opportunistic, decentralized approach to integrating, and make accessible, existing species interaction datasets. Rather than expecting dataset curators to conform to some publication regime, methods were developed to automatically and algorithmically discover, parse and link existing datasets without the need to reformat, relocate, or transfer ownership of, the existing dataset. The automated nature of GloBI helps to: (a) automate propagation of dataset updates (b) quickly detect data integration issues (e.g. outage, change in data format), (c) integrate new datasets without having to contact some central office, (d) avoid permanent data loss due to software integration bugs, and, last but not least, (e) access to datasets even after GloBI goes away.
As far back as 1927, Charles Elton, a founding father of modern ecology, realized the importance of linking natural history knowledge stored in professional journals while acknowledging the value of local (amateur) knowledge. Despite technological advances, details on how species interact are only still largely available by studying professional journals, manually inspecting datasets or striking up a conversation with a ecologist, farmer or citizen scientist. The lack of access to species interaction data is known as the Eltonian shortfall (Hortal et al. 2015). GloBI’s mission is to address this shortcoming.
By borrowing from software engineering practices such as test driven development and continuous integration, re-purposing freely available platforms such as GitHub, Zenodo, Travis CI and integrating with many existing biodiversity services (e.g. globalnames.org, eol.org,crossref.org, geonames.org), GloBI has grown to include about 2.8M interaction records spanning 100k taxa (source: globalbioticinteractions.org/references, 17 July 2017) and has established bi-directional links to projects including, but not limited to, the NCBI Taxonomy, World Register of Marine Species, Encyclopedia of Life and iNaturalist.
As GloBI continues to link existing species interaction datasets, and form a loosely affiliated community of data curators, educators and (citizen) scientists, the data integration platform is well-suited to play an active and experimental role in the development of novel methods to more easily mobilize and integrate species interaction data in an effort to realize Charles Elton's dream to "[...] provide conceptions which can link up into some complete scheme the colossal store of facts about natural history which has accumulated up to date in this rather haphazard manner. [...]" (Elton 1927).
ecologyspecies interactionsecological informaticsspecies associationsdata integration1-6 October 2017TDWG 2017 Annual ConferenceTDWG 2017Ottawa, CanadaData Integration in a Big Data Universe: Associating Occurrences with Genes, Phenotypes, and EnvironmentsPresenting author
Jorrit H. Poelen
ReferencesEltonCharles S.1927Macmillian Companhttp://dx.doi.org/10.5962/bhl.title.743510.5962/bhl.title.7435HortalJoaquínBelloFrancesco deF. Diniz-FilhoJosé AlexandreLewinsohnThomas M.LoboJorge M.LadleRichard J.2015Seven Shortfalls that Beset Large-Scale Knowledge of Biodiversity461523549http://dx.doi.org/10.1146/annurev-ecolsys-112414-05440010.1146/annurev-ecolsys-112414-054400PoelenJorrit H.SimonsJames D.MungallChris J.2014Global biotic interactions: An open infrastructure to share and analyze species-interaction datasets24148159http://dx.doi.org/10.1016/j.ecoinf.2014.08.00510.1016/j.ecoinf.2014.08.005