Proceedings of TDWG : Conference Abstract
Darwin Cloud: Mapping real-world data to Darwin Core
John Wieczorek‡,§, Paul J. Morris§, James Hanken§, David B. Lowery§, Bertram Ludäscher|, James Macklin¶, Timothy McPhillips|, Robert A. Morris#,§, Qian Zhang|
‡ Museum of Vertebrate Zoology, University of California, Berkeley, United States of America
§ Museum of Comparative Zoology, Harvard University, Cambridge, MA, United States of America
| University of Illinois Urbana-Champaign, Champaign, United States of America
¶ Agriculture and Agri-Food Canada, Ottawa, Canada
# University of Massachusetts, Boston, Boston, United States of America
Open Access

Abstract

Since the ratification of Darwin Core as a TDWG standard in 2009, data publishers have had to struggle with the essential step of mapping fields in working databases to Darwin Core terms (Wieczorek et al. 2012) in order to publish and share data using that standard. Doing so requires a good understanding of both the data set and Darwin Core. The accumulated knowledge about these mappings constitutes what we call the "Darwin Cloud." We will explore the nature of data mapping challenges and the potential for semi-automated solutions to them. Specifically, we will look at the "Darwinizer" actor, its use in related workflows within the Kurator data quality framework, and the implications for community-managed vocabularies.
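
To make the mapping problem concrete, the following is a minimal sketch of a semi-automated field-to-term lookup. It is not the Darwinizer actor's implementation. The `DARWIN_CLOUD` table, the `normalize` rule, and the `darwinize_header` function are hypothetical names introduced only for illustration, as are the sample field names. Only the target names (`catalogNumber`, `scientificName`, `decimalLatitude`, `decimalLongitude`, `recordedBy`, `eventDate`) are actual Darwin Core terms.

```python
import re

# Hypothetical excerpt of accumulated mapping knowledge: normalized
# real-world field names (keys) and the Darwin Core terms they map to.
DARWIN_CLOUD = {
    "catno": "catalogNumber",
    "catalognumber": "catalogNumber",
    "sciname": "scientificName",
    "scientificname": "scientificName",
    "lat": "decimalLatitude",
    "latitude": "decimalLatitude",
    "long": "decimalLongitude",
    "longitude": "decimalLongitude",
    "collector": "recordedBy",
    "datecollected": "eventDate",
}


def normalize(field_name):
    """Lowercase a field name and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", field_name.lower())


def darwinize_header(fields):
    """Return (field, term) pairs; term is None when no mapping is known."""
    return [(field, DARWIN_CLOUD.get(normalize(field))) for field in fields]


# Illustrative header from an imagined working database.
header = ["Cat_No", "Sci Name", "LAT", "Long", "Collector", "Habitat notes"]
mapping = darwinize_header(header)

for field, term in mapping:
    print(f"{field!r} -> {term if term else 'unmapped (needs human review)'}")
```

In this sketch, fields with no known mapping are returned with `None` rather than guessed, which keeps the process semi-automated: a person who understands both the data set and Darwin Core resolves the remainder, and that decision could in principle be added to the shared table.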

Keywords

Biodiversity Informatics, Darwin Core, Data Quality

Presenting author

John Wieczorek

Funding program

NSF DBI 1356438 and 1356751

References
