Biodiversity Information Science and Standards : Conference Abstract
PDF
Conference Abstract
The Verification of Ecological Citizen Science Data: Current approaches and future possibilities
expand article infoEmily Baker, Jonathan P Drury, Johanna Judge§, David B. Roy|, Graham C Smith, Philip A Stephens
‡ Conservation Ecology Group, Department of Biosciences, Durham University, Durham, United Kingdom
§ National Biodiversity Network Trust, Nottingham, United Kingdom
| Centre for Ecology & Hydrology, Crowmarsh Gifford, Wallingford, Wallingford, United Kingdom
¶ National Wildlife Management Centre, Animal and Plant Health Agency, York, United Kingdom
Open Access

Abstract

Citizen science schemes (projects) enable ecological data collection over very large spatial and temporal scales, producing datasets of high value for both pure and applied research. However, the accuracy of citizen science data is often questioned, owing to issues surrounding data quality and verification, the process by which records are checked after submission for correctness. Verification is a critical process for ensuring data quality and for increasing trust in such datasets, but verification approaches vary considerably among schemes. Here, we systematically review approaches to verification across ecological citizen science schemes, which feature in published research, aiming to identify the options available for verification, and to examine factors that influence the approaches used (Baker et al. 2021). We reviewed 259 schemes and were able to locate verification information for 142 of those. Expert verification was most widely used, especially among longer-running schemes. Community consensus was the second most common verification approach, used by schemes such as Snapshot Serengeti (Swanson et al. 2016) and MammalWeb (Hsing et al. 2018). It was more common among schemes with a larger number of participants and where photos or video had to be submitted with each record. Automated verification was not widely used among the schemes reviewed. Schemes that used automation, such as eBird (Kelling et al. 2011) and Project FeederWatch (Bonter and Cooper 2012) did so in conjunction with other methods such as expert verification. Expert verification has been the default approach for schemes in the past, but as the volume of data collected through citizen science schemes grows and the potential of automated approaches develops, many schemes might be able to implement approaches that verify data more efficiently. We present an idealised system for data verification, identifying schemes where this hierachical system could be applied and the requirements for implementation. We propose a hierarchical approach in which the bulk of records are verified by automation or community consensus, and any flagged records can then undergo additional levels of verification by experts.

Keywords

ecology, big data, verification, crowdsourcing, data quality

Presenting author

Emily Baker

Presented at

TDWG 2021

Acknowledgements

EB was supported by the Natural Environment Research Council’s IAPETUS2 Doctoral Training Partnership award number NE/S007431/1. DR was supported by the Natural Environment Research Council award number NE/R016429/1 as part of the UK-SCAPE programme delivering National Capability.

References