Biodiversity Information Science and Standards : Conference Abstract
|
Corresponding author: Tomer Gueta (tomer.gu@gmail.com)
Received: 05 Apr 2018 | Published: 22 May 2018
© 2018 Tomer Gueta, Vijay Barve, Thiloshon Nagarajah, Ashwin Agrawal, Yohay Carmel
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Gueta T, Barve V, Nagarajah T, Agrawal A, Carmel Y (2018) Introducing bdclean: a user friendly biodiversity data cleaning pipeline. Biodiversity Information Science and Standards 2: e25564. https://doi.org/10.3897/biss.2.25564
|
|
A new R package for biodiversity data cleaning, 'bdclean', was initiated in the Google Summer of Code (GSoC) 2017 and is available on github. Several R packages have great data validation and cleaning functions, but 'bdclean' provides features to manage a complete pipeline for biodiversity data cleaning; from data quality explorations, to cleaning procedures and reporting. Users are able go through the quality control process in a very structured, intuitive, and effective way. A modular approach to data cleaning functionality should make this package extensible for many biodiversity data cleaning needs. Under GSoC 2018, 'bdclean' will go through a comprehensive upgrade. New features will be highlighted in the demonstration.
Tomer Gueta
ISF Grant No. 127/16
Google Summer of Code program