Using a Deep Convolutional Neural Network for Extracting Morphological Traits from Herbarium Images

Yue Zhu; Thibaut Durand; Eric Chenin; Marc Pignal; Patrick Gallinari; Régine Vignes-Lebbe

doi:10.3897/tdwgproceedings.1.20400

Proceedings of TDWG : Conference Abstract

Conference Abstract

Using a Deep Convolutional Neural Network for Extracting Morphological Traits from Herbarium Images

Yue Zhu^‡, Thibaut Durand^§, Eric Chenin^|, Marc Pignal^¶, Patrick Gallinari^§, Régine Vignes-Lebbe^#

‡ Ecole Polytechnique, Palaiseau, France

§ UPMC Paris 6 - Sorbonne Universités -- LIP6, Paris, France

| IRD, Paris, France

¶ Institut de Systématique, Évolution, Biodiversité ISYEB - UMR 7205 – CNRS, MNHN, UPMC, EPHE Muséum national d’Histoire naturelle, Sorbonne Universités, Paris, France

# Institut de Systématique, Évolution, Biodiversité ISYEB - UMR 7205 – CNRS, MNHN, UPMC, EPHE UPMC Univ. Paris 06, Sorbonne Universités, Paris, France

Corresponding author: Yue Zhu (yue.zhu@polytechnique.edu), Régine Vignes-Lebbe (regine.vignes_lebbe@upmc.fr)

Received: 17 Aug 2017 | Published: 18 Aug 2017

This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Citation: Zhu Y, Durand T, Chenin E, Pignal M, Gallinari P, Vignes-Lebbe R (2017) Using a Deep Convolutional Neural Network for Extracting Morphological Traits from Herbarium Images. Proceedings of TDWG 1: e20400. https://doi.org/10.3897/tdwgproceedings.1.20400

Abstract

Natural history collection data are now accessible through databases and web portals. However, ecological or morphological traits describing specimens are rarely recorded while gathering data. This lack limits queries and analyses. Manual tagging of millions of specimens will be a huge task even with the help of citizen science projects such as “les herbonautes” (http://lesherbonautes.mnhn.fr). On the other hand, deep learning methods that use convolutional neural networks (CNN) demonstrate their efficiency in various domains, such as computer vision (Krizhevsky et al. 2012, Azizpour et al. 2016), speech recognition (Abdel-Hamid et al. 2014) or face identification (Li et al. 2015, Freytag et al. 2016).

We aim to use deep learning to provide a visual representation of words used to describe plants (e.g. simple, or compound leaf), and to associate those words with specimens in the Paris herbarium. This will provide a semantic description of each of the 7 millions images of the fully digitized collection of the Paris herbarium in the Muséum National d'Histoire Naturelle (MNHN, Paris, France). In a proof of concept project, we have used a CNN - pre-trained on the image database ImageNet (http://www.image-net.org) - in order to identify 4 morphological traits of leaves, using 103,000 herbarium images from 11 different taxa: margin (entire / dentate), leaf attachment (opposite / alternate), leaf organization (simple / compound), plant (woody / non-woody) (see Fig. 1) Seventy percent of images are used to train the weights of the model (in a supervised learning process that uses a training set already tagged for the 4 characteristics), 10% are used as validation set to tune the hyper-parameters of the model and to avoid overfitting, and 20% are used as test set to evaluate the generalization performances of the final neural network. The first results are encouraging with over 80% success on the test set. In a second step, we test if the neural network is not overfitting the training examples, and can generalize to new taxa. If we restrict the training set to a small number of taxa (4 taxa containing 76% of images), the success rate on the 7 other taxa (unseen during training) decreases drastically. A good sampling of the taxonomic diversity of plants appears crucial to train the neural network. A second method visualizes the area of each image that was detected by the CNN as the most important for morphological character recognition (Durand et al. 2017). This method provides an explanatory view of the automatic recognition process. In this poster we describe methods and results on botanical images for the different taxa. We discuss perspectives on image tagging of the Paris herbarium, and how to combine the citizen science project in order to annotate images with CNN automatic image description.

Figure 1.

The deep learning process using the convolutionnal neural network ResNet.

Keywords

Deep learning, convolutionnal neural network, herbarium, image, morphology

Presenting author

Régine Vignes-Lebbe

Acknowledgements

Funding program

Grant title

Hosting institution

Ethics and security

Author contributions

Conflicts of interest

References

Abdel-Hamid O, Mohamed A, Jiang H, Deng L, Penn G, Yu D (2014)

Convolutional Neural Networks for Speech Recognition

IEEE/ACM Transactions on Audio, Speech, and Language Processing

(

1533

‑

1545

. https://doi.org/10.1109/taslp.2014.2339736

Azizpour H, Razavian AS, Sullivan J, Maki A, Carlsson S (2016)

Factors of Transferability for a Generic ConvNet Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence

(

1790

‑

1802

. https://doi.org/10.1109/tpami.2015.2500224

Durand T, Mordan T, Thome N, Cord M (2017)

WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation

29th IEEE Conference on Computer Vision and Pattern Recognition

Freytag A, Rodner E, Simon M, Loos A, Kühl HS, Denzler J (2016)

Chimpanzee faces in the wild: Log-Euclidean CNNs for predicting identities and attributes of primates

. In: Rosenhahn B, Bjoern A (Eds)

Pattern recognition 38th German conference, GCPR 2016

Springer International Publishing AG

Hannover, Germany

Krizhevsky A, Sutskever I, Hinton G (2012)

ImageNet Classification with Deep Convolutional Neural Networks.

NIPS 2012

Li H, Lin Z, Shen X, Brandt J, Hua G (2015)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

2015

. https://doi.org/10.1109/cvpr.2015.7299170

Supplementary material

Endnotes