Accurate Machine Learning–Based Germination Detection, Prediction and Quality Assessment of Three Grain Crops

Description

We present a machine learning–based method, using modern convolutional neural networks with region proposals, for an automated and high-throughput assessment of seed germination experiments for various species. The purpose of this study is to reduce the time-consuming and labor-intensive human visual inspections of seed germination experiments and to develop an improved germination prediction method that is (1) independent of custom color-based thresholds and thus can be applied to multiple seed cultivars and illumination settings and (2) can be used to better explore the dynamics of seed germination.

Example Predictions on a Germination Experiment

Color-coding:

lime: Non-germinated Prediction
darkgreen: Non-germinated Ground Truth
pink: Germinated Prediction
purple: Germinated Ground Truth

Requirements

Python 3.6
Compatible Graphics Card with CUDA >9 and cuDNN installed
nvidia-docker2 Installation Guide

Installation via Docker

Download tensorflow-object-detection Image

git clone https://github.com/tensorflow/models.git
# From the root of the git repository
docker build -f research/object_detection/dockerfiles/tf1/Dockerfile -t od .
docker run -it od

Clone this Project

git clone https://github.com/grimmlab/GerminationPrediction

Download Version 3 of the data from http://dx.doi.org/10.17632/4wkt6thgp6.3 The Data consists of 3 zipped folders:

GermPredDataset: Images (.jpg) and the corresponding Bounding Box Annotations (.xml) in PASCAL VOC format for Germination Experiments of 824 Zea mays, 811 Secale cereale and 814 Pennisetum glaucum seeds. This data can be used to train new models from scratch.
GermPredRecords: In order to reproduce the results from this paper, we provide the dataset split into training, validation and hol-out test set. Aditionally, labelmaps and configuration files for the Tensorflow Object Detection API are also available.
GermPredModels: The training checkpoint files for the model architecture and hyperparameters with the highest mean Average Precision (mAP) on the validation set and their corresponding inference graph is also published for everyone without the need of re-training the models.

Extract and move the subfolders into the correspoding project folder

GermPredRecords: move to data/
GermPredModels: move to workspace/

Overall, the project folder structure should look like this after importing the data:

GerminationPrediction
├── data
│   ├── PennisetumGlaucum   # GermPredRecords
│   ├── SecaleCereale       # GermPredRecords
│   └── ZeaMays             # GermPredRecords
│       ├── configs         # GermPredRecords
│       ├── records         # GermPredRecords
│       └── test_images     # GitHub Project
├── gifs
├── pretrained_models       # Tensorflow Model Zoo
├── scripts
└── workspace             
    ├── PennisetumGlaucum   # GermPredModels
    ├── SecaleCereale       # GermPredModels
    └── ZeaMays             # GermPredModels

Train new Models

Download and Extract additional Data to PATH/TO/PROJECT/FOLDER/data
Download pretrained Models (on COCO dataset) for specific Architectures from Tensorflow Model Zoo and extract them to PATH/TO/PROJECT/FOLDER/pretrained_models

Following models were used in this project:

faster_rcnn_inception_v2_coco
faster_rcnn_resnet50_coco
faster_rcnn_resnet101_coco
faster_rcnn_inception_resnet_v2_atrous_coco

Run Docker Image with GPU Support

docker run -it --gpus all -p 0.0.0.0:6006:6006 -v PATH/TO/PROJECT/FOLDER:/home/GerminationPrediction od

-it: Interactive Docker Shell
--gpus: lets Docker use GPUs
-p: opens port 6006 which is used by tensorboard to visualize the training/validation process
-v: binds the project folder to be used by Docker.
od: name of the pulled Docker Image

Execute Training Script inside Docker Shell (indicated by tensorflow>)

cd /home/GerminationPrediction
python scripts/train_model.py -m NEWMODELNAME -c PATH/TO/CONFIG/FILE.config
# Example:
python scripts/train_model.py -m ZeaMays -c ./data/ZeaMays/configs/INCRES_ZM_5.config

-m: Name of the new model that will be trained, a new folder will be created in /home/GerminationPrediction/workspace
-c: Configuration File for Training

Test Models Accuracy on a Hold-Out Test Set

Execute Training Script inside Docker Shell (indicated by tensorflow>)

cd /home/GerminationPrediction
python scripts/predict_testset.py -m NEWMODELNAME -c PATH/TO/TEST/CONFIG/FILE.config
# Example:
python scripts/predict_testset.py -m ZeaMays -c ./data/ZeaMays/configs/INCRES_ZM_5_test.config

-m: Name of the new model that has been trained, a new folder will be created in /home/GerminationPrediction/workspace
-c: Configuration File for Testing

optional: Change the Checkpoint that is used for Testing Tensorflow saves checkpoints of the Training process in /home/GerminationPrediction/workspace/NEWMODELNAME/ckpt/. Change the variable model_checkpoint_path in the file called checkpoint to the checkpoint that needs to be tested.

Run Inference on the Trained Models

Export the Inference Graph from a Checkpoint (this step has already been done if GermPredModels is downloaded)

cd /home/GerminationPrediction
python scripts/export_inference_graph.py -m NEWMODELNAME -c PATH/TO/CONFIG/FILE.config -p checkpoint
# Example:
python scripts/export_inference_graph.py -m ZeaMays -c ./data/ZeaMays/configs/INCRES_ZM_5.config -p 9000

-m: Name of the new model that has been trained, a new folder will be created in /home/GerminationPrediction/workspace
-c: Configuration File
-p: Checkpoint Integer (/home/GerminationPrediction/workspace/NEWMODELNAME/ckpt/model.ckpt-XXXX)

Run Inference on Images (change paths accordingly in predict_image.py)

python scripts/predict_image.py

Run Inference on a new Germination Experiment

python scripts/predict_record.py -m NEWMODELNAME -i PATH/TOPETRIDISH/FILE.record
# Example:
python scripts/predict_record.py -m ZeaMays -i ./data/ZeaMays/records/petridishes/PD_zm1_11.record

-m: Name of the new model that has been trained, a new folder will be created in /home/GerminationPrediction/workspace
-i: .record file with all captures of a Germination Experiment (single petri dish)

FAQ

Permission Denied Error

This error occurs, because the user inside the docker image is called "tensorflow" and has an uid and gid of 1000, which might not be equal to the user who downloaded the Repository or Data. To solve issues regarding Permission Denied Errors, make sure the user with the uid/gid 1000 has Read and Write permissions to all folders and files contained in the Repository and Data. From inside the folder of the GerminationPrediction project run the following command:

chmod -R 777 ./

Resource exhausted: OOM when allocating tensor with shape

This error occurs, because the memory of the GPU is large not enough to hold the model and the data. The models were trained on a RTX2080 Ti with 12 GB of VRAM. If the available GPU has less VRAM available, change the batch_size parameter in the config file.

Contributors

This best-practice pipeline is developed and maintened by members of the Bioinformatics lab of Prof. Dr. Dominik Grimm:

M.Sc. Nikita Genze

Citation

When using this workflow, please cite our publication:

Accurate Machine Learning–Based Germination Detection, Prediction and Quality Assessment of Three Grain Crops

N Genze, R Bharti, M Grieb, SJ Schultheiss, DG Grimm
Plant Methods

@article{genze_accurate_2020, title = {Accurate machine learning-based germination detection, prediction and quality assessment of three grain crops}, volume = {16}, issn = {1746-4811}, url = {https://doi.org/10.1186/s13007-020-00699-x}, doi = {10.1186/s13007-020-00699-x}, abstract = {Assessment of seed germination is an essential task for seed researchers to measure the quality and performance of seeds. Usually, seed assessments are done manually, which is a cumbersome, time consuming and error-prone process. Classical image analyses methods are not well suited for large-scale germination experiments, because they often rely on manual adjustments of color-based thresholds. We here propose a machine learning approach using modern artificial neural networks with region proposals for accurate seed germination detection and high-throughput seed germination experiments.}, number = {1}, journal = {Plant Methods}, author = {Genze, Nikita and Bharti, Richa and Grieb, Michael and Schultheiss, Sebastian J. and Grimm, Dominik G.}, month = dec, year = {2020}, pages = {157}, }

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Accurate Machine Learning–Based Germination Detection, Prediction and Quality Assessment of Three Grain Crops

Description

Example Predictions on a Germination Experiment

Requirements

Installation via Docker

Train new Models

Test Models Accuracy on a Hold-Out Test Set

Run Inference on the Trained Models

FAQ

Permission Denied Error

Resource exhausted: OOM when allocating tensor with shape

Contributors

Citation

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
data		data
gifs		gifs
pretrained_models		pretrained_models
scripts		scripts
workspace		workspace
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

License

grimmlab/GerminationPrediction

Folders and files

Latest commit

History

Repository files navigation

Accurate Machine Learning–Based Germination Detection, Prediction and Quality Assessment of Three Grain Crops

Description

Example Predictions on a Germination Experiment

Requirements

Installation via Docker

Train new Models

Test Models Accuracy on a Hold-Out Test Set

Run Inference on the Trained Models

FAQ

Permission Denied Error

Resource exhausted: OOM when allocating tensor with shape

Contributors

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages