Skip to content

‘SorghumWeedDataset_Segmentation’ is created to address real-time weed challenges precisely and encourage weed research using computer vision applications.

License

Notifications You must be signed in to change notification settings

JustinaMichael/SorghumWeedDataset_Segmentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SorghumWeedDataset_Segmentation

First appeared at https://data.mendeley.com/datasets/y9bmtf4xmr/1

Purpose of dataset creation

‘SorghumWeedDataset_Segmentation’ is created to address real-time weed challenges precisely and encourage weed research using computer vision applications.

About the dataset

‘SorghumWeedDataset_Segmentation’ is a crop-weed research dataset with 5555 manually pixel-wise annotated data segments from 252 data samples which can be used for object detection, instance segmentation, and semantic segmentation. The data segments consist of sorghum samplings (Class 0), Grasses (Class 1), and Broad-leaf weeds (Class 2) which are the three research objects focused during this data acquisition process. The TVT (Train: Validate: Test) ratio is set as 8:1:1 to split the data samples into training, validation, and testing. The ground truth preparation is carried out by manually annotating the data segments pixel-wise, using VIA (VGG Image Annotator) software. The respective annotation files for training, validation, and testing are provided in JSON, CSV, and COCO formats.

Equipment used for data acquisition

To record a rich set of information on the research objects, a state-of-the-art instrument - Canon EOS 80D – a Digital Single Lens Reflex (DSLR) camera with a sensor type of 22.3mm x 14.9 mm CMOS is used.

Data type, format, and size

Each data sample is an RGB image represented in JPEG format with 6000 × 4000 pixels making an average size of 13MB. This rich set of information from the data sample assisted in annotating data segments of plant length lesser than 0.5cm.

Data acquisition

This dataset emphasizes the early stages of crop growth to meet the challenges faced during the ‘Critical period of weed competition’. Data samples are captured from agriculture fields that follow both uniform crop spacing and random crop spacing.

Temporal coverage

Data is acquired during April and May 2023. To generalize the dataset, data is acquired in various light and weather conditions with varying distances.

Geographical coverage

Data is acquired from Sri Ramaswamy Memorial (SRM) Care Farm, Chengalpattu district, Tamil Nadu, India. To the best of our knowledge, ‘SorghumWeedDataset_Segmentation’ is the first open-access crop-weed research dataset from Indian fields for segmentation that deals with weed issues in uniform and random crop-spacing fields.

Expected outcome

The expected outcome of this dataset will be an Artificial Intelligence (AI) model that localizes and segments all the research objects present in a particular data sample.

Detailed description

The data article in the journal "Data in Brief" provides a thorough explanation of the dataset and the data acquisition procedure. It can be accessed at https://www.sciencedirect.com/science/article/pii/S2352340923009678

Citation

If you find this dataset helpful and use it in your work, kindly cite this dataset using “Michael, Justina; M, Thenmozhi (2023), “SorghumWeedDataset_Segmentation”, Mendeley Data, V1, doi: 10.17632/y9bmtf4xmr.1”

Contributors profile

  1. Justina Michael. J
    Google Scholar: https://scholar.google.com/citations?user=pEEzO14AAAAJ&hl=en&oi=ao
    ORCID: https://orcid.org/0000-0001-8072-3230
  2. Dr. M. Thenmozhi
    Google Scholar: https://scholar.google.com/citations?user=Es49w08AAAAJ&hl=en&oi=ao
    ORCID: https://orcid.org/0000-0002-8064-5938

About

‘SorghumWeedDataset_Segmentation’ is created to address real-time weed challenges precisely and encourage weed research using computer vision applications.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published