Dataset, labels, and codes for coral condition classification paper: Deep learning for multi-label classification of coral conditions in the Indo-Pacific using underwater photo transect method
The Koh Tao Coral Condition Survey Project was initiated by Jun Sasaki’s laboratory and implemented in collaboration with the New Heaven Reef Conservation NPO, Koh Tao, Thailand.
Given the Koh Tao Island case context, this research aims to apply state-of-the-art deep learning models to devise a new multi-label classification model for coral reef conservation, mainly aiming at revolutionizing the current monitoring program by integrating underwater images and citizen science text data. Coral images were collected by divers holding an Olympus TG-6 with waterproof housing. Geographical parameters were collected as metadata and saved as tabular data of the supplementary materials.
The project covered most of the coral habitats surrounding Koh Tao, selected based on the weather conditions and characteristics of the habitats. The dataset consists of (1) the original, uncropped 3000*4000 pixels images; (2) cropped, 512x512 pixels image patches for the training and test of the model; (3) classification labels annotated by human experts; and (4) metadata of each field survey.
There are 23,965 image patches generated from nine times of field surveys. At each site, the sampling strategy is as follows: All the image patches are in JPG format. The images are organized based on the survey they were collected from. For example, the folder ‘20230804_CBK’ is named after the survey date_abbreviation of the site. The full name of each site can be found in the ‘surveys_metadata’ in the tabular_data folder.
The name of each image consists of five parts separated by underscores: the three-letter site code of the sampling site (e.g., ALK=Ao Leuk, CBK=Chalok Baan Kao Bay), the unique four-digit survey number, the two-digit transect number, the date of the survey formatted as YYYYMMDD, the four-digit image number, and the number of the image patch. Examples of the images are as follows: CBK_0001_11_20230805_0001_12, TTB_0002_00_20230815_0002_02
1.Coral images: (coral_images.zip) this file contains all the underwater coral images taken during the field survey. All the image patches are in 512x512 pixels.
2.Metadata: (surveys_metadata.csv) this file provides additional information about the images, and it is organized by the survey id, which refers to every single time of the field observation. The parameters contained in this table are as follows:
- surveyid: the unique four-digit code representing each observation. For instance, the first field observation was conducted on August 4th, 2023, at Chalok Baan Kao Bay; all the images from this observation will be named with survey id 0001.
- transectid: the two-digit code representing the transect lines. The first digit identifies if the transect is permanent (1-permanent, 0-temporal), and the second digit reflects if the transect is the 1-shallow line (2-4m depth) or 2-deep line (9-10m depth). Additionally, 00 stands for images collected randomly without setting up transects.
- survey_date: in YYYYMMDD form, reflecting the date the survey is conducted.
- site: the three-letter site identification code representing which diving site those data were collected from.
- folder_name: name of the folder where the images were saved.
- lat_start, lng_start: the latitude and longitude of the starting point of this survey.
- camera: camera used to capture the images.
- depth: average depth of this survey.
- temp: average temperature of this survey.
3.Labelset of the annotation: (labelsets.csv) this table contains the set of possible labels that can be assigned to the objects within the images. It includes four labels reflecting the coral conditions and four for stressors, which are 1 = Healthy coral, 2 = Compromised coral, 3 = Dead coral, 4 = Rubble, 5 = Competition, 6 = Disease, 7 = Predation, and 8 = Physical issues
4.Annotations: (annotations.csv) annotation file contains the image patch id and its corresponding classification label, which is used for model training and image classification. This annotation was labeled by human experts in marine ecology and coral conservation.
An ensemble learning-based model combining Swin-Transformer-Small, Swin-Transformer-Base, and EfficientNet-B7 was proposed to automatically classify coral images with multiple labels, following the classification standard used in coral reef conservation programs. Codes are available in this repository, and the overall framework of this proposed approach is illustrated as follows:
If this dataset contributes to your research, please consider citing our paper:
@article{shao2024coral,
author = {Shao, Xinlei and Chen, Hongruixuan and Magson, Kirsty and Wang, Jiaqi and Song, Jian and Chen, Jundong and Sasaki, Jun},
title = {Deep Learning for Multilabel Classification of Coral Reef Conditions in the Indo-Pacific Using Underwater Photo Transect Method},
journal = {Aquatic Conservation: Marine and Freshwater Ecosystems},
volume = {34},
number = {9},
pages = {e4241},
doi = {https://doi.org/10.1002/aqc.4241},
note = {e4241 AQC-24-0036.R1},
year = {2024}
}
For any questions, please contact us.