SCUT-CTW1500 Datasets download link
Please download the data from the website above and unzip the file. After unzipping the file, the data structure should be like:
ctw1500
├── ctw1500_train_labels
│ ├── 0001.xml
│ ├── 0002.xml
│ ├── ...
├── gt_ctw_1500
│ ├── 0001001.txt
│ ├── 0001002.txt
│ ├── ...
├── test_images
│ ├── 1001.jpg
│ ├── 1002.jpg
│ ├── ...
├── train_images
│ ├── 0001.jpg
│ ├── 0002.jpg
│ ├── ...
To prepare the data for text detection, you can run the following commands:
python tools/dataset_converters/convert.py \
--dataset_name ctw1500 --task det \
--image_dir path/to/ctw1500/train_images/ \
--label_dir path/to/ctw1500/ctw_1500_train_labels \
--output_path path/to/ctw1500/train_det_gt.txt
python tools/dataset_converters/convert.py \
--dataset_name ctw1500 --task det \
--image_dir path/to/ctw1500/test_images/ \
--label_dir path/to/ctw1500/gt_ctw_1500 \
--output_path path/to/ctw1500/test_det_gt.txt
Then you can have two annotation files train_det_gt.txt
and test_det_gt.txt
under the folder ctw1500/
.