The Street View Text Dataset (SVT) Download Link
Please download the data from the website above and unzip the file. After unzipping the file, the data structure should be like:
svt1
├── img
│ ├── 00_00.jpg
│ ├── 00_01.jpg
│ ├── 00_02.jpg
│ ├── 00_03.jpg
│ ├── ...
├── test.xml
└── train.xml
To prepare the data for text recognition, you can run the following command:
python tools/dataset_converters/convert.py \
--dataset_name svt --task rec \
--image_dir path/to/svt1/ \
--label_dir path/to/svt1/train.xml \
--output_path path/to/svt1/rec_train_gt.txt
Then you can have a folder cropped_images/
and an annotation file rec_train_gt.txt
under the folder svt1/
.