labelme-to-datasets-for-OCR-model-training

Convert the output data of a 'labelme' project into a dataset for OCR model training.

References

'labelme' project: https://github.com/wkentaro/labelme

Procedures

Read the 'label' and 'bbox' of each annotation from the JSON file.
Crop the image to each 'bbox' and save the crop.
Write the label of each cropped image to the label file.
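
The three steps above can be sketched roughly as follows. This is not the code of convert.py; it is a minimal illustration that assumes each annotation is a labelme shape with a 'label' and a list of 'points', that the bounding box is the min/max of those points, and that Pillow is used for cropping. The function names (bbox_from_points, crop_name, convert_one, write_labels) are hypothetical.

```python
import json
from pathlib import Path


def bbox_from_points(points):
    """Return (left, top, right, bottom) as ints from a list of [x, y] points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (int(min(xs)), int(min(ys)), int(max(xs)), int(max(ys)))


def crop_name(image_name, idx):
    """Build '[filename]_[idx].[ext]', e.g. image00001_00001.png."""
    path = Path(image_name)
    return f"{path.stem}_{idx:05d}{path.suffix}"


def convert_one(json_path, image_path, output_dir):
    """Crop every labelled shape of one image; return (crop filename, label) pairs."""
    from PIL import Image  # assumption: Pillow is installed; imported lazily

    with open(json_path, encoding="utf-8") as f:
        annotation = json.load(f)

    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    rows = []
    with Image.open(image_path) as image:
        for idx, shape in enumerate(annotation["shapes"], start=1):
            name = crop_name(Path(image_path).name, idx)
            image.crop(bbox_from_points(shape["points"])).save(output_dir / name)
            rows.append((name, shape["label"]))
    return rows


def write_labels(rows, label_path):
    """Write one '{filename}\\t{label}' line per cropped image."""
    with open(label_path, "w", encoding="utf-8") as f:
        for name, label in rows:
            f.write(f"{name}\t{label}\n")
```

Keeping the bounding-box and file-name helpers separate from the image I/O makes them easy to check without any image files on disk.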

Usage example:

python3 convert.py \
        --input_path ./input \
        --output_path ./output

Input data structure:

/input
├── image00001.png
├── image00001.json
├── image00002.png
├── image00002.json
└── ...

For the structure of the image '.json' files, refer to the 'labelme' project: https://github.com/wkentaro/labelme
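
As a rough orientation, the fields of a labelme annotation file that matter for this conversion can be read as in the sketch below. The sample values (labels, coordinates, image size) are invented for illustration, real files contain further keys, and list_shapes is a hypothetical helper, not part of labelme or of convert.py.

```python
import json

# Trimmed-down, made-up example of a labelme annotation file.
SAMPLE = """
{
  "shapes": [
    {"label": "abcd", "points": [[12.0, 30.0], [118.0, 64.0]], "shape_type": "rectangle"},
    {"label": "efgh", "points": [[12.0, 80.0], [140.0, 112.0]], "shape_type": "rectangle"}
  ],
  "imagePath": "image00001.png",
  "imageHeight": 480,
  "imageWidth": 640
}
"""


def list_shapes(annotation):
    """Return (label, points) for every annotated shape."""
    return [(shape["label"], shape["points"]) for shape in annotation["shapes"]]


annotation = json.loads(SAMPLE)
for label, points in list_shapes(annotation):
    print(label, points)
```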

Output data structure:

/output
└── /images
    #   [filename]_[idx].[ext]
    ├── image00001_00001.png
    ├── image00001_00002.png
    ├── image00002_00001.png
    ├── image00002_00002.png
    ├── ...
    └── labels.txt

Label file structure:

{filename}\t{label}\n

image00001_00001.png	abcd
image00001_00002.png	efgh
image00002_00001.png	ijkl
image00002_00002.png	mnop
...
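
A training script could load the label file back along these lines. This is a sketch that assumes UTF-8 encoding and exactly one tab between the file name and the label; parse_label_line and read_labels are hypothetical names.

```python
def parse_label_line(line):
    """Split one '{filename}\\t{label}' line into (filename, label)."""
    # Split on the first tab only, so the label itself may contain tabs.
    filename, label = line.rstrip("\n").split("\t", 1)
    return filename, label


def read_labels(label_path):
    """Read the whole label file into a list of (filename, label) pairs."""
    with open(label_path, encoding="utf-8") as f:
        return [parse_label_line(line) for line in f if line.strip()]
```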
