Skip to content

Nexdata-AI/101-People-4538-Images-Japanese-Handwriting-OCR-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

101-People-4538-Images-Japanese-Handwriting-OCR-Data

Description

101 People - 4,538 Images Japanese Handwriting OCR Data. The text carrier is A4 paper. The dataset content includes social livelihood, entertainment, tour, sport, movie, composition and other fields. For annotation, character-level rectangular bounding box annotation and text transcription were adopted. The dataset can be used for tasks such as Japanese handwriting OCR.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1087?source=Github

Data size

101 people, 4,538 images

Collecting environment

A4 paper

Device

scanner

Photographic angle

eye-level angle

Data format

the image data format is .jpg, the annotation file format is .json

Data content

including social livelihood, entertainment, tour, sport, movie, composition and other fields

Annotation content

character-level rectangular bounding box annotation and text transcription

Accuracy

the error bound of each vertex of rectangular bounding box is within 2 pixels, which is a qualified annotation, the accuracy of bounding boxes is not less than 98%; the characters transcription accuracy is not less than 98%

Licensing Information

Commercial License