1535-Hours-Mixed-Speech-with-Chinese-and-English-Data-by-Mobile-Phone

Description

The data is recorded by 3972 Chinese native speakers with accents covering seven major dialect areas. The recorded text is a mixture of Chinese and English sentences, covering general scenes and human-computer interaction scenes. It is rich in content and accurate in transcription. It can be used for improving the recognition effect of the speech recognition system on Chinese-English mixed reading speech.

For more details, please refer to the link: https://www.nexdata.ai/datasets/939?source=Github

Format

16kHz, 16bit, uncompressed wav, mono channel

Recording environment

quiet indoor environment, without echo

Recording content (read speech)

general category; human-machine interaction category

Demographics

3,972 speakers totally, with 43% males and 57% females, and 68% speakers of all are in the age group of 12-25, 31% speakers of all in the age group of 26-45, 1% speakers of all are in the age group of 46-60

Device

Android mobile phone, iPhone;

Language

mandarin; English

Application scenarios

speech recognition; voiceprint recognition.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
G00001S1002.txt		G00001S1002.txt
G00001S1002.wav		G00001S1002.wav
G05003S1010.txt		G05003S1010.txt
G05003S1010.wav		G05003S1010.wav
G05003S2309.txt		G05003S2309.txt
G05003S2309.wav		G05003S2309.wav
G25216S1252.txt		G25216S1252.txt
G25216S1252.wav		G25216S1252.wav
G25216S2450.txt		G25216S2450.txt
G25216S2450.wav		G25216S2450.wav
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

1535-Hours-Mixed-Speech-with-Chinese-and-English-Data-by-Mobile-Phone

Description

Format

Recording environment

Recording content (read speech)

Demographics

Device

Language

Application scenarios

Licensing Information

About

Releases

Packages

Contributors 2

Nexdata-AI/1535-Hours-Mixed-Speech-with-Chinese-and-English-Data-by-Mobile-Phone

Folders and files

Latest commit

History

Repository files navigation

1535-Hours-Mixed-Speech-with-Chinese-and-English-Data-by-Mobile-Phone

Description

Format

Recording environment

Recording content (read speech)

Demographics

Device

Language

Application scenarios

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages