Skip to content

Nexdata-AI/849-Hours-Mandarin-Interactive-Speech-Data-by-Mobile-Phone

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

849-Hours-Mandarin-Interactive-Speech-Data-by-Mobile-Phone

Description

Mandarin home interaction mobile phone language audio data (Far-field home collected audio data subset), with duration of 849 hours, recorded in the real home scene; content focuses on home instructions, functional assistants and wake-up words, specially designed for smart home, more close to data application scenes.

For more details, please refer to the link: https://www.nexdata.ai/datasets/981?source=Github

Format

48kHz, 16bit, uncompressed wav, mono channel

Recording environment

quiet indoor environment

Recording content (read speach)

common used sentences; home environment related commands; functional assistant; wake up words; numbers.

Speaker

998 people; 54% of which are female; about 800 utterances per speaker.

Device

Android mobile phone

Language

Mandarin

Accuracy rate

98%

Application scenarios

speech recognition, voiceprint recognition

Licensing Information

Commercial License