156-Hours-Lip-Sync-Multimodal-Video-Data

Description

Speech audio and matching lip-movement video recorded from 250 people on multiple devices simultaneously, aligned with high accuracy using a pulse signal. The data can be used for research on multimodal learning algorithms in the speech and image fields.

For more details, please refer to the link: https://www.nexdata.ai/datasets/996?source=Github

Format

Video: mp4 format, 1,280 × 720. Audio: wav format, 16 kHz, 16-bit, mono
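A minimal sketch for checking that a downloaded audio file matches the format listed above, using Python's standard `wave` module. It assumes the stated sample rate means 16 kHz (16,000 Hz). The function name `check_wav_format` and the constants are illustrative and not part of the dataset.

```python
import wave

# Expected audio parameters, taken from the Format section.
EXPECTED_RATE_HZ = 16000   # assumption: the stated rate means 16 kHz
EXPECTED_SAMPLE_WIDTH = 2  # 16 bit = 2 bytes per sample
EXPECTED_CHANNELS = 1      # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the stated format."""
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems
```

Calling `check_wav_format` on one of the sample recordings, for example `44-1_7.wav`, returns an empty list when the file matches the stated format and a list of human-readable mismatches otherwise.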

Recording Environment

A quiet, sunlit room is used to simulate daytime outdoor driving scenes. Signal-to-noise ratio: 20~25 dB

Recording Scenes

Divided into main scenes and sub-scenes according to sunlight intensity

Recording Content

Short signals and spoken sentences

Recording People

250 Chinese speakers, gender-balanced

Recording Device

Camera, HD microphone, Audio board

Recording Angle

Video is recorded from six angles: front face, single side face, looking up, looking down, side face looking down, and side face looking up. Close-range and distant audio are recorded at the same time.

Language

Mandarin

Application Scenario

Lip language recognition

Accuracy

Sentence accuracy is not below 95%

Licensing Information

Commercial License

Sample Files

11-1_2.mp4
18-1_6.mp4
23-1_8.mp4
31-1_2.mp4
39-1_7.mp4
44-1_7.wav
5-1_4.mp4
71-1_17.wav
8-1_6.wav
99-1_18.wav
