A diverse collection of high-definition human speaking videos.

Algerian Arabic Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Algerian Arabic speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Multi-Modal Datasets

Visual Speech Datasets

Visual Speech Model

AR/VR Apps

Lip Reading Models

Emotion Recognition

Generative AI

Saudi Arabian Arabic Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Saudi Arabian Arabic speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

American English Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native American English speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

German Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native German speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Indian English Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Indian English speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

French Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native French speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Gujarati Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Gujarati speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Korean Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Korean speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

British English Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native British English speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

European Portuguese Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native European Portuguese speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Hindi Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Hindi speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Japanese Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Japanese speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Spain Spanish Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Spain Spanish speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Tamil Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Tamil speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Filipino Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Filipino speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Telugu Visual Speech Dataset

This Multi-Modal dataset offers a rich collection of unscripted, high-definition videos featuring native Telugu speakers responding to open-ended questions. The videos capture a wide range of emotions and come with detailed metadata for comprehensive analysis.

Visual Speech Datasets

I want to explore

Multimodal Datasets!

Type

Filter

Languages