High Quality Speech / Voice / Audio Datasets

Explore the collection of different types of high quality and diverse speech datasets including general conversations, call center conversations, wake words, voice commands, and scripted monologues across languages and industries. Leverage these ready to use audio datasets to train and fine tune your automatic speech recognition (ASR) and conversational AI models. These audio datasets includes high quality speech data, accurate transcription and detailed metadata.

Filter IconFilter Close

Filter

Clear

Apply

filter-mobile-icon

Arabic Speech Datasets

Explore ready-to-deploy audio datasets in Arabic language.

15+ Datasets

Bahasa Speech Datasets

Explore ready-to-deploy audio datasets in Bahasa language.

15+ Datasets

Bengali Speech Datasets

Explore ready-to-deploy audio datasets in Bengali language.

15+ Datasets

Bulgarian Speech Datasets

Explore ready-to-deploy audio datasets in Bulgarian language.

15+ Datasets

Danish Speech Datasets

Explore ready-to-deploy audio datasets in Danish language.

15+ Datasets

Dutch Speech Datasets

Explore ready-to-deploy audio datasets in Dutch language.

15+ Datasets

English Speech Datasets

Explore ready-to-deploy audio datasets in English language.

15+ Datasets

Finnish Speech Datasets

Explore ready-to-deploy audio datasets in Finnish language.

15+ Datasets

French Speech Datasets

Explore ready-to-deploy audio datasets in French language.

15+ Datasets

German Speech Datasets

Explore ready-to-deploy audio datasets in German language.

15+ Datasets

Gujarati Speech Datasets

Explore ready-to-deploy audio datasets in Gujarati language.

15+ Datasets

Hindi Speech Datasets

Explore ready-to-deploy audio datasets in Hindi language.

15+ Datasets

Italian Speech Datasets

Explore ready-to-deploy audio datasets in Italian language.

15+ Datasets

Japanese Speech Datasets

Explore ready-to-deploy audio datasets in Japanese language.

15+ Datasets

Kannada Speech Datasets

Explore ready-to-deploy audio datasets in Kannada language.

15+ Datasets

Korean Speech Datasets

Explore ready-to-deploy audio datasets in Korean language.

15+ Datasets

Malayalam Speech Datasets

Explore ready-to-deploy audio datasets in Malayalam language.

15+ Datasets

Mandarin Speech Datasets

Explore ready-to-deploy audio datasets in Mandarin language.

15+ Datasets

Marathi Speech Datasets

Explore ready-to-deploy audio datasets in Marathi language.

15+ Datasets

Norwegian Speech Datasets

Explore ready-to-deploy audio datasets in Norwegian language.

15+ Datasets

Odia Speech Datasets

Explore ready-to-deploy audio datasets in Odia language.

15+ Datasets

Polish Speech Datasets

Explore ready-to-deploy audio datasets in Polish language.

15+ Datasets

Portuguese Speech Datasets

Explore ready-to-deploy audio datasets in Portuguese language.

15+ Datasets

Punjabi Speech Datasets

Explore ready-to-deploy audio datasets in Punjabi language.

15+ Datasets

Russian Speech Datasets

Explore ready-to-deploy audio datasets in Russian language.

15+ Datasets

Spanish Speech Datasets

Explore ready-to-deploy audio datasets in Spanish language.

15+ Datasets

Swedish Speech Datasets

Explore ready-to-deploy audio datasets in Swedish language.

15+ Datasets

Filipino Speech Datasets

Explore ready-to-deploy audio datasets in Filipino language.

15+ Datasets

Tamil Speech Datasets

Explore ready-to-deploy audio datasets in Tamil language.

15+ Datasets

Telugu Speech Datasets

Explore ready-to-deploy audio datasets in Telugu language.

15+ Datasets

Turkish Speech Datasets

Explore ready-to-deploy audio datasets in Turkish language.

15+ Datasets

Ukrainian Speech Datasets

Explore ready-to-deploy audio datasets in Ukrainian language.

15+ Datasets

Urdu Speech Datasets

Explore ready-to-deploy audio datasets in Urdu language.

15+ Datasets

Train & Fine-tune Your ASR Models with High-quality Multilingual Datasets!

Collect custom dataset with crowd community