Go back

Odia (India) Scripted Monologue Speech Dataset for Delivery & Logistics Domain

The audio dataset comprises scripted monologue speech data in the Delivery & Logistics domain, featuring native Odia speakers from India. It includes speech data, detailed metadata, and accurate transcriptions.

Total Volume

6000+ prompts

Last updated

July 2024

Number of participants

60+

Get this Speech Dataset

Delivery & Logistics domain scripted monologue speech dataset in Odia (India)

Request Custom Collection

About this Off-the-shelf Speech Dataset

Introduction

Welcome to the Odia Scripted Monologue Speech Dataset for the Delivery & Logistics Domain. This meticulously curated dataset is designed to advance the development of Odia language speech recognition models, particularly for the Delivery & Logistics industry.

Speech Data

This training dataset comprises over 6,000 high-quality scripted prompt recordings in Odia. These recordings cover various topics and scenarios relevant to the Delivery & Logistics domain, designed to build robust and accurate customer service speech technology.

•Participant Diversity:

•

Speakers: 60 native Odia speakers from different regions of India.

•

Regions: Ensures a balanced representation of Odia accents, dialects, and demographics.

•

Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.

•Recording Details:

•

Recording Nature: Audio recordings of scripted prompts/monologues.

•

Audio Duration: Average duration of 5 to 30 seconds per recording.

•

Formats: WAV format with mono channels, a bit depth of 16 bits, and sample rates of 8 kHz and 16 kHz.

•

Environment: Recordings are conducted in quiet settings without background noise and echo.

•

Topic Diversity: The dataset encompasses a wide array of topics and conversational scenarios to ensure comprehensive coverage of the Delivery & Logistics sector. Topics include:

•Customer Service Interactions

•Order Management

•Shipping and Delivery

•Product and Service Inquiries

•Returns and Refunds

•Technical Support

•General Information and Advice

•Regulatory and Compliance Queries

•Service Upgrades and Changes

•Domain Specific Statements

•

Other Elements: To enhance realism and utility, the scripted prompts incorporate various elements commonly encountered in Delivery & Logistics interactions:

•

Names: Region-specific names of males and females in various formats.

•

Addresses: Region-specific addresses in different spoken formats.

•

Dates & Times: Inclusion of date and time in various delivery and logistics contexts, such as delivery dates and pick-up times.

•

Order Numbers: Specific order numbers and tracking codes relevant to delivery and logistics operations.

•

Quantities & Weights: Various quantities and weights related to shipments, package contents, and logistical requirements.

•

Logistics Providers: Names of delivery companies, courier services, and logistics providers.

Each scripted prompt is crafted to reflect real-life scenarios encountered in the Delivery & Logistics domain, ensuring applicability in training robust natural language processing and speech recognition models.

Transcription Data

In addition to high-quality audio recordings, the dataset includes meticulously prepared text files with verbatim transcriptions of each audio file. These transcriptions are essential for training accurate and robust speech recognition models.

•

Content: Each text file contains the exact scripted prompt corresponding to its audio file, ensuring consistency.

•

Format: Transcriptions are provided in plain text (.TXT) format, with files named to match their associated audio files for easy reference.

•

Quality: All transcriptions are verified for accuracy and consistency by native Odia transcribers.

Metadata

The dataset provides comprehensive metadata for each audio recording and participant:

•

Participant Metadata: Unique identifier, age, gender, country, state, and dialect.

•

Other Metadata: Recording transcript, recording environment, device details, sample rate, bit depth, and file format.

This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of Odia language speech recognition models.

Usage and Applications

This dataset is a versatile resource for various applications within speech recognition, natural language processing, and AI-driven conversational technologies.

•

Speech Recognition Model Training: High-quality audio recordings and precise transcriptions for training and fine-tuning Odia speech recognition models.

•

Voice Synthesis: The diverse and high-quality audio data can train generative AI models for creating synthetic voices.

•

Voice Assistants: Ideal for training voice assistants tailored to the Delivery & Logistics domain.

•

Chatbots: Transcription data can train conversational models, enabling chatbots to respond to customer queries effectively.

•

Entity Recognition: Sentences include names, dates, currencies, and other domain-specific entities for training NLP models for named entity recognition (NER) tasks.

•

Language Understanding : Improve language understanding applications like sentiment analysis and topic modeling within the Delivery & Logistics sector.