English (India) Call Center Speech Dataset for Travel

The audio dataset includes call center conversations in Travel, featuring native English speakers from India, with detailed metadata and accurate transcriptions.

Category

Unscripted Call Center Conversations

Total Volume

30 Speech Hours

Last updated

July 2023

Number of participants

60

Get this Speech Dataset

Get Dataset Btn

About this Off-the-shelf Speech Dataset

About Gradiet Line

What’s Included

Welcome to the English Language Call Center Speech Dataset for the Travel domain. It is a specialized and comprehensive collection of voice data designed to enhance the development of call center speech recognition models specifically for the Travel industry.

With high-quality call center audio recordings, detailed metadata, and accurate transcriptions, it empowers researchers and developers to enhance natural language processing, conversational AI, and generative voice AI algorithms in the Travel domain. Moreover, it facilitates the creation of sophisticated voice assistants and voice bots tailored to the unique linguistic nuances found in the English language spoken in India.

Speech Data:

This training dataset comprises 30 hours of call center audio recordings covering various topics and scenarios related to the Travel domain, to build robust and accurate customer service speech technology.

To curate realistic call center interactions, we collaborated with a diverse network of 60 expert native English speakers from different states/provinces of India. This collaborative effort ensures a balanced representation of Indian accents, dialects, and demographics, promoting inclusivity and reducing biases in the dataset.

Each audio recording captures the essence of unscripted and spontaneous conversations between call center agents and customers, with an average duration ranging from 5 to 15 minutes per call. The dataset includes both inbound and outbound calls, covering scenarios such as inquiries, promotional offers, complaints, technical support, and more. Additionally, the dataset contains call center conversations with both positive and negative outcomes, providing a diverse and realistic dataset.

The speech data is available in WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 kHz, ensuring high-quality audio for accurate analysis. The recording environment is generally quiet, without background noise and echo.

Metadata:

In addition to the audio recordings, our dataset provides comprehensive metadata for each participant. This includes the participant’s age, gender, country, state, and dialect. Additionally, it includes metadata like domain, topic, call type, outcome, bit depth, and sample rate for each conversation.

The metadata serves as a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English language call center speech recognition models for the Travel domain.

Transcription:

To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. The transcriptions capture speaker-wise transcription with time-coded segmentation along with non-speech labels and tags, covering both the agent and customer conversations.

These ready-to-use transcriptions accelerate the development of Travel call center conversational AI and ASR models for the English language.

Updates and Customization:

We understand the importance of collecting data in various environments to build robust ASR models. Therefore, our call center voice dataset is regularly updated with new audio data captured in diverse real-world conditions.

If you require a custom training dataset with specific environmental conditions, we can accommodate your request. We can provide voice data with customized sample rates ranging from 8kHz to 48kHz, allowing you to fine-tune your models for different audio recording setups. Additionally, we can also customize the transcription following your specific guidelines and requirements, to further support your ASR development process.

License:

This Travel call center audio dataset is created by FutureBeeAI and is available for commercial use!

Conclusion:

Whether you are training or fine-tuning speech recognition models, advancing NLP algorithms, or building state-of-the-art voice assistants to improve customer experiences in the Travel sector, our dataset serves as a trusted resource to meet your goals

Use Cases

Use of speech data for Automatic Speech Recognition

ASR

Use of speech data in Conversational AI

Conversational AI

Use of speech data for Chatbot & voicebot creation

Chatbot

Use of speech data in Language Modeling

Language Modelling

Use of speech data in Text-into-speech

TTS

Speech data usecase in Speech Analytics

Speech Analytics

Dataset Sample(s)

Sample Line

ATTRIBUTES

Channel 1Channel 2Format
Female(31)Female(32)wav, json

TRANSCRIPTION

LABELSTARTENDCHANNELTRANSCRIPT
Speech1.3752.750Speaker 1Hello Futurebee.
Speech1.9753.575Speaker 2Hello Futurebee.
Noise4.1254.825--
Speech5.5256.349Speaker 1Hello ma'am.
Speech6.7249.698Speaker 2Hello good morning. Welcome to <initial>SJ</initial> Holidays.
Noise10.12110.570--
Speech11.87513.000Speaker 2How can I help you?
Speech13.55016.750Speaker 1Ma'am I planning for trip with my husband.
Speech18.72519.600Speaker 2Okay.
Speech19.16722.568Speaker 1So, can you give me some suggestions to plan the trip?
Speech22.71323.789Speaker 1can you guide me?
Speech25.73431.152Speaker 2Yeah, yes of course. But I need some information first. How many days trip you have plan?
Speech32.37134.298Speaker 1I am planning to go for three days.
Speech35.63939.963Speaker 2Three days okay. what type of place you would like to go?
Noise38.46138.961--
Speech40.31042.536Speaker 2[filler]Talking about climate.
Noise40.98142.082--
Speech42.78344.283Speaker 2[filler]Especially.
Speech45.19547.896Speaker 1I need a cool climate not very hot.
Speech49.00957.134Speaker 2Okay,[filler] then I suggest you go to Wayanad. That's a there is a beautiful place in Kerala. It's a chill climate.
Speech57.42658.426Speaker 2[filler]
Noise58.17059.045--
Speech58.64360.368Speaker 2You hear about that place?
Speech61.63966.614Speaker 1Yeah. I have heard and I heard it is near to Bangalore or Kerala. I don't know.
Speech68.08770.061Speaker 2[filler]Yeah. It's a Kerala only.
Speech70.92472.224Speaker 1Okay okay.
Speech74.14175.242Speaker 2[filler]
Speech74.31082.459Speaker 1So, can you say me how to go there? Will you arrange any travels or how your <initial>SJ</initial> Holidays working? how?
Speech79.96881.394Speaker 2[filler]
Speech83.95692.456Speaker 2Yeah, yes ma'am. [filler] We have three packages. [filler] Two days package and four package and one days package.
Speech88.77289.447Speaker 1Okay.
Speech92.68894.765Speaker 2[filler]Which one you want?
Speech93.62394.224Speaker 1[filler]
Speech96.76998.644Speaker 1I prefer four days package.
Speech99.090106.215Speaker 2Four days package. [filler] Then you don't worry. Our agency will help you. Okay? [filler] They will
Speech105.774106.447Speaker 1Okay.
Speech106.340110.864Speaker 2[filler]guide you and they will book the hotels
Speech111.028114.727Speaker 2and what are the tourist places are there in the Wayanad.
Speech114.944117.545Speaker 2They will explain to you.
Speech118.394123.870Speaker 1Okay. Now can you give me some explanation? How that trip will be? What are the sites seen there?
Speech119.977120.554Speaker 2Okay.
Speech125.519126.920Speaker 2[filler]Yeah. Of course.
Speech127.209127.986Speaker 2[filler]
Speech128.449139.377Speaker 2There [filler] more [filler] more site visiting places are there. One is zipline. It's a longest zipline [filler] place in Kerala only.
Speech140.389143.764Speaker 2And then one trekking, trekking place is there.
Speech140.734141.484Speaker 1Okay.
Speech144.133153.961Speaker 2[filler]Another one is water falls,[filler] main [filler] Soochipara water fall is there. It is really very chill place.
Speech145.080145.532Speaker 1Okay.
Noise149.116149.616--
Speech154.257161.979Speaker 2[filler]You, you in that place you should walk for nearly two to three kilometers.
Speech155.068155.794Speaker 1Okay.
Speech163.532167.556Speaker 1Okay. So, I have to walk two to three kilometers to reach the Soochipara falls.
Speech167.430169.430Speaker 2Reach there yes yes yes.
Speech169.770173.169Speaker 1So, there is no vehicles only the way mode is walking?
Speech172.389180.264Speaker 2No no no. Only the way you have you walk only. Because stones and path is very narrow.
Speech177.133177.610Speaker 1Okay.
Speech180.770184.371Speaker 2So, vehicles are not allowed in that place.
Speech181.627182.430Speaker 1Okay.

TRANSCRIPTION

TIMETRANSCRIPT
1.375
2.750
Hello Futurebee.
1.975
3.575
Hello Futurebee.
4.125
4.825
-
5.525
6.349
Hello ma'am.
6.724
9.698
Hello good morning. Welcome to <initial>SJ</initial> Holidays.
10.121
10.570
-
11.875
13.000
How can I help you?
13.550
16.750
Ma'am I planning for trip with my husband.
18.725
19.600
Okay.
19.167
22.568
So, can you give me some suggestions to plan the trip?
22.713
23.789
can you guide me?
25.734
31.152
Yeah, yes of course. But I need some information first. How many days trip you have plan?
32.371
34.298
I am planning to go for three days.
35.639
39.963
Three days okay. what type of place you would like to go?
38.461
38.961
-
40.310
42.536
[filler]Talking about climate.
40.981
42.082
-
42.783
44.283
[filler]Especially.
45.195
47.896
I need a cool climate not very hot.
49.009
57.134
Okay,[filler] then I suggest you go to Wayanad. That's a there is a beautiful place in Kerala. It's a chill climate.
57.426
58.426
[filler]
58.170
59.045
-
58.643
60.368
You hear about that place?
61.639
66.614
Yeah. I have heard and I heard it is near to Bangalore or Kerala. I don't know.
68.087
70.061
[filler]Yeah. It's a Kerala only.
70.924
72.224
Okay okay.
74.141
75.242
[filler]
74.310
82.459
So, can you say me how to go there? Will you arrange any travels or how your <initial>SJ</initial> Holidays working? how?
79.968
81.394
[filler]
83.956
92.456
Yeah, yes ma'am. [filler] We have three packages. [filler] Two days package and four package and one days package.
88.772
89.447
Okay.
92.688
94.765
[filler]Which one you want?
93.623
94.224
[filler]
96.769
98.644
I prefer four days package.
99.090
106.215
Four days package. [filler] Then you don't worry. Our agency will help you. Okay? [filler] They will
105.774
106.447
Okay.
106.340
110.864
[filler]guide you and they will book the hotels
111.028
114.727
and what are the tourist places are there in the Wayanad.
114.944
117.545
They will explain to you.
118.394
123.870
Okay. Now can you give me some explanation? How that trip will be? What are the sites seen there?
119.977
120.554
Okay.
125.519
126.920
[filler]Yeah. Of course.
127.209
127.986
[filler]
128.449
139.377
There [filler] more [filler] more site visiting places are there. One is zipline. It's a longest zipline [filler] place in Kerala only.
140.389
143.764
And then one trekking, trekking place is there.
140.734
141.484
Okay.
144.133
153.961
[filler]Another one is water falls,[filler] main [filler] Soochipara water fall is there. It is really very chill place.
145.080
145.532
Okay.
149.116
149.616
-
154.257
161.979
[filler]You, you in that place you should walk for nearly two to three kilometers.
155.068
155.794
Okay.
163.532
167.556
Okay. So, I have to walk two to three kilometers to reach the Soochipara falls.
167.430
169.430
Reach there yes yes yes.
169.770
173.169
So, there is no vehicles only the way mode is walking?
172.389
180.264
No no no. Only the way you have you walk only. Because stones and path is very narrow.
177.133
177.610
Okay.
180.770
184.371
So, vehicles are not allowed in that place.
181.627
182.430
Okay.

Dataset Demographics

Details Headline

Language

English

Language code

en-In

Country

India

Accents

Chandigarh,...more

Gender Distribution

M: 55, F: 45

Age Group

18-70

Audio File Details

Details Headline

Environment

Silent, Noisy

Bit Depth

16 bit

Format

wav

Sample rate

8khz

Channel

Dual separate channel

Audio file duration

5-15 minutes

Download Sample Speech Dataset Now!

Explore Audio Data, Metadata and Transcription to get more clarity and hands on experience of this dataset.

Download Free Dataset

Audio Download Btn
Audio Promp Bg
Audio Promp Bg

Start your AI/ML model creation journey with FutureBeeAI!

Contact Us

Audio Arrow BtnAudio Arrow Btn Black
Audio Promp 2 Bg