English (US) Call Center Speech Dataset for Travel

The audio dataset includes call center conversations in Travel, featuring native English speakers from US, with detailed metadata and accurate transcriptions.

Category

Unscripted Call Center Conversations

Total Volume

30 Speech Hours

Last updated

July 2023

Number of participants

60

Get this Speech Dataset

Get Dataset Btn

About this Off-the-shelf Speech Dataset

About Gradiet Line

What’s Included

Welcome to the English Language Call Center Speech Dataset for the Travel domain. It is a specialized and comprehensive collection of voice data designed to enhance the development of call center speech recognition models specifically for the Travel industry.

With high-quality call center audio recordings, detailed metadata, and accurate transcriptions, it empowers researchers and developers to enhance natural language processing, conversational AI, and generative voice AI algorithms in the Travel domain. Moreover, it facilitates the creation of sophisticated voice assistants and voice bots tailored to the unique linguistic nuances found in the English language spoken in United States.

Speech Data:

This training dataset comprises 30 hours of call center audio recordings covering various topics and scenarios related to the Travel domain, to build robust and accurate customer service speech technology.

To curate realistic call center interactions, we collaborated with a diverse network of 60 expert native English speakers from different states/provinces of United States. This collaborative effort ensures a balanced representation of US accents, dialects, and demographics, promoting inclusivity and reducing biases in the dataset.

Each audio recording captures the essence of unscripted and spontaneous conversations between call center agents and customers, with an average duration ranging from 5 to 15 minutes per call. The dataset includes both inbound and outbound calls, covering scenarios such as inquiries, promotional offers, complaints, technical support, and more. Additionally, the dataset contains call center conversations with both positive and negative outcomes, providing a diverse and realistic dataset.

The speech data is available in WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 kHz, ensuring high-quality audio for accurate analysis. The recording environment is generally quiet, without background noise and echo.

Metadata:

In addition to the audio recordings, our dataset provides comprehensive metadata for each participant. This includes the participant’s age, gender, country, state, and dialect. Additionally, it includes metadata like domain, topic, call type, outcome, bit depth, and sample rate for each conversation.

The metadata serves as a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English language call center speech recognition models for the Travel domain.

Transcription:

To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. The transcriptions capture speaker-wise transcription with time-coded segmentation along with non-speech labels and tags, covering both the agent and customer conversations.

These ready-to-use transcriptions accelerate the development of Travel call center conversational AI and ASR models for the English language.

Updates and Customization:

We understand the importance of collecting data in various environments to build robust ASR models. Therefore, our call center voice dataset is regularly updated with new audio data captured in diverse real-world conditions.

If you require a custom training dataset with specific environmental conditions, we can accommodate your request. We can provide voice data with customized sample rates ranging from 8kHz to 48kHz, allowing you to fine-tune your models for different audio recording setups. Additionally, we can also customize the transcription following your specific guidelines and requirements, to further support your ASR development process.

License:

This Travel call center audio dataset is created by FutureBeeAI and is available for commercial use!

Conclusion:

Whether you are training or fine-tuning speech recognition models, advancing NLP algorithms, or building state-of-the-art voice assistants to improve customer experiences in the Travel sector, our dataset serves as a trusted resource to meet your goals

Use Cases

Use of speech data for Automatic Speech Recognition

ASR

Use of speech data in Conversational AI

Conversational AI

Use of speech data for Chatbot & voicebot creation

Chatbot

Use of speech data in Language Modeling

Language Modelling

Use of speech data in Text-into-speech

TTS

Speech data usecase in Speech Analytics

Speech Analytics

Dataset Sample(s)

Sample Line

ATTRIBUTES

Channel 1Channel 2Format
Male(29)Female(24)wav, json

TRANSCRIPTION

LABELSTARTENDCHANNELTRANSCRIPT
Noise0.0660.313--
Speech2.1923.192Speaker 1Hello Futurebee.
Speech4.1375.227Speaker 2Hello Futurebee.
Speech7.89111.480Speaker 1Hi I am <PII>Michael</PII> With, Futurebee Travel. Whom I speaking with today?
Speech12.77514.214Speaker 2Hi this is <PII>Mercedes</PII>.
Speech15.28216.108Speaker 1Hi <PII>Mercedes</PII>.
Speech16.51018.318Speaker 1[filler]how can I help you today?
Speech19.67331.902Speaker 2[filler]I just wanted to reach out to your company because I was planning on going on a little vacation. [filler] I was thinking of going to Thailand but honestly I just don't know how
Speech32.83136.819Speaker 2will the time or even the know how to book it myself.
Speech37.53944.517Speaker 2So I wanted to find out some of your guys pricing and the services that you offer for [filler] as a travel agency.
Noise45.50545.673--
Speech47.05157.240Speaker 1Okay well you have called the right place [filler] and we would be ecstatic to help you. [filler] real quick. Do you know where in Chiang Mai you want to [filler] do you, do you know where in
Speech57.98159.234Speaker 1Thailand you want to visit?
Speech61.31766.998Speaker 2[filler]well I was thinking about maybe one of the like coastal towns but
Noise67.03467.287--
Speech67.70970.328Speaker 2I heard that the mountains are really
Speech70.75773.040Speaker 2under rated and thought that that might be
Speech73.74581.197Speaker 2might be a bit different thing than my usual like beach get away to, you know, I go to Mexico sometimes so I thought maybe switch it up a bit.
Speech82.19183.492Speaker 2So somehwere with the mountains.
Speech84.23984.769Speaker 1Okay.
Speech85.36494.034Speaker 1Yeah trying to, trying to branch out. [filler] the reason I actually I said Chiang Mai, as soon as you said Thailand, I thought of Chiang Mai. Chiang Mai is one of my favorite places
Speech94.504103.450Speaker 1in all of Thailand. Honestly in all of the world. [filler] so, if you are looking for the mountains, Thailand might be a really, Chiang Mai might be a really good place to go.
Speech104.099106.498Speaker 1[filler]do you want me to look up some information on that?
Speech108.421109.763Speaker 2Yeah sure go ahead. Very fine.
Noise110.131110.787--
Speech111.558121.625Speaker 1Okay. So while I am getting a few options for you, [filler] could you help me out just, what is your [filler] what is your address? I want to see what airports you close to.
Noise123.269123.673--
Speech123.703128.593Speaker 2[filler]well I am living in San Antonio, Texas right now [filler].
Speech128.967133.563Speaker 2So I would probably say the San Antonio airport could be the closest.
Speech133.895136.752Speaker 2But I often use the Austin
Speech137.131138.377Speaker 2airport as well
Speech139.063139.830Speaker 2[filler]
Speech140.895142.300Speaker 2and if need be
Speech142.770147.179Speaker 2I could even travel to [filler] [noise] to Houston or to Dallas
Noise144.093144.342--
Speech147.877152.197Speaker 2if that really like made a, made a big impact on the price of flights or something like that.
Speech150.545150.973Speaker 1Okay
Speech154.324160.015Speaker 1Its very possible it could. So I am glad you are able to tell me that you [filler] have that as an availability.[filler]
Speech160.497163.149Speaker 1Do you have a time of year you would like to travel?
Speech164.407169.111Speaker 2Well I just, we just entered the new year right? So my [filler]
Noise169.460169.604--
Speech170.087173.533Speaker 2<initial>PTL</initial> for, for my job just renewed.
Speech174.306176.877Speaker 2[filler]I don't really want to use it
Speech178.117180.425Speaker 2[noise] I don't really want to use it all at the beginning.

TRANSCRIPTION

TIMETRANSCRIPT
0.066
0.313
-
2.192
3.192
Hello Futurebee.
4.137
5.227
Hello Futurebee.
7.891
11.480
Hi I am <PII>Michael</PII> With, Futurebee Travel. Whom I speaking with today?
12.775
14.214
Hi this is <PII>Mercedes</PII>.
15.282
16.108
Hi <PII>Mercedes</PII>.
16.510
18.318
[filler]how can I help you today?
19.673
31.902
[filler]I just wanted to reach out to your company because I was planning on going on a little vacation. [filler] I was thinking of going to Thailand but honestly I just don't know how
32.831
36.819
will the time or even the know how to book it myself.
37.539
44.517
So I wanted to find out some of your guys pricing and the services that you offer for [filler] as a travel agency.
45.505
45.673
-
47.051
57.240
Okay well you have called the right place [filler] and we would be ecstatic to help you. [filler] real quick. Do you know where in Chiang Mai you want to [filler] do you, do you know where in
57.981
59.234
Thailand you want to visit?
61.317
66.998
[filler]well I was thinking about maybe one of the like coastal towns but
67.034
67.287
-
67.709
70.328
I heard that the mountains are really
70.757
73.040
under rated and thought that that might be
73.745
81.197
might be a bit different thing than my usual like beach get away to, you know, I go to Mexico sometimes so I thought maybe switch it up a bit.
82.191
83.492
So somehwere with the mountains.
84.239
84.769
Okay.
85.364
94.034
Yeah trying to, trying to branch out. [filler] the reason I actually I said Chiang Mai, as soon as you said Thailand, I thought of Chiang Mai. Chiang Mai is one of my favorite places
94.504
103.450
in all of Thailand. Honestly in all of the world. [filler] so, if you are looking for the mountains, Thailand might be a really, Chiang Mai might be a really good place to go.
104.099
106.498
[filler]do you want me to look up some information on that?
108.421
109.763
Yeah sure go ahead. Very fine.
110.131
110.787
-
111.558
121.625
Okay. So while I am getting a few options for you, [filler] could you help me out just, what is your [filler] what is your address? I want to see what airports you close to.
123.269
123.673
-
123.703
128.593
[filler]well I am living in San Antonio, Texas right now [filler].
128.967
133.563
So I would probably say the San Antonio airport could be the closest.
133.895
136.752
But I often use the Austin
137.131
138.377
airport as well
139.063
139.830
[filler]
140.895
142.300
and if need be
142.770
147.179
I could even travel to [filler] [noise] to Houston or to Dallas
144.093
144.342
-
147.877
152.197
if that really like made a, made a big impact on the price of flights or something like that.
150.545
150.973
Okay
154.324
160.015
Its very possible it could. So I am glad you are able to tell me that you [filler] have that as an availability.[filler]
160.497
163.149
Do you have a time of year you would like to travel?
164.407
169.111
Well I just, we just entered the new year right? So my [filler]
169.460
169.604
-
170.087
173.533
<initial>PTL</initial> for, for my job just renewed.
174.306
176.877
[filler]I don't really want to use it
178.117
180.425
[noise] I don't really want to use it all at the beginning.

Dataset Demographics

Details Headline

Language

English

Language code

en-us

Country

USA

Accents

Arizona,...more

Gender Distribution

M: 55, F: 45

Age Group

18-70

Audio File Details

Details Headline

Environment

Silent, Noisy

Bit Depth

16 bit

Format

wav

Sample rate

8khz

Channel

Dual separate channel

Audio file duration

5-15 minutes

Download Sample Speech Dataset Now!

Explore Audio Data, Metadata and Transcription to get more clarity and hands on experience of this dataset.

Download Free Dataset

Audio Download Btn
Audio Promp Bg
Audio Promp Bg

Start your AI/ML model creation journey with FutureBeeAI!

Contact Us

Audio Arrow BtnAudio Arrow Btn Black
Audio Promp 2 Bg