The audio dataset contains call center conversations in the Travel domain, featuring native English speakers from the US, with detailed metadata and accurate transcriptions.
Unscripted Call Center Conversations
30 Speech Hours
July 2023
60 Speakers
Welcome to the English Language Call Center Speech Dataset for the Travel domain. It is a specialized and comprehensive collection of voice data designed to enhance the development of call center speech recognition models specifically for the Travel industry.
With high-quality call center audio recordings, detailed metadata, and accurate transcriptions, it helps researchers and developers improve natural language processing, conversational AI, and generative voice AI algorithms in the Travel domain. It also supports the creation of voice assistants and voice bots tailored to the linguistic nuances of English as spoken in the United States.
Speech Data:
This training dataset comprises 30 hours of call center audio recordings covering a range of topics and scenarios in the Travel domain, intended for building robust and accurate customer service speech technology.
To curate realistic call center interactions, we collaborated with a diverse network of 60 expert native English speakers from different states of the United States. This collaborative effort aims to provide a balanced representation of US accents, dialects, and demographics, promoting inclusivity and reducing biases in the dataset.
Each audio recording captures an unscripted, spontaneous conversation between a call center agent and a customer, with call durations ranging from 5 to 15 minutes. The dataset includes both inbound and outbound calls, covering scenarios such as inquiries, promotional offers, complaints, technical support, and more. It also contains conversations with both positive and negative outcomes, which adds to its diversity and realism.
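As a rough sanity check on dataset size, the stated totals (30 hours of audio, calls of 5 to 15 minutes) bound the number of calls. The sketch below assumes every call falls inside that range; the function name is illustrative and the result is an estimate, not a published figure.

```python
def call_count_bounds(total_hours, min_call_minutes, max_call_minutes):
    """Return (fewest, most) calls that could fill the given number of hours."""
    total_minutes = total_hours * 60
    fewest = total_minutes // max_call_minutes  # every call at the longest duration
    most = total_minutes // min_call_minutes    # every call at the shortest duration
    return fewest, most


# Values taken from the description: 30 hours, 5 to 15 minutes per call.
fewest, most = call_count_bounds(30, 5, 15)
print(f"Between {fewest} and {most} calls")
```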
The speech data is available in WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 kHz, ensuring high-quality audio for accurate analysis. The recording environment is generally quiet, without background noise and echo.
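Before training, it can be useful to confirm that a downloaded file matches the stated format (stereo, 16-bit, 8 kHz). The following is a minimal sketch using Python's standard `wave` module; the function names and the `EXPECTED` dictionary are illustrative and not part of the dataset's tooling.

```python
import wave

# Properties stated in the dataset description: stereo, 16-bit (2 bytes), 8 kHz.
EXPECTED = {"channels": 2, "sample_width_bytes": 2, "sample_rate_hz": 8000}


def describe_wav(path):
    """Read basic header fields from a WAV file."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": rate,
            "duration_s": frames / rate,
        }


def matches_spec(info, expected=EXPECTED):
    """Return True when every expected property matches the file's header."""
    return all(info[key] == value for key, value in expected.items())
```

Calling `matches_spec(describe_wav("call.wav"))` on a file (the path here is a placeholder) returns `False` if, for example, the audio was delivered at a customized sample rate.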
Metadata:
In addition to the audio recordings, our dataset provides comprehensive metadata for each participant. This includes the participant’s age, gender, country, state, and dialect. Additionally, it includes metadata like domain, topic, call type, outcome, bit depth, and sample rate for each conversation.
The metadata serves as a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English language call center speech recognition models for the Travel domain.
Transcription:
To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. The transcriptions are segmented by speaker and time-coded, include non-speech labels and tags, and cover both the agent and customer sides of each conversation.
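The exact JSON schema is not documented here, so the sketch below assumes a list of segments whose keys (`label`, `start`, `end`, `channel`, `transcript`) mirror the columns of the sample transcript table. Those key names are an assumption; adjust them to the actual files. It sums speech time per speaker and skips non-speech segments.

```python
import json
from collections import defaultdict

# Hypothetical JSON layout; values are copied from rows of the sample transcript.
SAMPLE_JSON = """
[
  {"label": "Noise",  "start": 0.066,  "end": 0.313,  "channel": null,        "transcript": null},
  {"label": "Speech", "start": 2.192,  "end": 3.192,  "channel": "Speaker 1", "transcript": "Hello Futurebee."},
  {"label": "Speech", "start": 4.137,  "end": 5.227,  "channel": "Speaker 2", "transcript": "Hello Futurebee."},
  {"label": "Speech", "start": 12.775, "end": 14.214, "channel": "Speaker 2", "transcript": "Hi this is <PII>Mercedes</PII>."}
]
"""


def talk_time_by_speaker(segments):
    """Sum the duration of speech segments for each speaker, in seconds."""
    totals = defaultdict(float)
    for seg in segments:
        if seg["label"] != "Speech":
            continue  # skip noise and other non-speech segments
        totals[seg["channel"]] += seg["end"] - seg["start"]
    return {speaker: round(total, 3) for speaker, total in totals.items()}


segments = json.loads(SAMPLE_JSON)
print(talk_time_by_speaker(segments))
```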
These ready-to-use transcriptions accelerate the development of Travel call center conversational AI and ASR models for the English language.
Updates and Customization:
We understand the importance of collecting data in various environments to build robust ASR models. Therefore, our call center voice dataset is regularly updated with new audio data captured in diverse real-world conditions.
If you require a custom training dataset with specific environmental conditions, we can accommodate your request. We can provide voice data with customized sample rates ranging from 8 kHz to 48 kHz, allowing you to fine-tune your models for different audio recording setups. We can also customize the transcription to follow your specific guidelines and requirements, to further support your ASR development process.
License:
This Travel call center audio dataset was created by FutureBeeAI and is available for commercial use.
Conclusion:
Whether you are training or fine-tuning speech recognition models, advancing NLP algorithms, or building state-of-the-art voice assistants to improve customer experiences in the Travel sector, our dataset serves as a trusted resource to meet your goals.
Channel 1 | Channel 2 | Format |
---|---|---|
Male(29) | Female(24) | wav, json |
LABEL | START | END | CHANNEL | TRANSCRIPT |
---|---|---|---|---|
Noise | 0.066 | 0.313 | - | - |
Speech | 2.192 | 3.192 | Speaker 1 | Hello Futurebee. |
Speech | 4.137 | 5.227 | Speaker 2 | Hello Futurebee. |
Speech | 7.891 | 11.480 | Speaker 1 | Hi I am <PII>Michael</PII> With, Futurebee Travel. Whom I speaking with today? |
Speech | 12.775 | 14.214 | Speaker 2 | Hi this is <PII>Mercedes</PII>. |
Speech | 15.282 | 16.108 | Speaker 1 | Hi <PII>Mercedes</PII>. |
Speech | 16.510 | 18.318 | Speaker 1 | [filler]how can I help you today? |
Speech | 19.673 | 31.902 | Speaker 2 | [filler]I just wanted to reach out to your company because I was planning on going on a little vacation. [filler] I was thinking of going to Thailand but honestly I just don't know how |
Speech | 32.831 | 36.819 | Speaker 2 | will the time or even the know how to book it myself. |
Speech | 37.539 | 44.517 | Speaker 2 | So I wanted to find out some of your guys pricing and the services that you offer for [filler] as a travel agency. |
Noise | 45.505 | 45.673 | - | - |
Speech | 47.051 | 57.240 | Speaker 1 | Okay well you have called the right place [filler] and we would be ecstatic to help you. [filler] real quick. Do you know where in Chiang Mai you want to [filler] do you, do you know where in |
Speech | 57.981 | 59.234 | Speaker 1 | Thailand you want to visit? |
Speech | 61.317 | 66.998 | Speaker 2 | [filler]well I was thinking about maybe one of the like coastal towns but |
Noise | 67.034 | 67.287 | - | - |
Speech | 67.709 | 70.328 | Speaker 2 | I heard that the mountains are really |
Speech | 70.757 | 73.040 | Speaker 2 | under rated and thought that that might be |
Speech | 73.745 | 81.197 | Speaker 2 | might be a bit different thing than my usual like beach get away to, you know, I go to Mexico sometimes so I thought maybe switch it up a bit. |
Speech | 82.191 | 83.492 | Speaker 2 | So somehwere with the mountains. |
Speech | 84.239 | 84.769 | Speaker 1 | Okay. |
Speech | 85.364 | 94.034 | Speaker 1 | Yeah trying to, trying to branch out. [filler] the reason I actually I said Chiang Mai, as soon as you said Thailand, I thought of Chiang Mai. Chiang Mai is one of my favorite places |
Speech | 94.504 | 103.450 | Speaker 1 | in all of Thailand. Honestly in all of the world. [filler] so, if you are looking for the mountains, Thailand might be a really, Chiang Mai might be a really good place to go. |
Speech | 104.099 | 106.498 | Speaker 1 | [filler]do you want me to look up some information on that? |
Speech | 108.421 | 109.763 | Speaker 2 | Yeah sure go ahead. Very fine. |
Noise | 110.131 | 110.787 | - | - |
Speech | 111.558 | 121.625 | Speaker 1 | Okay. So while I am getting a few options for you, [filler] could you help me out just, what is your [filler] what is your address? I want to see what airports you close to. |
Noise | 123.269 | 123.673 | - | - |
Speech | 123.703 | 128.593 | Speaker 2 | [filler]well I am living in San Antonio, Texas right now [filler]. |
Speech | 128.967 | 133.563 | Speaker 2 | So I would probably say the San Antonio airport could be the closest. |
Speech | 133.895 | 136.752 | Speaker 2 | But I often use the Austin |
Speech | 137.131 | 138.377 | Speaker 2 | airport as well |
Speech | 139.063 | 139.830 | Speaker 2 | [filler] |
Speech | 140.895 | 142.300 | Speaker 2 | and if need be |
Speech | 142.770 | 147.179 | Speaker 2 | I could even travel to [filler] [noise] to Houston or to Dallas |
Noise | 144.093 | 144.342 | - | - |
Speech | 147.877 | 152.197 | Speaker 2 | if that really like made a, made a big impact on the price of flights or something like that. |
Speech | 150.545 | 150.973 | Speaker 1 | Okay |
Speech | 154.324 | 160.015 | Speaker 1 | Its very possible it could. So I am glad you are able to tell me that you [filler] have that as an availability.[filler] |
Speech | 160.497 | 163.149 | Speaker 1 | Do you have a time of year you would like to travel? |
Speech | 164.407 | 169.111 | Speaker 2 | Well I just, we just entered the new year right? So my [filler] |
Noise | 169.460 | 169.604 | - | - |
Speech | 170.087 | 173.533 | Speaker 2 | <initial>PTL</initial> for, for my job just renewed. |
Speech | 174.306 | 176.877 | Speaker 2 | [filler]I don't really want to use it |
Speech | 178.117 | 180.425 | Speaker 2 | [noise] I don't really want to use it all at the beginning. |
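The sample transcript uses inline tags such as `<PII>…</PII>`, `<initial>…</initial>`, `[filler]`, and `[noise]`. Many ASR pipelines normalize such tags before computing metrics or training a language model. The sketch below shows one possible normalization, assuming only the tags visible in the sample; the real tag inventory may be larger, and the `[PII]` mask token is an arbitrary choice.

```python
import re

PII_RE = re.compile(r"<PII>.*?</PII>")         # personally identifiable spans
INITIAL_RE = re.compile(r"</?initial>")        # keep the letters, drop the tags
EVENT_RE = re.compile(r"\[(?:filler|noise)\]")  # non-lexical events


def normalize(text, mask="[PII]"):
    """Mask PII spans, unwrap initialisms, and drop filler/noise markers."""
    text = PII_RE.sub(mask, text)
    text = INITIAL_RE.sub("", text)
    text = EVENT_RE.sub(" ", text)
    return " ".join(text.split())  # collapse leftover whitespace


print(normalize("Hi this is <PII>Mercedes</PII>."))
print(normalize("[filler]how can I help you today?"))
```

Whether fillers should be dropped or kept as tokens depends on the downstream task; verbatim ASR evaluation, for instance, may need them preserved.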
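Field | Value |
---|---|
Language | English |
Language code | en-us |
Country | USA |
States | Arizona,...more |
Gender distribution | M: 55, F: 45 |
Age range | 18-70 |
Environment | Silent, Noisy |
Bit depth | 16 bit |
Format | wav |
Sample rate | 8 kHz |
Channels | Dual separate channel |
Call duration | 5-15 minutes |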