We Use Cookies!!!
We use cookies to ensure that we give you the best experience on our website. Read cookies policies.
The audio dataset includes call center conversations in Telecom, featuring native English speakers from UK, with detailed metadata and accurate transcriptions.
Unscripted Call Center Conversations
30 Speech Hours
July 2023
60
Welcome to the English Language Call Center Speech Dataset for the Telecom domain. It is a specialized and comprehensive collection of voice data designed to enhance the development of call center speech recognition models specifically for the Telecom industry.
With high-quality call center audio recordings, detailed metadata, and accurate transcriptions, it empowers researchers and developers to enhance natural language processing, conversational AI, and generative voice AI algorithms in the Telecom domain. Moreover, it facilitates the creation of sophisticated voice assistants and voice bots tailored to the unique linguistic nuances found in the English language spoken in United Kingdom.
Speech Data:
This training dataset comprises 30 hours of call center audio recordings covering various topics and scenarios related to the Telecom domain, to build robust and accurate customer service speech technology.
To curate realistic call center interactions, we collaborated with a diverse network of 60 expert native English speakers from different states/provinces of United Kingdom. This collaborative effort ensures a balanced representation of British accents, dialects, and demographics, promoting inclusivity and reducing biases in the dataset.
Each audio recording captures the essence of unscripted and spontaneous conversations between call center agents and customers, with an average duration ranging from 5 to 15 minutes per call. The dataset includes both inbound and outbound calls, covering scenarios such as inquiries, promotional offers, complaints, technical support, and more. Additionally, the dataset contains call center conversations with both positive and negative outcomes, providing a diverse and realistic dataset.
The speech data is available in WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 kHz, ensuring high-quality audio for accurate analysis. The recording environment is generally quiet, without background noise and echo.
Metadata:
In addition to the audio recordings, our dataset provides comprehensive metadata for each participant. This includes the participant’s age, gender, country, state, and dialect. Additionally, it includes metadata like domain, topic, call type, outcome, bit depth, and sample rate for each conversation.
The metadata serves as a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English language call center speech recognition models for the Telecom domain.
Transcription:
To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. The transcriptions capture speaker-wise transcription with time-coded segmentation along with non-speech labels and tags, covering both the agent and customer conversations.
These ready-to-use transcriptions accelerate the development of Telecom call center conversational AI and ASR models for the English language.
Updates and Customization:
We understand the importance of collecting data in various environments to build robust ASR models. Therefore, our call center voice dataset is regularly updated with new audio data captured in diverse real-world conditions.
If you require a custom training dataset with specific environmental conditions, we can accommodate your request. We can provide voice data with customized sample rates ranging from 8kHz to 48kHz, allowing you to fine-tune your models for different audio recording setups. Additionally, we can also customize the transcription following your specific guidelines and requirements, to further support your ASR development process.
License:
This Telecom call center audio dataset is created by FutureBeeAI and is available for commercial use!
Conclusion:
Whether you are training or fine-tuning speech recognition models, advancing NLP algorithms, or building state-of-the-art voice assistants to improve customer experiences in the Telecom sector, our dataset serves as a trusted resource to meet your goals
Channel 1 | Channel 2 | Format |
---|---|---|
Male(23) | Male(22) | wav, json |
LABEL | START | END | CHANNEL | TRANSCRIPT |
---|---|---|---|---|
Noise | 0.619 | 0.828 | - | - |
Speech | 1.172 | 3.069 | Speaker 1 | Hello is this Virgin Media? |
Noise | 3.374 | 3.745 | - | - |
Speech | 3.868 | 8.257 | Speaker 2 | Hello? Yeah, this is <PII>Stephen</PII> here from Virgin Media Broadband. How can I help you today? |
Speech | 8.147 | 10.198 | Speaker 1 | Hey <PII>Stephen</PII> how are you doing? |
Speech | 11.005 | 12.443 | Speaker 2 | I'm good, [noise] and yourself? |
Speech | 12.769 | 23.618 | Speaker 1 | Great I just (()) from Brazil and I am looking for new telecom agency [noise] [filler] you have to nothing to great. It is want to get something set up my phone |
Speech | 24.078 | 27.608 | Speaker 1 | just to I can you know have a few minutes and data etc. |
Speech | 28.376 | 41.097 | Speaker 2 | Of course, yeah. [filler] So what [filler] are you with any British [filler] [noise] Telecom provider as it stands, or you're absolutely unprovided by a British Telecom service at the moment |
Speech | 40.208 | 43.170 | Speaker 1 | No, I arrived I arrived yesterday so not the |
Speech | 42.347 | 43.597 | Speaker 2 | [noise] you arrived yesterday |
Speech | 44.429 | 58.798 | Speaker 2 | Okay okay. [noise] And would you want just a simple [filler] phone [filler] sort of telecom package or would you like a broad band and phone so for your home you know WiFi as well at the home where you stay |
Noise | 46.972 | 48.472 | - | |
Speech | 57.484 | 66.799 | Speaker 1 | [noise] Okay so,(()) you know in the <initial>US</initial> in the Brazil sorry I think so is the different. How does work you in the <initial>UK</initial> |
Speech | 64.995 | 65.254 | Speaker 2 | Yeah. |
Speech | 67.158 | 74.117 | Speaker 2 | Of course, Yeah. So I'm assuming you're staying now at a resident or at a home now in the <initial>UK</initial> Will that be a correct assumption? |
Speech | 73.200 | 78.352 | Speaker 1 | Yeah, yeah. I just got into my resident (()) nothing yet but slowly and surely, |
Speech | 77.534 | 77.861 | Speaker 2 | Yep, |
Speech | 78.322 | 81.552 | Speaker 2 | slowly and surely of course Yeah. So here in the <initial>UK</initial> |
Speech | 82.750 | 96.054 | Speaker 2 | [filler] the telecom providers provide broad band so internet for domestic purposes, so for the house phone services. So if you require a landline and you want to have a domestic phone, that's also provided by us here and well |
Speech | 96.215 | 110.516 | Speaker 2 | us in Virgin Media, but also telecom providers across the <initial>UK</initial> [noise] [filler] <initial>TV</initial> So if you want a <initial>TV</initial> license, that can also be acquired via us. [noise] And then we also provide [filler] fiber broadband, which is a much quicker and |
Speech | 110.530 | 113.745 | Speaker 2 | sort of broadband connection for your home. And of course, |
Speech | 114.480 | 127.296 | Speaker 2 | four <initial>G</initial> [noise] services or five <initial>G</initial> services, as we now have the newer version for your cellular phone, which you can use anywhere, even at home if you want to. But obviously, assuming you have a broadband package at your residence, |
Noise | 127.292 | 127.496 | - | - |
Speech | 127.549 | 132.408 | Speaker 2 | you would want to use that and not use up your four <initial>G</initial> [filler] allocation. [noise] |
Speech | 132.779 | 133.308 | Speaker 2 | [filler] |
Speech | 133.466 | 135.062 | Speaker 1 | [noise] Okay, sounds good |
Speech | 133.901 | 135.893 | Speaker 2 | [noise] with oh sorry, Yeah, yeah. |
Speech | 136.264 | 137.032 | Speaker 1 | (()) sounds good. |
Speech | 137.895 | 151.848 | Speaker 2 | [filler] within the broadband packages, we have a lot to offer. So we start as low as [filler] <initial>M</initial> [filler] one twenty five fiber broadband package. So that's around one thirty two megabytes [filler] per second. [filler] I know, |
Speech | 151.961 | 166.681 | Speaker 2 | [noise] it's called one twenty five, it offers one thirty two.[laugh] That's how it works. We then have the <initial>M</initial> two fifty fiber broadband, that's doubling that's two, six, four megabits per second of the internet connection. That's download speed, not upload speed, |
Speech | 154.473 | 154.871 | Speaker 1 | [laugh] |
Speech | 166.697 | 181.562 | Speaker 2 | [noise] by the way. [noise] And then we've got the <initial>M</initial> three fifty fiber broadband, which is three hundred and sixty two megabits per second, and it allows you know people to stream in four <initial>K</initial> [filler] multiple users at once. And then you've got the <initial>M</initial> five hundred, |
Speech | 181.627 | 182.401 | Speaker 2 | which[filler] |
TIME | TRANSCRIPT |
---|---|
0.619 0.828 | - |
1.172 3.069 | Hello is this Virgin Media? |
3.374 3.745 | - |
3.868 8.257 | Hello? Yeah, this is <PII>Stephen</PII> here from Virgin Media Broadband. How can I help you today? |
8.147 10.198 | Hey <PII>Stephen</PII> how are you doing? |
11.005 12.443 | I'm good, [noise] and yourself? |
12.769 23.618 | Great I just (()) from Brazil and I am looking for new telecom agency [noise] [filler] you have to nothing to great. It is want to get something set up my phone |
24.078 27.608 | just to I can you know have a few minutes and data etc. |
28.376 41.097 | Of course, yeah. [filler] So what [filler] are you with any British [filler] [noise] Telecom provider as it stands, or you're absolutely unprovided by a British Telecom service at the moment |
40.208 43.170 | No, I arrived I arrived yesterday so not the |
42.347 43.597 | [noise] you arrived yesterday |
44.429 58.798 | Okay okay. [noise] And would you want just a simple [filler] phone [filler] sort of telecom package or would you like a broad band and phone so for your home you know WiFi as well at the home where you stay |
46.972 48.472 | |
57.484 66.799 | [noise] Okay so,(()) you know in the <initial>US</initial> in the Brazil sorry I think so is the different. How does work you in the <initial>UK</initial> |
64.995 65.254 | Yeah. |
67.158 74.117 | Of course, Yeah. So I'm assuming you're staying now at a resident or at a home now in the <initial>UK</initial> Will that be a correct assumption? |
73.200 78.352 | Yeah, yeah. I just got into my resident (()) nothing yet but slowly and surely, |
77.534 77.861 | Yep, |
78.322 81.552 | slowly and surely of course Yeah. So here in the <initial>UK</initial> |
82.750 96.054 | [filler] the telecom providers provide broad band so internet for domestic purposes, so for the house phone services. So if you require a landline and you want to have a domestic phone, that's also provided by us here and well |
96.215 110.516 | us in Virgin Media, but also telecom providers across the <initial>UK</initial> [noise] [filler] <initial>TV</initial> So if you want a <initial>TV</initial> license, that can also be acquired via us. [noise] And then we also provide [filler] fiber broadband, which is a much quicker and |
110.530 113.745 | sort of broadband connection for your home. And of course, |
114.480 127.296 | four <initial>G</initial> [noise] services or five <initial>G</initial> services, as we now have the newer version for your cellular phone, which you can use anywhere, even at home if you want to. But obviously, assuming you have a broadband package at your residence, |
127.292 127.496 | - |
127.549 132.408 | you would want to use that and not use up your four <initial>G</initial> [filler] allocation. [noise] |
132.779 133.308 | [filler] |
133.466 135.062 | [noise] Okay, sounds good |
133.901 135.893 | [noise] with oh sorry, Yeah, yeah. |
136.264 137.032 | (()) sounds good. |
137.895 151.848 | [filler] within the broadband packages, we have a lot to offer. So we start as low as [filler] <initial>M</initial> [filler] one twenty five fiber broadband package. So that's around one thirty two megabytes [filler] per second. [filler] I know, |
151.961 166.681 | [noise] it's called one twenty five, it offers one thirty two.[laugh] That's how it works. We then have the <initial>M</initial> two fifty fiber broadband, that's doubling that's two, six, four megabits per second of the internet connection. That's download speed, not upload speed, |
154.473 154.871 | [laugh] |
166.697 181.562 | [noise] by the way. [noise] And then we've got the <initial>M</initial> three fifty fiber broadband, which is three hundred and sixty two megabits per second, and it allows you know people to stream in four <initial>K</initial> [filler] multiple users at once. And then you've got the <initial>M</initial> five hundred, |
181.627 182.401 | which[filler] |
English
en-gb
UK
English - East and C,...more
M:55, F:45
18-70
Silent, Noisy
16 bit
wav
8khz
Dual separate channel
5-15 minutes
Explore Audio Data, Metadata and Transcription to get more clarity and hands on experience of this dataset.
Download Free Dataset
Contact Us