![cookie-icon](/assets/img/Privacy/cookie-color-icon.webp)
We Use Cookies!!!
We use cookies to ensure that we give you the best experience on our website. Read cookies policies.
The audio dataset includes call center conversations in Telecom, featuring native English speakers from India, with detailed metadata and accurate transcriptions.
Unscripted Call Center Conversations
30 Speech Hours
July 2023
60
Welcome to the English Language Call Center Speech Dataset for the Telecom domain. It is a specialized and comprehensive collection of voice data designed to enhance the development of call center speech recognition models specifically for the Telecom industry.
With high-quality call center audio recordings, detailed metadata, and accurate transcriptions, it empowers researchers and developers to enhance natural language processing, conversational AI, and generative voice AI algorithms in the Telecom domain. Moreover, it facilitates the creation of sophisticated voice assistants and voice bots tailored to the unique linguistic nuances found in the English language spoken in India.
Speech Data:
This training dataset comprises 30 hours of call center audio recordings covering various topics and scenarios related to the Telecom domain, to build robust and accurate customer service speech technology.
To curate realistic call center interactions, we collaborated with a diverse network of 60 expert native English speakers from different states/provinces of India. This collaborative effort ensures a balanced representation of Indian accents, dialects, and demographics, promoting inclusivity and reducing biases in the dataset.
Each audio recording captures the essence of unscripted and spontaneous conversations between call center agents and customers, with an average duration ranging from 5 to 15 minutes per call. The dataset includes both inbound and outbound calls, covering scenarios such as inquiries, promotional offers, complaints, technical support, and more. Additionally, the dataset contains call center conversations with both positive and negative outcomes, providing a diverse and realistic dataset.
The speech data is available in WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 kHz, ensuring high-quality audio for accurate analysis. The recording environment is generally quiet, without background noise and echo.
Metadata:
In addition to the audio recordings, our dataset provides comprehensive metadata for each participant. This includes the participant’s age, gender, country, state, and dialect. Additionally, it includes metadata like domain, topic, call type, outcome, bit depth, and sample rate for each conversation.
The metadata serves as a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English language call center speech recognition models for the Telecom domain.
Transcription:
To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. The transcriptions capture speaker-wise transcription with time-coded segmentation along with non-speech labels and tags, covering both the agent and customer conversations.
These ready-to-use transcriptions accelerate the development of Telecom call center conversational AI and ASR models for the English language.
Updates and Customization:
We understand the importance of collecting data in various environments to build robust ASR models. Therefore, our call center voice dataset is regularly updated with new audio data captured in diverse real-world conditions.
If you require a custom training dataset with specific environmental conditions, we can accommodate your request. We can provide voice data with customized sample rates ranging from 8kHz to 48kHz, allowing you to fine-tune your models for different audio recording setups. Additionally, we can also customize the transcription following your specific guidelines and requirements, to further support your ASR development process.
License:
This Telecom call center audio dataset is created by FutureBeeAI and is available for commercial use!
Conclusion:
Whether you are training or fine-tuning speech recognition models, advancing NLP algorithms, or building state-of-the-art voice assistants to improve customer experiences in the Telecom sector, our dataset serves as a trusted resource to meet your goals
Channel 1 | Channel 2 | Format |
---|---|---|
Female(22) | Female(21) | wav, json |
LABEL | START | END | CHANNEL | TRANSCRIPT |
---|---|---|---|---|
Speech | 0.350 | 1.300 | Speaker 2 | Good morning. |
Speech | 1.899 | 3.625 | Speaker 2 | Thank you for calling telecom. |
Speech | 4.424 | 5.825 | Speaker 2 | My name is <PII>Sushmita</PII>. |
Speech | 6.349 | 8.125 | Speaker 2 | How can I assist you today? |
Noise | 7.650 | 8.099 | - | - |
Speech | 8.375 | 9.948 | Speaker 1 | Good morning <PII>Sushmita</PII>. |
Speech | 11.698 | 14.675 | Speaker 1 | I am having some issue with my phone service. |
Speech | 15.948 | 19.250 | Speaker 1 | I get experiencing drop calls and poor reception |
Speech | 19.870 | 22.370 | Speaker 1 | in certain area. Can you help me with that? |
Speech | 24.954 | 25.704 | Speaker 2 | Of course. |
Speech | 26.045 | 28.896 | Speaker 2 | I will be happy to assist you with that. |
Speech | 29.137 | 32.686 | Speaker 2 | Let's start by checking the coverage in your area. |
Speech | 33.628 | 35.160 | Speaker 2 | Could you please provide me |
Speech | 35.444 | 36.987 | Speaker 2 | with your zip code? |
Speech | 38.953 | 42.628 | Speaker 1 | Yeah sure. My zip code is <PII>one two three four five</PII> |
Speech | 44.070 | 44.771 | Speaker 2 | Thank you. |
Speech | 45.130 | 47.204 | Speaker 2 | Let me check the coverage map |
Speech | 47.478 | 48.527 | Speaker 2 | for your area. |
Speech | 49.024 | 50.024 | Speaker 2 | I see that |
Speech | 50.444 | 52.345 | Speaker 2 | there are a few cell towers |
Speech | 53.169 | 54.100 | Speaker 2 | near by you. |
Speech | 54.637 | 56.862 | Speaker 2 | So, you should have good coverage. |
Speech | 57.536 | 58.310 | Speaker 2 | However |
Speech | 58.728 | 59.402 | Speaker 2 | sometimes |
Speech | 59.728 | 61.703 | Speaker 2 | there can an environmental factors |
Speech | 62.103 | 63.878 | Speaker 2 | that affect the signal strength. |
Speech | 64.590 | 67.367 | Speaker 2 | Have you noticed is the signal issues |
Speech | 67.608 | 69.683 | Speaker 2 | occur indoors or outdoors? |
Speech | 72.400 | 77.552 | Speaker 1 | It seems to happen mostly in indoor specially in office building and at home. |
Speech | 78.652 | 79.277 | Speaker 2 | I see. |
Speech | 79.736 | 83.554 | Speaker 2 | Indoor coverage can be influence by factors like |
Speech | 83.947 | 85.289 | Speaker 2 | building materials, |
Speech | 85.671 | 86.645 | Speaker 2 | and obstruction. |
Speech | 87.587 | 90.138 | Speaker 2 | Let's troubleshoot the issue further. |
Speech | 90.558 | 91.009 | Speaker 2 | First |
Speech | 91.546 | 92.572 | Speaker 2 | I recommend |
Speech | 92.921 | 94.245 | Speaker 2 | try a simple step, |
Speech | 94.715 | 96.266 | Speaker 2 | restarting your phone. |
Speech | 97.459 | 98.286 | Speaker 2 | This can |
Speech | 98.632 | 101.639 | Speaker 2 | often resolve temporary network glitches. |
Speech | 102.355 | 106.057 | Speaker 2 | Can you please turn off your phone and turn it back on. |
Speech | 108.751 | 111.230 | Speaker 1 | Sure. I will do that right now. |
Speech | 112.105 | 114.605 | Speaker 1 | Okay. I have restarted my phone. |
Speech | 116.918 | 117.468 | Speaker 2 | Great. |
Speech | 117.861 | 118.611 | Speaker 2 | Let see |
Speech | 118.882 | 120.281 | Speaker 2 | if the issue persists. |
Speech | 120.900 | 121.724 | Speaker 2 | Can make |
Speech | 122.117 | 126.311 | Speaker 2 | Can you make a test call to friend or family member? |
Speech | 126.555 | 127.504 | Speaker 2 | And let me know |
Speech | 127.772 | 129.247 | Speaker 2 | if the call quality |
Speech | 129.467 | 130.442 | Speaker 2 | has improved. |
Speech | 132.633 | 133.556 | Speaker 1 | All right. |
Speech | 133.919 | 135.568 | Speaker 1 | I am making the call now. |
Speech | 136.456 | 144.181 | Speaker 1 | The call quality does seem a beat better than before. But I can still hear some static allocational drop in the call. |
Speech | 146.901 | 149.919 | Speaker 2 | Thank you for testing that. It's okay. |
Speech | 150.443 | 154.062 | Speaker 2 | It's good to know that the call quality has improved. |
Speech | 154.501 | 154.949 | Speaker 2 | But |
Speech | 155.281 | 158.031 | Speaker 2 | we will work on eliminating those |
Speech | 158.354 | 159.580 | Speaker 2 | remaining issues. |
Speech | 160.131 | 163.056 | Speaker 2 | To address this static and drop calls |
Speech | 163.506 | 164.431 | Speaker 2 | I recommend |
Speech | 164.717 | 167.044 | Speaker 2 | checking for any software update |
Speech | 167.372 | 169.270 | Speaker 2 | available for your phone. |
Speech | 170.258 | 174.286 | Speaker 2 | Keep your phone software up to date can often resolve |
Speech | 174.586 | 176.161 | Speaker 2 | compatibility issues |
Speech | 176.485 | 177.508 | Speaker 2 | with the network. |
Speech | 178.102 | 180.324 | Speaker 2 | Could you please go to your phone setting |
TIME | TRANSCRIPT |
---|---|
0.350 1.300 | Good morning. |
1.899 3.625 | Thank you for calling telecom. |
4.424 5.825 | My name is <PII>Sushmita</PII>. |
6.349 8.125 | How can I assist you today? |
7.650 8.099 | - |
8.375 9.948 | Good morning <PII>Sushmita</PII>. |
11.698 14.675 | I am having some issue with my phone service. |
15.948 19.250 | I get experiencing drop calls and poor reception |
19.870 22.370 | in certain area. Can you help me with that? |
24.954 25.704 | Of course. |
26.045 28.896 | I will be happy to assist you with that. |
29.137 32.686 | Let's start by checking the coverage in your area. |
33.628 35.160 | Could you please provide me |
35.444 36.987 | with your zip code? |
38.953 42.628 | Yeah sure. My zip code is <PII>one two three four five</PII> |
44.070 44.771 | Thank you. |
45.130 47.204 | Let me check the coverage map |
47.478 48.527 | for your area. |
49.024 50.024 | I see that |
50.444 52.345 | there are a few cell towers |
53.169 54.100 | near by you. |
54.637 56.862 | So, you should have good coverage. |
57.536 58.310 | However |
58.728 59.402 | sometimes |
59.728 61.703 | there can an environmental factors |
62.103 63.878 | that affect the signal strength. |
64.590 67.367 | Have you noticed is the signal issues |
67.608 69.683 | occur indoors or outdoors? |
72.400 77.552 | It seems to happen mostly in indoor specially in office building and at home. |
78.652 79.277 | I see. |
79.736 83.554 | Indoor coverage can be influence by factors like |
83.947 85.289 | building materials, |
85.671 86.645 | and obstruction. |
87.587 90.138 | Let's troubleshoot the issue further. |
90.558 91.009 | First |
91.546 92.572 | I recommend |
92.921 94.245 | try a simple step, |
94.715 96.266 | restarting your phone. |
97.459 98.286 | This can |
98.632 101.639 | often resolve temporary network glitches. |
102.355 106.057 | Can you please turn off your phone and turn it back on. |
108.751 111.230 | Sure. I will do that right now. |
112.105 114.605 | Okay. I have restarted my phone. |
116.918 117.468 | Great. |
117.861 118.611 | Let see |
118.882 120.281 | if the issue persists. |
120.900 121.724 | Can make |
122.117 126.311 | Can you make a test call to friend or family member? |
126.555 127.504 | And let me know |
127.772 129.247 | if the call quality |
129.467 130.442 | has improved. |
132.633 133.556 | All right. |
133.919 135.568 | I am making the call now. |
136.456 144.181 | The call quality does seem a beat better than before. But I can still hear some static allocational drop in the call. |
146.901 149.919 | Thank you for testing that. It's okay. |
150.443 154.062 | It's good to know that the call quality has improved. |
154.501 154.949 | But |
155.281 158.031 | we will work on eliminating those |
158.354 159.580 | remaining issues. |
160.131 163.056 | To address this static and drop calls |
163.506 164.431 | I recommend |
164.717 167.044 | checking for any software update |
167.372 169.270 | available for your phone. |
170.258 174.286 | Keep your phone software up to date can often resolve |
174.586 176.161 | compatibility issues |
176.485 177.508 | with the network. |
178.102 180.324 | Could you please go to your phone setting |
English
en-In
India
Chandigarh,...more
M:55, F:45
18-70
Silent, Noisy
16 bit
wav
8khz
Dual separate channel
5-15 minutes
Explore Audio Data, Metadata and Transcription to get more clarity and hands on experience of this dataset.
Download Free Dataset
Contact Us