English-Gujarati Medical Domain Parallel Corpora
The dataset consists of bilingual sentence-aligned corpora for the Medical domain from English to Gujarati and vice versa.
Category
Parallel Corpora
Volume
50K+ Sentences
Last Updated
June 2022
Number of participants
200+ people
About This OTS Dataset
Introduction
Welcome to the English-Gujarati Bilingual Parallel Corpora dataset for the Medical domain! This meticulously curated dataset offers a rich collection of bilingual text data, translated between English and Gujarati, providing a valuable resource for developing Medical domain-specific language models and machine translation engines.
Dataset Content
Domain Specific Content
This Parallel Corpus is meticulously curated to capture the linguistic intricacies and domain-specific nuances inherent to the Medical industry.
Format and Structure
Usage and Application
Secure and Ethical Collection
Update and Customization
To ensure the continued relevance and effectiveness of this Medical Domain Parallel Corpora Dataset for robust language models and machine translation engines, we are committed to regular updates.
License
This English-Gujarati Parallel Corpus dataset for the Medical domain was created by FutureBeeAI and is available for commercial use.
Use Cases
MT Engine
Language model
Predictive keyboards
Spell check
Grammar correction
Text/speech systems
Dataset Sample(s)
SAMPLE
Source Language | Target Language |
---|---|
Smoking and drinking alcohol is injurious to health. | ધૂમ્રપાન અને દારૂ પીવું સ્વાસ્થ્ય માટે હાનિકારક છે. |
The organs of two brain dead patients were donated on the same day in Surat. | સુરતમાં એક જ દિવસે બે બ્રેનડેડ દર્દીના અંગોનું દાન કરવામાં આવ્યું. |
The patient underwent a heart transplant within 90 minutes at a hospital in Ahmedabad, 273 km away. | 90 મિનિટમાં 273 કિ.મી. દૂર અમદાવાદની હોસ્પિટલમાં દર્દીનું હાર્ટ ટ્રાન્સપ્લાન્ટ કરાયું. |
Swine flu became more deadly than Corona. | કોરોના કરતાં પણ સ્વાઇન ફ્લૂ વધુ ઘાતક બન્યો. |
The highest number of swine flu cases were reported this year. | આ વર્ષે સ્વાઇન ફ્લૂના સૌથી વધુ કેસ નોધાયા. |
Gujarat ranks second in the highest number of deaths due to swine flu. | સ્વાઇન ફ્લૂથી સૌથી વધુ મૃત્યુમાં ગુજરાત બીજા સ્થાને. |
Gujarat reported 1315 cases of swine flu in a month out of which 34 died. | ગુજરાતમાં એક મહિનામાં સ્વાઇન ફ્લૂના ૧૩૧૫ કેસ, જેમાંથી ૩૪ નું મૃત્યુ થયું. |
Alzheimer's disease, which cripples the elderly even though the body is healthy. | શરીરે સ્વસ્થ હોવા છતાં વૃદ્ધોને પાંગળા બનાવી દેતી બીમારી, અલ્ઝાઈમર. |
The number of people suffering from Alzheimer's in India is around 3.5 million. | ભારતમાં અલ્ઝાઈમરથી ૫ીડાતા લોકોની સંખ્યા ૩૫ લાખ જેટલી છે. |
More than two and a half crore people in the world suffer from the problem of amnesia. | દૂનિયામાં અઢી કરોડથી પણ વધુ લોકો સ્મૃતિભ્રંશની સમસ્યા ભોગવે છે. |
ATTRIBUTES
target_language | Gujarati |
source_language | English |
domain | Medical |
Dataset Details
Dataset type
Text Corpus Data
Volume
50K+ Sentences
Media type
Text
Language pair
English-Gujarati
File Details
Type
Bilingual
Word count
7 to 12 words per asset
Format
XLSX, TMX, XML, XLIFF, XLS
Annotation
NA
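Since TMX is one of the listed delivery formats, the following is a minimal sketch of how sentence pairs might be read from such a file using only the Python standard library. The function name `read_tmx_pairs`, the language codes `en` and `gu`, and the trimmed TMX fragment are assumptions for illustration; the actual files may use different language codes or carry extra header metadata.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the built-in xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(tmx_text, src="en", tgt="gu"):
    """Return (source, target) sentence pairs from a TMX document string.

    The language codes are assumptions; adjust them to match the real files.
    """
    root = ET.fromstring(tmx_text)
    pairs = []
    for tu in root.iter("tu"):  # one translation unit per aligned pair
        segs = {}
        for tuv in tu.findall("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain lang attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # Keep only the primary subtag, so "en-US" is treated as "en".
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        if src in segs and tgt in segs:
            pairs.append((segs[src], segs[tgt]))
    return pairs


# Hypothetical, trimmed fragment built from one of the sample rows above.
SAMPLE_TMX = """<tmx version="1.4">
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Swine flu became more deadly than Corona.</seg></tuv>
      <tuv xml:lang="gu"><seg>કોરોના કરતાં પણ સ્વાઇન ફ્લૂ વધુ ઘાતક બન્યો.</seg></tuv>
    </tu>
  </body>
</tmx>"""

pairs = read_tmx_pairs(SAMPLE_TMX)
for english, gujarati in pairs:
    print(english, "|", gujarati)
```

For a file on disk, `ET.parse(path).getroot()` could replace `ET.fromstring`, which also lets the parser honor the file's declared encoding.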