English-Tamil Political Domain Parallel Corpora
The dataset consists of bilingual sentence-aligned corpora for the Political domain from English to Tamil and vice versa.
Category
Parallel Corpora
Volume
50K+ sentences
Last Updated
June 2022
Number of participants
200+ people
About This OTS Dataset
Introduction
The English-Tamil Bilingual Parallel Corpora dataset for the Political domain is a curated collection of bilingual text translated between English and Tamil. It is a resource for developing Political domain-specific language models and machine translation engines.
Dataset Content
Domain Specific Content
This parallel corpus is curated to capture the vocabulary, phrasing, and domain-specific nuances of political text.
Format and Structure
Usage and Application
Secure and Ethical Collection
Update and Customization
To keep this Political Domain Parallel Corpora dataset relevant and effective for training language models and machine translation engines, we are committed to updating it regularly.
License
This Tamil-English Parallel Corpus dataset for the Political domain was created by FutureBeeAI and is available for commercial use.
Use Cases
MT Engine
Language model
Predictive keyboards
Spell check
Grammar correction
Text/speech systems
Dataset Sample(s)
SAMPLE
Source Language | Target Language |
---|---|
Bihar Chief Minister Nitish Kumar's confirmed: No more alliance with BJP forever | |
Today evening there is a meeting of ADMK MLAs in Chennai | |
Congress President Election tomorrow: 4 polling centers in Sathyamurthy Bhavan, Chennai | |
A.D.M.K. Golden Jubilee Anniversary: Respect to MGR, Jayalalitha Statues | |
Public Meetings of 51st ADMK's Annual Inaugural : Edappadi will deliver keynote speech at Namakkal on 20th | |
Congress President Election: Mallikarjuna Kharge resigns from Rajya Sabha post | |
Sudden visit to the headquarters: E.P.S. Emergency discussion with A.D.M.K. Administrators | |
Ghulam Nabi Azad's new party is called 'Democratic Freedom Party' | |
3-day hiking from Chennai to Sriperumbudur from 25th to protect Constitution: K.S. Alagiri | |
DMK Nominations for internal party elections have started | |
ATTRIBUTES
target_language | Tamil |
source_language | English |
domain | Political |
Dataset Details
Dataset type
Text Corpus Data
Volume
50K+ Sentences
Media type
Text
Language pair
English-Tamil
File Details
Type
Bilingual
Word count
7 to 12 words per asset
Format
XLSX, TMX, XML, XLIFF, XLS
Annotation
NA
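Since TMX is one of the listed delivery formats, here is a minimal sketch of how aligned sentence pairs might be read from such a file with Python's standard library. The sample document, the `read_pairs` helper, the `TAMIL_TRANSLATION_PLACEHOLDER` string, and the `en`/`ta` language codes are assumptions made for illustration; the actual files may use different header values or regional codes such as `en-US` or `ta-IN`.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical TMX content for illustration only. The Tamil segment is a
# placeholder, not real dataset text.
SAMPLE_TMX = """<tmx version="1.4">
  <header srclang="en" datatype="plaintext" segtype="sentence"
          adminlang="en" creationtool="example" creationtoolversion="1.0"
          o-tmf="none"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Today evening there is a meeting of ADMK MLAs in Chennai</seg></tuv>
      <tuv xml:lang="ta"><seg>TAMIL_TRANSLATION_PLACEHOLDER</seg></tuv>
    </tu>
  </body>
</tmx>
"""


def read_pairs(tmx_text, src="en", tgt="ta"):
    """Return (source, target) sentence pairs from a TMX document string."""
    root = ET.fromstring(tmx_text)
    pairs = []
    for tu in root.iter("tu"):
        segments = {}
        for tuv in tu.findall("tuv"):
            # Keep only the primary language subtag, so "ta-IN" matches "ta".
            lang = tuv.get(XML_LANG, "").lower().split("-")[0]
            seg = tuv.find("seg")
            if seg is not None:
                segments[lang] = "".join(seg.itertext()).strip()
        # Skip translation units that lack either side of the pair.
        if src in segments and tgt in segments:
            pairs.append((segments[src], segments[tgt]))
    return pairs


if __name__ == "__main__":
    for english, tamil in read_pairs(SAMPLE_TMX):
        # Word count of the English side, for comparison with the stated range.
        print(len(english.split()), english, "|", tamil)
```

Translation units missing either language are skipped rather than raising an error, which keeps the resulting list strictly aligned.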