Open Ended Classification Prompt & Response Dataset in Portuguese

This dataset consists of a wide range of open-ended classification prompts and responses in Portuguese. It also includes detailed annotations for each prompt and response pair.

Total volume

3000+ Assets

Last Updated

Sep 2023

Number of participants

50+ people

About This OTS Dataset

What’s Included

Welcome to the Portuguese Open Ended Classification Prompt-Response Dataset, a collection of 3000+ manually curated prompt and response pairs. It is a resource for training language models (LMs) to classify input text accurately, an important capability for generative AI.

Dataset Content:

This open-ended classification dataset comprises a diverse set of prompts and responses. Each prompt contains the input text to be classified and may also contain a task instruction, context, constraints, and restrictions; the completion contains the best classification category as the response. Both prompts and completions are in Portuguese. Because the dataset is open-ended, the prompt does not list candidate categories to choose from.

These prompt and completion pairs cover a broad range of topics, including science, history, technology, geography, literature, current affairs, and more. Each prompt is accompanied by a response. Both the prompts and the responses were manually curated by native Portuguese speakers, with references drawn from diverse sources such as books, news articles, and websites.

This open-ended classification prompt and completion dataset contains different types of prompts, including instruction type, continuation type, and in-context learning (zero-shot, few-shot) type. The dataset also contains prompts and responses with different types of rich text, including tables, code, JSON, etc., with proper markdown.

Prompt Diversity:

To ensure diversity, this open-ended classification dataset includes prompts at three complexity levels (easy, medium, and hard) and three lengths (short, medium, and long). It also contains prompts with constraints and persona restrictions, which makes it more useful for LLM training.

Response Formats:

The dataset includes different response types depending on the prompt: single-word, short-phrase, and single-sentence responses. These responses cover text strings, numerical values, and date and time formats, which helps a language model learn to generate reliable, coherent, and contextually appropriate answers.

Data Format and Annotation Details:

This fully labeled Portuguese Open Ended Classification Prompt Completion Dataset is available in JSON and CSV formats. It includes annotation details such as a unique ID, prompt, prompt type, prompt length, prompt complexity, domain, response, response type, and rich text presence.
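As a rough illustration of how the annotation fields listed above might be checked after loading either file format, here is a minimal Python sketch. The snake_case field names are assumptions derived from the list above (only `unique_id` appears on this page), the helper names `parse_records` and `missing_fields` are hypothetical, and the sample record is a placeholder rather than real dataset content.

```python
import csv
import io
import json

# Assumed snake_case names based on the annotation list above.
# Only "unique_id" is shown on this page; the rest are hypothetical.
EXPECTED_FIELDS = [
    "unique_id", "prompt", "prompt_type", "prompt_length",
    "prompt_complexity", "domain", "response", "response_type",
    "rich_text_presence",
]


def parse_records(text, fmt):
    """Parse JSON (a list of objects) or CSV text into a list of dicts."""
    if fmt == "json":
        return json.loads(text)
    if fmt == "csv":
        return list(csv.DictReader(io.StringIO(text)))
    raise ValueError(f"unsupported format: {fmt}")


def missing_fields(record):
    """Return the expected fields that are absent from one record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


# Placeholder record for illustration only; not taken from the dataset.
sample_json = json.dumps([{name: "placeholder" for name in EXPECTED_FIELDS}])
records = parse_records(sample_json, "json")

# Map each incomplete record's ID to the fields it lacks.
problems = {
    r.get("unique_id"): missing_fields(r)
    for r in records
    if missing_fields(r)
}
```

If the delivered files use different column names or a nested JSON layout, `EXPECTED_FIELDS` and `parse_records` would need to be adjusted to match.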

Quality and Accuracy:

Our dataset upholds the highest standards of quality and accuracy. Each prompt undergoes meticulous validation, and the corresponding responses are thoroughly verified. We prioritize inclusivity, ensuring that the dataset incorporates prompts and completions representing diverse perspectives and writing styles, maintaining an unbiased and discrimination-free stance.

The Portuguese text is free of spelling and grammatical errors. No copyrighted, toxic, or harmful content was used in the construction of this dataset.

Continuous Updates and Customization:

The entire dataset was prepared with the assistance of human curators from the FutureBeeAI crowd community. Ongoing efforts are made to add more assets to this dataset, ensuring its growth and relevance. Additionally, FutureBeeAI offers the ability to gather custom open-ended classification prompt and completion data tailored to specific needs, providing flexibility and customization options.

License:

The dataset, created by FutureBeeAI, is now available for commercial use. Researchers, data scientists, and developers can leverage this fully labeled and ready-to-deploy Portuguese Open Ended Classification Prompt-Completion Dataset to enhance the classification abilities and accurate response generation capabilities of their generative AI models and explore new approaches to NLP tasks.

Use Cases

Language Model Training

Classification Model Training

Natural Language Understanding

Dataset Sample(s)

Samples will be available soon!

Dataset Details

Dataset type

Classification Prompt & Response Dataset

Volume

3000+

Media type

Text

Language

Portuguese

Domain

science, history, technology,...more

File Details

Format

JSON, CSV

Annotation

Yes

Schema Element

unique_id, ,...more

Similar to Open Ended Classification Prompt & Response Dataset

Open Ended Classification Prompt & Completion Dataset in Gujarati

Gujarati Open Ended Classification Dataset

Open ended classification prompt & response dataset in Gujarati Language.

3000+

Diverse Types

Language Model Training

Classification Model Training

Open Ended Classification Prompt & Completion Dataset in Korean

Korean Open Ended Classification Dataset

Open ended classification prompt & response dataset in Korean Language.

3000+

Diverse Types

Language Model Training

Classification Model Training

Open Ended Classification Prompt & Completion Dataset in Russian

Russian Open Ended Classification Dataset

Open ended classification prompt & response dataset in Russian Language.

3000+

Diverse Types

Language Model Training

Classification Model Training

Open Ended Classification Prompt & Completion Dataset in Chinese

Chinese Open Ended Classification Dataset

Open ended classification prompt & response dataset in Chinese Language.

3000+

Diverse Types

Language Model Training

Classification Model Training

Need datasets for a specific AI/ML use case? Don’t worry, we’ve got you covered! 👍

More in Portuguese

Bengali Closed Ended Question Answer Dataset

Kannada Closed Ended Question Answer Dataset

Odia Open Ended Classification Dataset

Marathi Brainstorming Dataset
