Improving language model behavior by training on a curated dataset

OpenAI improved language model behavior by training on a carefully curated dataset. This approach helps reduce harmful and untruthful outputs. It matters because better training data leads to safer and more reliable AI models.

ArchiveMajor

Signal trust

Single sourceEarly signal

PublishedThursday, June 10, 2021 at 9:00 AMJun 10, 09:00 AM

FreshnessArchive

Story ID#759

Back to feed Original report

Original article excerpt

Server-side extracted preview paragraphs from the original source.

Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset.

We’ve found we can improve language model behavior with respect to specific behavioral values by fine-tuning on a curated dataset of <100 examples of those values. We also found that this process becomes more effective as models get larger. While the technique is still nascent, we’re looking for OpenAI API users who would like to try it out and are excited to find ways to use these and other techniques in production use cases.

Opening the briefing

Improving language model behavior by training on a curated dataset

Original article excerpt