RT @OpenAI@twitter.com

We've used reinforcement learning from human feedback to train language models for summarization. The resulting models produce better summaries than 10x larger models trained only with supervised learning: openai.com/blog/learning-to-su

🐦🔗: twitter.com/OpenAI/status/1301

Mastodon

Personal server of Lukáš Lánský