RT @OpenAI@twitter.com
We've used reinforcement learning from human feedback to train language models for summarization. The resulting models produce better summaries than 10x larger models trained only with supervised learning: https://openai.com/blog/learning-to-summarize-with-human-feedback/