Unknown NLP Lab
Toggle navigation
about
blog
publications
people
submenus
bookshelf
blog
ctrl k
agi
an archive of posts with this tag
Sep 23, 2024
Training Language Models to Self-Correct via Reinforcement Learning