The paper "Scalable agent alignment via reward modeling: a research direction" is available here: 1. 2.

Pick up cool perks on our Patreon page:

We would like to thank our generous Patreon supporters who make Two Minute Papers possible: 313V, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Anthony Vdovitchenko, Brian Gilman, Christi