Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A critic for the critic would be “Recursive Reward Modelling”, an exciting idea that has not been made to work in the real world yet.


Most of my ideas are not original but where can I learn more about this recursive reward modeling problem?





Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: