Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
nmca
on June 27, 2024
|
parent
|
context
|
favorite
| on:
CriticGPT: Finding GPT-4's mistakes with GPT-4
A critic for the critic would be “Recursive Reward Modelling”, an exciting idea that has not been made to work in the real world yet.
soloist11
on June 27, 2024
[–]
Most of my ideas are not original but where can I learn more about this recursive reward modeling problem?
nmca
on June 27, 2024
|
parent
[–]
https://arxiv.org/abs/1811.07871
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: