Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

isn't this just related to the question "how do you train a transformer"? you give it wrong examples, and use optimization algorithms to move away from that kind of completions


thats quite hard for the reasons i explained. might be solvable using q learning techniques, but those are not easy in the context of transformers iiuc




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: