isn't this just related to the question "how do you train a transformer"? you gi... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		RugnirViking 8 months ago \| parent \| context \| favorite \| on: Why language models hallucinate isn't this just related to the question "how do you train a transformer"? you give it wrong examples, and use optimization algorithms to move away from that kind of completions

throwawaymaths 8 months ago [–]

thats quite hard for the reasons i explained. might be solvable using q learning techniques, but those are not easy in the context of transformers iiuc

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact