
You can't directly compare losses across phases, because (I think) they changed the data distribution for each phase. It's 100% guaranteed that they change the data distribution after the 10 trillion token mark, since that's when they start adding in instruction-following data, but I don't know for sure whether the other phase changes also include data distribution changes.
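To make the point concrete, here's a rough sketch of why the reported loss can move when only the data mix changes. Everything in it is made up for illustration: the domain names, the per-domain losses, and the phase weights are assumptions, not numbers from any real training run. The only idea it relies on is that the expected loss on a mixture is the weighted average of the per-domain losses.

```python
def mixture_loss(per_domain_loss, weights):
    """Expected loss of one fixed model on a data mixture.

    `weights` maps domain -> sampling probability and must sum to 1.
    """
    assert abs(sum(weights.values()) - 1.0) < 1e-9
    return sum(weights[d] * per_domain_loss[d] for d in weights)


# Hypothetical per-domain losses for a single frozen checkpoint.
per_domain_loss = {"web": 2.0, "code": 1.0, "instruct": 1.5}

# Hypothetical data mixes before and after instruction data is added.
phase_a = {"web": 0.8, "code": 0.2, "instruct": 0.0}
phase_b = {"web": 0.5, "code": 0.2, "instruct": 0.3}

loss_a = mixture_loss(per_domain_loss, phase_a)
loss_b = mixture_loss(per_domain_loss, phase_b)

# Same weights, same model, different number on the loss curve.
print(f"phase A mix: {loss_a:.2f}, phase B mix: {loss_b:.2f}")
```

The model is identical in both cases, so the gap between the two numbers says nothing about whether it got better. A fair comparison would need both checkpoints evaluated on the same held-out set.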

