
This 35B-A3B model is 4-5x cheaper than Haiku, though, which suggests that in your example it would still be cheaper to outsource inference to the cloud than to run it locally.

