
Wikipedia also has an article on large language models[0] that includes a section on emergent behaviour.

> While it is generally the case that performance of large models on various tasks can be extrapolated based on the performance of similar smaller models, sometimes large models undergo a "discontinuous phase shift" where the model suddenly acquires substantial abilities not seen in smaller models. These are known as "emergent abilities", and have been the subject of substantial study. Researchers note that such abilities "cannot be predicted simply by extrapolating the performance of smaller models".

[0]: https://en.wikipedia.org/wiki/Large_language_model



The source cited for that Wikipedia paragraph is: https://openreview.net/forum?id=yzkSU5zdwD


The owner of the link is listed on that page, so I'm guessing it's no coincidence.



