Hacker News

Where is this spelled out formally and proven logically?


LLM backtracking is an active area of research, see e.g.

https://arxiv.org/html/2502.04404v1

https://arxiv.org/abs/2306.05426

And I was wrong that nobody has implemented it; these papers show that people have. It's just that the results haven't been impressive enough to support the transition from the research lab to industrial use, at least not yet.
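One way to picture inference-time backtracking is as a depth-first search over continuations: extend the most promising candidate, and when every continuation of a prefix scores below some confidence threshold, abandon it and return to an earlier alternative. Below is a minimal sketch of that idea. The `STEP_SCORES` table is a hand-built stand-in for a model's token log-probabilities, and all names (`backtracking_decode`, the threshold value, the toy vocabulary) are hypothetical, purely for illustration; the papers above use far more sophisticated machinery.

```python
import math

# Toy "model": scores next-token candidates for each prefix.
# In a real system these would come from an LLM's log-probs;
# here it is a hand-built table (hypothetical, for illustration).
STEP_SCORES = {
    (): {"A": -0.1, "B": -0.3},
    ("A",): {"x": -2.5},            # tempting prefix that dead-ends
    ("B",): {"x": -0.4, "y": -0.6},
    ("B", "x"): {"end": -0.2},
    ("B", "y"): {"end": -0.9},
}

def backtracking_decode(max_len=4, threshold=-1.0):
    """Depth-first decode: extend the best-scoring candidate first,
    and backtrack to earlier alternatives when every continuation
    falls below the threshold (a stand-in for low model confidence)."""
    stack = [((), 0.0)]  # (prefix of tokens, cumulative score)
    while stack:
        prefix, score = stack.pop()
        if prefix and prefix[-1] == "end":
            return list(prefix), score
        if len(prefix) >= max_len:
            continue  # depth limit reached: backtrack
        candidates = STEP_SCORES.get(prefix, {})
        # Push worse candidates first so the best is popped next (LIFO).
        for tok, s in sorted(candidates.items(), key=lambda kv: kv[1]):
            if s >= threshold:  # prune clearly bad continuations
                stack.append((prefix + (tok,), score + s))
    return None, float("-inf")

seq, score = backtracking_decode()
print(seq, score)  # greedy would pick "A" first, then backtrack to "B"
```

Note that a purely greedy decoder would commit to the locally best token "A" and get stuck, since its only continuation is pruned; the stack lets the search recover the "B" branch instead.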


> Empirical evaluations demonstrate that our proposal significantly enhances the reasoning capabilities of LLMs, achieving a performance gain of over 40% compared to the optimal-path supervised fine-tuning method.


I would expect to see something like this fairly soon: right now we are seeing the end of training-time scaling and the beginning of inference-time scaling.


This is a neat observation: training has been optimized to hell, while inference optimization is just beginning.



