
That Apple paper mainly demonstrated that "reasoning" LLMs - with no access to additional tools - can't solve problems that were deliberately constructed so their solutions exceed the model's token limits.
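
For a back-of-envelope sense of scale (my numbers, not the paper's): Tower of Hanoi, one of the puzzles the paper uses, needs 2^n - 1 moves for n disks, so writing out every move outgrows any realistic output budget fast. Assuming roughly 10 tokens per written move:

    # Rough illustration: optimal Tower of Hanoi needs 2**n - 1 moves,
    # so a fully spelled-out solution explodes past any token budget.
    # The ~10 tokens-per-move figure is an assumption, not from the paper.
    for n in (10, 15, 20):
        moves = 2 ** n - 1
        print(f"{n} disks: {moves:,} moves, ~{moves * 10:,} tokens")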

I don't think it has much relevance at all to a conversation about how good LLMs are at solving programming problems by running tools in a loop.
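
For anyone unclear on what I mean by "tools in a loop", it's roughly this - call_llm and run_tool here are hypothetical stand-ins, not any particular framework's API:

    # Rough sketch of "an LLM running tools in a loop": the model proposes a
    # tool call, the harness runs it, the result goes back into the prompt,
    # repeat until the model answers directly. call_llm/run_tool are
    # placeholders supplied by the caller.
    def agent_loop(task, call_llm, run_tool, max_steps=20):
        messages = [{"role": "user", "content": task}]
        for _ in range(max_steps):
            reply = call_llm(messages)          # model decides the next action
            if reply.get("tool") is None:       # no tool requested -> final answer
                return reply["content"]
            result = run_tool(reply["tool"], reply["args"])
            messages.append({"role": "tool", "content": result})
        return None                             # gave up after max_steps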

I keep seeing this idea that LLMs can't handle problems that aren't in their training data, and it's frustrating, because anyone who has spent significant time working with these systems knows that it obviously isn't true.



It demonstrated that there was a hard limit on the complexity of a puzzle that LLMs could solve, no matter how many tokens they threw at it (using a form of puzzle construction that ensured the LLM couldn't just refer to its training data to solve it).



