Discussion about this post

User's avatar
Anatol Wegner, PhD's avatar

I believe at this stage one should be extremely cautious about company demos. The proof/counterexample was obtained by an undisclosed custom internal model that seems to have been specifically tailored/fine tuned to the problem by two future fields medalist caliber mathematicians Mark Sellke and Mehtaab Sawhney with the help of another CS prodigy Lijie Chen. They probably iteratively fine tuned the scaffolding, model, context/RAG and verifier and biased it towards promising strategies (all of which can then conveniently be hidden behind the undisclosed internal model) and then OpenAI claimed an autonomous result when the model after having a 125 page random walk (and that’s just a summary) was able to tumble over the finishing line under the watchful eyes of expert mathematicians who then picked up the output, checked it and turned it into an actual proof.

The fact that nowadays a group of brilliant mathematicians tweaking an AI into producing a proof is considered a much bigger breakthrough than them proving the thing themselves should in itself be enough to show how proficient these systems actually are at mathematics.

Tom's avatar
2dEdited

Well done, this is a fairly balanced take on the OpenAI result unlike the techno-hype surrounding it or the complete cynical dismissal of it seen in what I refer to as the “professional AI skeptic” community.

The technology is still flawed and there is significant room for improvement, but I am not sure we can conclude from that a priori that “mechanical intelligence” will remain inferior to “creative intelligence” or that only humans are capable of creative insights. Furthermore, many pronouncements about human exceptionalism should be met with as much as skepticism as the hype surrounding AI.

You said: “But deep understanding, creativity and theory development do not appear within reach of AI models.” I’d love to pick your brain on the following:

Is this a specific claim about the current paradigm of LLM-based AI models, or a more general claim about any possible AI architecture, and if it is the latter, what theoretical or physical barriers fundamentally preclude artificial systems from achieving these traits?

Is this a testable empirical hypothesis, or is it a non-falsifiable philosophical position, and if it is testable, what specific, measurable benchmarks or behavioral outputs would constitute sufficient evidence to convince you otherwise? Would progress on ARC AGI 3 make a difference?

Where exactly is the line between “complex pattern matching” and “deep understanding”, and is there a quantifiable mechanism that differentiates the two, or do you view understanding as a moving goalpost that shifts whenever AI masters a new cognitive domain?

What you are calling creative intelligence arose through evolution and the collective interaction of humans over time. Many in the AI skeptic community tend to believe that human intelligence is an essential, individualized “secret sauce” construct of a brain when, in reality, human intelligence is a product not only of a brain but also of interaction between humans and environmental conditions. For example, studies of feral children show that human intelligence is greatly curtailed and there are certain gaps that cannot be filled in with later training after this form of deprivation occurs. I don’t think anyone has offered a definitive proof for the inherent superiority of this “natural” process or has shown that it is the only way that creative intelligence could arise in the universe even if you may be correct about the deficiencies of the current AI paradigm.

1 more comment...

No posts

Ready for more?