Rich Sutton on AI creativity and discovery

Posted by yimby 6 hours ago

https://www.youtube.com/watch?v=K5LAFEjTlBA

Comments

Comment by musebox35 1 hour ago

The most successful applications like coding are not the result of pure LLM/generative modeling. They come from closing the loop with an agentic harness. The generate-test-selectively refine loop is the core modality of scientific work. An LLM + RL with Verifiable Rewards + feedback from compiler/terminal runs mimics this process to a great extend.

This is Fisher/Box feedback loop (https://www-sop.inria.fr/members/Ian.Jermyn/philosophy/writi...) implemented on a modern computational system. LLM is just a component. I wish Sutton had commented on this fuller picture of what we have now instead of commenting just on the LLM/Backprop side of things. I am honestly curious of whether such a loop can at least partially automate discovery.

There are more elements to discovery though. It is still not clear where the initial working model/hypothesis comes from or how the updates are selected (unless it is just parameter induction). I recently read about Hanson's Patterns of Discovery which aims in that direction. I have still not read it, but I am curious if it has any mechanistic clues.

Comment by rembicilious 4 hours ago

"So that is my call to arms. If we want the full power of AI scientists, then we should share the goals with them so they can create, evaluate, discover, and in these ways fully participate in achieving the goals. Let’s be bold! Let’s fully automate Creativity and Discovery!"

Should we automate exercise and play as well? How about learning?

The machine didn't have a soul, so we donated ours.

Eureka! My AI found it!

Comment by musebox35 1 hour ago

I understand the skepticism. I am worried about the implications of AI as well. The deeper issue at stake is that the depth of scientific knowledge has been increasing for a very long time. Now you get to have a PhD in esoteric subproblems and that slows down research especially if the discoveries require depth in multiple subdomains. Socially and economically training people in every combinatorial combination of subfields at the required depth may not be possible. I am especially interested in two problems to be resolved and do not care if an AI scientist performed the discovery. It will be humbling, but totally worth it:

- Fusion (a clean sustainable form): Without this I think we are heading in a very wrong direction, whether it is conflict or climate change does not matter. Everyone is aware of this and instinctively afraid of the implied loss of quality+quantity of life.

- Cure for Cancer: It is a world wonder even in Civ. I and for good reason. As a father of a teenager, every time I hear a story of someone losing a parent/child I cringe. We have to accept this as a reality of life until a proper/generic cure is found that eliminates the most common offenders.

I am skeptical that we will have AGI anytime soon and I think the social aspects will help balance the technical developments even it becomes a reality (Three laws, A Butlerian uprising, you name it).

Chess bots can beat grandmasters, but I have a friend who takes his son to tournaments. Humans are still playing chess, kids in the same tournament with grand masters. We have to have faith in the humanity, or all else will not matter.

And I will definitely keep playing Factorio even if AGI comes to pass ;-)

Comment by hashta 2 hours ago

I think a lot of deep learning is compositional generalization. Models learn reusable pieces (abstractions, styles, procedures, constraints, etc) and recombine them in ways that may never have appeared as a whole in the training data. So even if the ingredients come from past data the final composition can still be novel in a meaningful sense

Comment by doctoboggan 2 hours ago

> That is, I would say that creativity requires that the new things generated be Evaluated. Without evaluation, and retention of the best, there is nothing created. The novelty flickers into existence but, if its value is unrecognized, it flickers away and is lost.

I really like the way he frames this here. I think a lot of people in the twitter comments (and maybe a few here) aren't reading past the introduction. He isn't saying AI systems are incapable of creativity and discovery. He is claiming generative AI without a harness is not capable of creativity and discovery. There needs to be some other system that "recognizes the value" of the novel idea and remembers it. He gives examples of where this value recognition step is automated and thus by his definition achieve creativity and discovery in a fully automated system.

Comment by Shitty-kitty 3 hours ago

One has to be very specific when throwing around words like "creative" when talking about A.I

Can A.I create art. Well it can create something that's pleasing to our senses but art is ultimately about conveying human feelings and emotions. Even as humans, understanding art is not universal. "feelings and emotions" and therefore art, can be deeply tied to a particular groups shared beliefs and experiences.

Can it be creative in non-subjective fields such as math or sciences. Einstein derived GR from his creative thought experiments. If an A.I poped out GR's field equations simply by testing different mathematical frameworks that resolve the issues discovered by experiments, is that creative? Perhaps but certainly not in the same way.

Comment by highfrequency 4 hours ago

Unless I'm missing something, this argument seems to apply only to the original pretraining era (eg GPT 1-4). The post-training and reinforcement learning paradigms are clearly doing variation, evaluation and selective retention no?

Comment by kibibu 4 hours ago

The transcript does seem to overlook post-training steps like Reinforcement Learning with Verifiable Rewards (RLVR) (but I'll certainly won't claim that Rich Sutton is unaware of such things; RLVR has a very narrow set of evaluation approaches).

I wonder if this is a precursor to Keen Tech leaning into David Silver's Ineffable Intelligence approach.

Comment by LarsDu88 3 hours ago

This was exactly what I was thinking of. RLVR is the secret sauce behind o3 and its many successors.

Its the secret sauce behind why the current models are so great at coding and soon to be unbeatable at math.

LLMs can pose many questions and if they are easily verifiable, fine tune very heavily. A lot of the world models discussion will inevitable lean into simulations as verification.

Comment by armchairhacker 3 hours ago

I don’t think ML can’t be creative or make discoveries. I think creativity and discovery are, ultimately, simultaneously thinking about the right seemingly-disparate concepts (whereas algorithmic thinking is more obviously related concepts). If not an LLM, some other model can generate random ideas, rank them, then output the best.

But I think humans are better at it, while ML is better at algorithmic thinking. “Better” being more efficient and something we more enjoy doing; we can also more accurately rank what subjectively appeals to humans (i.e. taste), especially ourselves.

I think ML should be optimized for tasks that require more generalization than programming, but are still mostly logic. Like software development, translation, and tools for art and discovery.

Comment by edot 5 hours ago

I don't quite follow his point. Is it: a) that we need a new foundational algorithm that integrates a goal (one with "taste") directly into the training step, or b) that we need to point trained models towards goals as they iterate?

If it's a), he doesn't propose such an algorithm, and I don't know how you'd do it at such a low level because how do you quantify abstract goals? Did he suggest such an algorithm and I misread? If it's b), that already exists, see AlphaEvolve or any number of things he said. Or, to be a bit of a smart-ass, just type /goal and let it rip ...

I also think he's just categorically wrong that LLMs cannot do good and novel things. And if it can, then you could just say "well that's not novel, that's derivative". A simple example, if I make up a programming language with an LLM and it works well for my purposes, then is that not novel and good? I mean, is any language other than FORTRAN not novel?

Everything is derivative and you can put an LLM in a loop to evaluate LLMs trying things. I must be misunderstanding because he's too smart to be this wrong.

Comment by nateroling 5 hours ago

No, I think I he’s saying that we have that, and we should use it more.

AlphaGo uses discovery when it evaluates potential moves and iterates.

Claude Code uses discovery when it generates a script and the evaluates whether it works or not.

He’s saying we need to allow ai systems to do the evaluation and iteration themselves for science and engineering the same way we do for code.

Basically, harness engineering for engineering.

Comment by oliveiracwb 5 hours ago

LLMs possess the map but are unable to discern fertile from barren ground. For instance: how does Anthropic's new model generate promising 'medications'? Because, beyond the knowledge embedded within the model, it has assimilated AlphaFold's reasoning paradigm. By itself, Claude would be incapable of engineering a protein analysis method

Comment by whattheheckheck 5 hours ago

Idk one of his yt video presentations was saying we're entering a "designer" age of the universe

https://youtu.be/ThFq87Rp21s?si=SrKj72_X8bjnB6ED

Around 35min mark

Comment by sph 1 hour ago

The world will not be satisfied until we have read, and discussed every half-famous person’s opinion on AI.

Still about ten million discussions to go.

Comment by yanis_t 1 hour ago

I mean even I know who Sutton is [1]. He is one of the reinforcement learning pioneers.

[1] https://en.wikipedia.org/wiki/Richard_S._Sutton

Comment by whatever1 2 hours ago

It’s ok, LLMs are useful as they are today. Even if they can never can come up with the next generation of math, physics etc.

Even for humans the brains who managed a step change in thinking are so rare that we literally know them by name.

Comment by whiplash451 2 hours ago

You might be missing that those rare humans were sitting on tons of failed or somewhat useful discoveries made by more “mediocre” humans that history forgot.

Comment by jaakl 3 hours ago

AI reminds me a lot https://en.wikipedia.org/wiki/TRIZ , it is like a good machine implementation of it.

Comment by cortesoft 3 hours ago

> When we ask for a fiction or novelty, the AI can give it to us because its processing is in part stochastic. Every decision can go multiple ways and will go different ways and produce a different trajectory every time. The trajectory can be random—and thus novel—or it can be based on the training data—and thus “good” because the training data is good, sourced from people or reality. Thus, the trajectory is either novel or good—based on randomness or based on data—but never both at the same time.

This doesn't seem true? You can be both random and based on training data.

Comment by maxbond 2 hours ago

I think they meant more "it can be extrapolated or interpolated" or "it can be high variance and 'creative' or it can be low variance and 'reliable/correct/likely'". If you want to see something new, the model will need to step off the manifold. But the manifold is where you've learned the "correct" solutions live.

Comment by zeroonetwothree 3 hours ago

Only to a limited extent.

Comment by 1dom 50 minutes ago

I enjoyed reading this at the start, the language is very... inspiring. By the end, I was disappointed. I don't disagree with what they're saying, but the opening style and statements made me expect some more specific or groundbreaking conclusions.

The point seems to be that generative AI just generates stuff, and that real discovery requires variation, evaluation and selective retention.

The call to arms seems based on the assumption that people only every talk about generative AI as discovery machines themselves. I think it's pretty widely accepted that's not the case by everyone apart from cliche out-of-touch CEOs.

But the talk makes me realise that generative AI are incredible tools to do the discovery cycle with, and this is what I imagine professionally successful AI users are doing: variation, evaluation and selective retention of their inputs and outputs to generative AI.

Comment by anabis 2 hours ago

He seems to be saying that Claude Code can make discoveries. Does anyone think novel discoveries can be made from systems created by supervised learning only, and attempting to do so?

> Claude-Code, which have brought true advances in ... programming. ... these systems have found things that are both novel and good.

Comment by WithinReason 1 hour ago

https://www.forbes.com/sites/anishasircar/2026/04/17/ai-solv...

Comment by balazstorok 4 hours ago

There seems to be a problem with how he poses the problems alphaGo and these GAI models face.

AlphaGO is given a hard evaluation externally. It did not itself come up with it.

When GAI models are given an external hard evaluation, they can also succeed in many different domains (that is one of the remarkable features, succeeding in many domains) ranging from simple programming tasks to frontier mathematics (disproving conjectures recently) to writing more optimized kernel code than before.

And there is plenty of RL especially in these fields where the solution may be extremely complex but eval is rather less complex. And even the discovery and the "evolution-like" trace-selection is also happening.

For this reason it seems strange to compare it to AlphaGO as alphago is given a hard eval independent of itself, from an external source (humans) in a narrow domain. If GAI is given such, it can also show some remarkable results.

But what I find more strange is that innovation and moving forward in many many many cases does not require truly novel ideas but instead a high-quality execution of layering different methods, tactics, ideas on top of each other. Because in many domains our collective knowledge is incredibly sparse and complex, something being able to recombine tools, models, ideas in a high quality way (as he mentions being selective) I think is extraordinarily powerful. And in such cases, with a finite exploration horizon (time, resource available) with 1% "good choices" vs 3% "good choices" are worlds apart, incomparable.

Most importantly: none of the above is about intelligence, it's barren solution-farming to important, valuable problems we have. Most of the AGI and intelligence-related debate seems to miss out on this simple fact. (Insert the usual stuff like a plane being unable to fly like a bird or a submarine not swimming is totally irrelevant to it being useful).

And then a final point: do we really think this thing is incapable of doing better on average on problems we average people face in our lifetime? What should we think, how should we define human intelligence when we give out degrees in science or medicine for 60-70% exam results on problems considered to be generic in the field?

Comment by est 1 hour ago

Richard talks about AI been either novel or good

Then it shifts to discovery.

These seems related but not exactly the same thing.

Comment by thedreammachine 4 hours ago

Humm maybe. But a plain model sampling outputs obviously isn't doing discovery in the AlphaGo sense. But once you put the model in a loop with tests, feedback, tools or even a human picking the good result, it starts to get much closer to the process he's describing.

Comment by E-Reverance 4 hours ago

I think its worth emphasizing that his argument isn't completely against generative ai, but rather its environment. Although I don't see why it would be impossible for something like an LLM to learn some sort of self-play within its context window

Comment by dwd 5 hours ago

"We have many AI systems which can give us more. ... and Claude-Code, which have brought true advances in science, mathematics, and programming."

That contradiction kind of says he doesn't know what he's talking about.

Comment by phyzix5761 5 hours ago

Yes, the guy with a PhD in Machine Intelligence, co-author of Reinforcement Learning: An Introduction, which is universally considered the bible of the field, recipient of the AAAI fellowship award and the Turing Award, and the inventor of Temporal Difference Learning doesn't know what he's talking about.

Comment by dwd 4 hours ago

Sure, but does that mean he's right all the time about all things, including everything in his own field?

He is saying no generative AI is going to produce output that is both good and novel because it is always derivative. And then adds a generative AI (Claude Code) into his list of AI that have produced output that he feels is good and novel, invalidating what he is arguing.

"...no matter how many instances of white swans we may have observed, this does not justify the conclusion that all swans are white."

Comment by zeroonetwothree 2 hours ago

If you read it he says that CC has additional aspects beyond ordinary GAI, namely the ability to verify. That aspect is necessary for GAI to be good and novel.

Although personally I think code doesn’t actually need to be very novel so it’s actually the best example.

Comment by lowbloodsugar 4 hours ago

“When a distinguished but elderly scientist states that something is possible, he is almost certainly right. When he states that something is impossible, he is very probably wrong.”

https://en.wikipedia.org/wiki/Clarke%27s_three_laws

Comment by E-Reverance 4 hours ago

I don't completely disagree but its worth noting how new a lot of the empirical evidence in favour of LLMs are, so its not impossible to be a tad ignorant of the present

Comment by aureate 2 hours ago

Surprisingly enough, Turing Award winner and father of reinforcement learning Richard Sutton knows perfectly well what he's talking about. The whole talk is about the need to have the ability to test novel outputs against reality and iterate to find ones that are good. This is exactly what Claude Code, the agent framework, adds to Claude, the LLM, to allow it to find novel coding solutions that actually work.

Comment by Lerc 5 hours ago

I think the variation, evaluation, and selection idea is a good, if not the only, way do do creative work.

I don't think I would attribute anything in that process that I would consider an AI to be incapable of.

The characterisation of variation like this would seem to rest on the same 'random but directed' crutch that some free will arguments rest upon.

There is no random but directed of course, there is random and there is caused, and there are things that use both as components, but the random remains wholly random, and the caused remains entirely deterministic.

I think there is a good case to say that, in many fields, AI is better than humans at evaluation.

To find avenues to consider, I'm not entirely convinced that human innovation is more than a heuristic that appears more chaotic by virtue of a inconsistent and opaque formulation.

Many aspects of ideas com from noting how some two things are different and then considering that axis of difference when applied to another thing.

The possibilities thrown up by this extremely simple method are vast enough to require multiple layers of evaluation, most could be dismissed out of hand by a quick 'This is nonsense' check that I suspect people do so often and at a rate that it wouldn't even rise to the level of consciousness.

Comment by skor 1 hour ago

yes, when you drive these systems you can get novel output, depends on the effort you put in

Comment by 2 hours ago

Comment by acosmism 1 hour ago

This reset my thoughts

Comment by 2 hours ago

Comment by Papazsazsa 5 hours ago

Creativity = variation + evaluation + selection. It's not bad, though every example he gives has a built-in scoring function haha.

Best thing about nerds is watching them try and build frameworks and formulas for the creative act. Like a metronome trying to compose a symphony.

Comment by vasco 2 hours ago

It's funny to me that one would rate their own takes as "new and possibly controversial". Whatever comes next is read under that light of an author that thinks this about their own thoughts.

And the core point is not even true. They can definitely output novel things that are good - less so but they can and they do. Plenty of examples.

> Thus, the trajectory is either novel or good—based on randomness or based on data—but never both at the same time.

This assumes no possible unexplored path yields good results, or said another way, that none of the random results can be good, which is not true. The whole text seems to try to prove a point decided a-priori rather than make a case based on reality.

Comment by simianwords 3 hours ago

I'm trying to keep an open mind and understand what the author is trying to say because he is credentialed.

His main point is that discoveries involve

1. Variation,

2. Evaluation, and

3. Selective retention.

He makes a jump saying AI is only capable of 1) and humans are capable of 1) 2) and 3). I don't know what makes humans special enough that they can do 2) and 3)?

In fact, the more you think of this it is kind of strange - in science humans can only do "evaluation" because they have access to the real world. They can evaluate a new drug because they can do it on people so it is not some inherent limitation of AI but rather access to physical realm.

Finally I want to ask a specific thing: how do you mathematically falsify what this person is saying? How can you formally prove that - no AI can not "evaluate"? I ask because I make AI evaluate a lot of people's claims and it works for me.

Comment by skybrian 2 hours ago

He's saying that pre-training an LLM alone can't do it, but if you run an LLM in a loop with tools (like any coding agent) then it can. Also, the technique his group came up with should be used more:

> This is the weakness of deep learning that is alleviated with a new algorithm that my group presented in Nature a couple of years ago. Our “continual backpropagation” made one small change: every so often a less-used neuron would be re-initialized to small random weights. This allows the variation to continue and plasticity to be retained.

Here's the paper: https://www.nature.com/articles/s41586-024-07711-7

It has a fair number of citations, but I haven't looked into how much it's used.

Comment by simianwords 2 hours ago

Sorry this makes no sense — humans also use tools to evaluate their discoveries.

Kant said something like this: knowledge can’t be obtained by pure thinking, it needs interaction with the world.

This is obvious to me so why is the author making a claim that LLMs can make knowledge without access to environment but purely through thinking in aether

Comment by zeroonetwothree 2 hours ago

He actually says the areas in which AI has had the novel successes are those which can be evaluated (like coding or Go). Not that it can’t happen at all.

Comment by simianwords 2 hours ago

That’s my point, he says ai does well where evaluation is neurosymbolically closed.

But so do humans? How do humans make discoveries without having formal ways to evaluate? In my pharma drug example, humans could evaluate only because they had access to the physical realm.

I can’t think of an example of humans evaluating a discovery in a way that LLMs can’t. can you?

Comment by albinowax_ 2 hours ago

[dead]

Comment by pevansgreenwood 4 hours ago

[dead]

Comment by oliveiracwb 5 hours ago

[dead]

Comment by erickhill 5 hours ago

[flagged]

Comment by Legend2440 5 hours ago

[flagged]

Comment by dang 3 hours ago

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html

Comment by habitue 5 hours ago

[flagged]

Comment by dang 3 hours ago

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html

Comment by 21 minutes ago