Nicholas

DeepMind's Pushmeet Kohli on AI's Scientific Revolution

Nicholas

Pushmeet Kohli leads AI for Science at DeepMind, where his team has created AlphaEvolve, an AI system that discovers entirely new algorithms and proves mathematical results that have eluded researchers for decades. From improving 50-year-old matrix multiplication algorithms to generating interpretable code for complex problems like data center scheduling, AlphaEvolve represents a new paradigm where LLMs coupled with evolutionary search can outperform human experts. Pushmeet explains the technical architecture behind these breakthroughs and shares insights from collaborations with mathematicians like Terence Tao, while discussing how AI is accelerating scientific discovery across domains from chip design to materials science. Hosted by Sonya Huang and Pat Grady, Sequoia Capital

Published
Published Jul 11, 2025
Uploaded
Uploaded Jun 11, 2026
File type
POD
Queried
0

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:43

[00:00] So I went to a biology conference and after I gave my talk, [00:04] A biologist approached me. [00:06] And he said, Pushmit, I have been working on this protein for, [00:12] or the last 10 years. [00:15] And I had collected so much lab data [00:19] to characterize this protein [00:21] to figure out its structure. But somehow this has eluded, like this has evaded all kind of investigation and we still didn't know the structure. [00:31] But we had all this data. [00:33] If we knew the structure, we could sort of validate it very quickly. I ran AlphaFold 2. It gave me the structure. It perfectly fit the answer. I've been working on this for 10 years. Wow. What do I do next? [00:48] *music* [01:03] What happens when AI stops just answering questions and starts asking them? [01:08] In this episode, Pushmeet Kali discusses DeepMind's Alpha Evolve, a breakthrough evolutionary AI system that discovers entirely new algorithms. Pushmeet reveals how coupling language models with evaluators creates something unprecedented, AI that can tackle decades-old math problems and generate human-interpretable code that outperforms expert design solutions. [01:29] Pushmeet shares stunning examples of AI uncovering hidden mathematical truths and explains why we're witnessing the emergence of a new scientific method, one where AI doesn't just accelerate discovery, but transforms which problems we can even attempt to solve.

1:44-3:17

[01:44] Enjoy the show. Pushmate, thank you so much for joining us today. We've all been eagerly waiting for the moment that AI is capable of making novel scientific discoveries. Do you think that Alpha Evolve is that watershed moment? [01:57] Yeah, so it's certainly a key sort of milestone. What we have shown is that [02:03] You have AI sort of model, a large language model, when coupled with a harness, is able to discover new algorithms. And not only that, it's basically able to view new mathematical results, which have been studied for many, many years. [02:22] You used the words when coupled with a harness. Can you tell us more about that harness? [02:27] Yeah, so... [02:29] If you now go back to AI models, like the history of AI for science is very long. We have a number of different models that have tried to do this. [02:41] scientific discovery. Like one of the key models in this category is AlphaFold, right, which is the prototypical example of what can be achieved by AI in science. We released AlphaFold 2 at the end of 2021, and it won the Nobel Prize last year. [03:04] So the impact of AI in science is very well understood. Now, the question is whether LLMs, [03:13] And foundational models, how can they impact science?

3:18-4:51

[03:18] in [03:19] Around two years back, we had an agent called FundSearch in which we took an LLM. [03:25] and we coupled it with an evaluator. [03:28] and the evaluator allowed the LLM [03:31] to figure out when it was making new conjectures on sort of coming up with new ideas to solve problems, whether they were hallucinations. [03:42] or whether they were brilliant insights. [03:45] So essentially, in this particular case, hallucinations were great because some of those hallucinations were, in fact, brilliant new insights that nobody had thought about. [03:56] So this is where the hardness comes in. [03:59] that you have this evaluation function and essentially a search protocol associated with the LLM that together is able to come up with completely new discoveries that are really impactful. [04:14] You mentioned FunSearch. Could you say a word on the difference between the results you all accomplished with FunSearch versus AlphaEvolve? [04:21] Yeah, so FunSearch was our first instantiation of taking a large-time model and trying to see if it can discover new algorithms. [04:30] The models at that time, [04:33] were weaker. [04:35] And the type of search that we were trying to do, we had not sort of explored things much further. So what we asked the LLM to do was essentially try to complete a small function.

4:51-6:26

[04:51] and see if it can do that much better. [04:55] And surprisingly, [04:56] It was able to discover completely new algorithms that mathematicians had been trying to study for a long time. [05:06] But the limitation was that the mathematician or the researcher had to give a template. [05:13] in which the algorithm should be found. [05:19] With AlphaEvolve, we have removed that restriction. AlphaEvolve is not just sort of searching for a few lines. It's basically looking at whole algorithms themselves, right? Very, very large pieces of code and sort of optimizing them over a long period of time. And secondly, FunSearch, our original sort of model, used a lot of function evaluations, right? [05:44] to discover, to make these new discoveries. [05:47] AlphaEvolve can work with many fewer function calls. And it can basically, by looking at fewer proposals, it can discover new algorithms much more quickly. [05:59] Can you tell us about the role that the evolving Gemini models play in the capabilities of AlphaEvolve? And I think I saw in your blog post, you have both Gemini Flash and Pro involved in the harness. What is each responsible for? [06:13] Yeah, so I think... See... [06:15] We have been evaluating, as Gemini improves with various sort of generations, it is becoming much, much better at its understanding of code.

6:27-7:57

[06:27] Now, if you have a proposal generator which can understand code much more effectively, then it generates proposals which are not only syntactically correct. [06:40] They are also semantically trying to solve the task. And then you are sampling what are the different ways in which the task can be solved. [06:50] So as the baseline model, Gemini's abilities to perform coding improve, [06:58] our sample effectiveness in searching for the right solution on these very hard, [07:05] maths and computational problem becomes much better. So, [07:11] So if you want to search in a large space, [07:15] There are two sort of elements. One is the speed of how you can generate these proposals and then the speed at which you can evaluate those proposals. [07:26] right like so first how quickly can you say can you give me a new sort of candidate [07:32] algorithm and then secondly how quickly can you evaluate whether the algorithm is any good or not [07:38] And both things are really important. And the fact that you have these variants of Gemini, like Gemini Flash, which can do that very efficiently and very quickly, this is really important. [07:50] I know Alpha Evolve is more of a broad domain model than some of its predecessors. How broad is it? What's in scope? What's out of scope?

7:58-9:32

[07:58] Yeah, so... [07:59] Alpha Evolve essentially allows you to search [08:04] not only [08:06] in [08:07] Not only in terms of the size of what you can search over, you can now discover whole new algorithms, but it also is extremely general in its ability of thinking about algorithms in various different languages. [08:23] So, [08:24] Not only can it sort of search in C++, but it can also do it in Python. It can also sort of do in Verilog, which is what the language is for describing chips. [08:36] right, in chip design. So the generality of Alpha Evolve is in its ability to search for these large [08:46] algorithmic spaces, but also in different syntactic and semantic representations. It's not restricted to a particular language like Python. [08:56] but it can sort of do that search across many different types of languages. [09:01] and many different types of tasks. The only expectation it has is that you have a function evaluator, that you can quickly evaluate whatever proposal there is and say how good it is. [09:15] It seems like the rough kind of cognitive architecture, so to speak, of generating a bunch of algorithm candidates, evaluating them, and then, I guess, evolutionarily deciding which ones to keep and then going forward from there. And it seems like it roughly mirrors the scientific method. Is that intentional? Yeah.

9:33-11:06

[09:33] Yeah, so I think there are also, if you think about it, there is another sort of agent that we released earlier this year, which was called CoScientist. [09:44] And in CoScientist, essentially what you had was Gemini playing the role [09:51] of the whole scientific academic process. [09:54] So Gemini playing the role of a hypothesis generator. [09:58] Gemini playing the role of the critique. [10:02] Gemini playing the role of sort of ranking different sort of of reviewing those ideas and ranking those ideas and then editing those ideas. So it was Gemini playing, playing all these roles in a multi-agent role. [10:17] Setup. [10:18] And these are all sort of Gemini models prompted differently to play different roles. And very sort of interestingly, [10:27] this combined multi-agent system [10:31] came up with behavior that went much beyond [10:35] a single [10:36] Gemini... [10:38] sort of models answer. [10:41] So it was able to give much, much, much better proposals. [10:45] and new ideas compared to a single sort of model. [10:49] What's the intuition behind why that works? [10:51] Yeah, so I think it is something that is still being studied. But it is a fascinating sort of thing. The one thing that I actually sort of noticed is that, especially with regards to sort of co-scientists,

11:07-12:46

[11:07] you would run [11:09] sort of co-scientist on a particular project [11:12] problem. [11:13] And the very first... [11:16] answer that you might get might not be very different from the baseline Gemini sort of model. [11:22] Um, um, [11:24] And... [11:25] But what happens is even... [11:28] As you sort of increase the amount of computation over... [11:31] sort of, uh, [11:32] This is you're not talking about just a few minutes or a few hours, but even days. [11:38] As the whole multi-agent system sort of looks at the solutions and then refines them and sort of tries to sort of rank them, it just becomes much, much better. [11:52] why that might be happening. [11:54] It might be that the proposal, there is deep... [11:58] insights or there is some sort of intuitions that are buried in the tail of the distribution. [12:04] Right. And then somehow Gemini's ability, [12:09] to evaluate which [12:12] sort of proposal which idea is better. [12:15] is much better than its capability to come up with a sort of a new idea. It's the same sort of thing in computer science, right? Sometimes we are able to find, sometimes we know whether a particular solution is correct or not, but it's very difficult to come up with a solution. [12:33] Right. [12:34] So it's the same sort of thing appearing again in this multi-agent setup that somehow the agents working together are able to extract many more impactful results.

12:47-14:22

[12:47] It seems like the architecture of kind of, you know, generators and verifiers, it seems like that paradigm is being echoed across the broad AI space, whether it's, you know, very general models. [12:57] or very specific kind of like AI systems for very specific applications. Mm-hmm. [13:03] Is that fair that that's sort of the consensus architecture right now? And do you think that'll be the thing that people continue to push and scale? [13:10] Yeah, so I think there is going to be more work [13:16] What we are seeing is basically the very start of research on agents. Whether in Alpha Evolve, you had a generator coupled with an evaluator. [13:32] The generator was a neural network, a foundational model, an LLM, and the evaluator was even hand-coded. [13:39] But together with an evolutionary search scheme, [13:45] you were able to sort of get these much more effective results. In co-scientists, you didn't have just one agent. [13:54] you had multiple agents working in a shared memory. [13:58] Now, like what is the optimal agent configuration? [14:03] Like this is still an open research problem. [14:06] Super interesting. [14:08] Are the results that you're getting, are they different from the ways that humans... [14:12] would derive them. And I'm kind of thinking of the AlphaGo, Move 37 stuff. Are the methods different? Are the results, how do they compare to the ways humans would think about them?

14:22-15:53

[14:22] So let's go back to the original motivation for why we started even working on the first iteration of using LLNs for algorithmic discovery, which was FunSearch. [14:37] So, [14:38] A few years back, [14:40] Like, as you know, DeepMind has done a lot of work in using AI systems for searching over large spaces. We have done a lot of work in building agents which have been trained using reinforcement learning, which can deal with many complex challenges from the game of Go, Go, Go. [15:03] to playing StarCraft, which are quite complex challenges. [15:09] We... [15:10] set ourselves a challenge that can we take the same kinds of models? [15:15] like the AlphaZero family of models, which were extensions of what we had done in Go and the development of AlphaGo. Can we use the same types of models for discovering new algorithms? [15:33] And we came up with a new sort of agent called AlphaTensor. [15:39] which was particularly focused on finding [15:43] Um, [15:44] solutions for the matrix multiplication problem. [15:47] And we found that this agent was able to improve

15:53-17:33

[15:53] over the past known results which had stood for 50 years. [16:00] But the key question sort of remained, [16:03] can you do something better? [16:05] And secondly, can you sort of come up with a solution that is more interpretable? [16:10] - Yeah. [16:11] At the same time, like when we were looking at practical problems in Google, like how do you schedule jobs in a data center? [16:21] Now, there has been a lot of work on coming up with new algorithms, and these heuristics have been designed by some of the best researchers and engineers at Google, because they have a huge amount of impact in terms of computer utilization. [16:40] If you use a typical reinforcement learning agent on this kind of problem, [16:45] you might get better results, but it might come at the cost of interpretability. [16:50] Because now you have a neural network [16:52] deciding which workloads are [16:54] go to which computers. [16:56] And if something breaks, then you don't know how do you debug this thing. So what engineers would really prefer is instead of giving them a neural network, you gave them [17:08] a piece of code that they can interpret and they can run. [17:11] Mm. [17:12] And this was essentially the motivation. Can we now use LLM? So instead of searching in the space of... [17:20] of specific algorithms like we had done, matrix multiplication algorithms like we had done in AlphaTensor, or coming up with a neural network policy to directly solve the problem. Can we come up with an agent which can...

17:33-19:06

[17:33] search in the space of [17:35] programs. [17:36] and come up with a program that solves this hard problem. [17:39] And the benefit, of course, will be interpretability, that you can see the sort of code, you can see what its properties are and so on. And that's what happened. We found sort of programs that not only were effective, but when... [17:56] the experts actually saw those programs, they could sort of recover insights. [18:01] So, for instance, one of the maths problems that we had looked at for FunSearch was called the cap set problem. This is a problem that Terence Tao, one of the famous mathematicians, he is very interested in. [18:18] And we collaborated with this mathematician, Jordan Ellenberg at NYU. And when we looked at the program that FANSAGE had produced, he found that there were certain symmetries. [18:31] that were in the problem that had not been recognized before. [18:34] And somehow the program, like OneSearch, the agent had discovered those. [18:41] and was utilizing those to get a better solution. [18:44] Can you say a word about you mentioned working with Terence Tao and other famous mathematicians? [18:49] Is math considered the gold standard for testing and benchmarking? [18:54] if these models are generating novel scientific results. [18:57] Thank you. [18:58] Yeah, so math certainly sort of has some properties which are very interesting, right? The fact that it's very precise.

19:06-20:41

[19:06] like you can, you know, whether the property that you are looking for, whether you have found it or not. [19:15] You know the matrix multiplication, for matrix multiplication, how many multiplications you require. [19:25] like for a 4x4 matrix, what was known was that you can do it with [19:30] 49 sort of multiplications. [19:32] That's by Strasser. And we showed that you can do it by 48. So that's a very precise result. [19:39] There is no sort of arguing about that. So it gives you a very crisp way of evaluating [19:48] how well you have done. [19:50] Um [19:51] And there is no sort of like RLHF needed in terms of human feedback, whether this was a nice result or whether this was a nice output or not. [20:01] And you don't need to rely on sort of an LMSIS code. You just know. [20:06] that you're better. [20:08] Yeah. [20:09] Okay, so then when you go from the beautiful, pristine environment that is math, [20:15] to the real world. It seems like you all have found a lot of real world applications and data centers in the Verilog world. [20:22] Could you say a little bit about which applications you expect Alpha Evolve to be most impactful for? [20:29] Yeah, so wherever sort of you can find a good... [20:34] a function evaluator. [20:35] Wherever you can find an evaluator where you can say, I really trust this evaluation scheme.

20:41-22:12

[20:41] If you give me a program, I can tell you very concretely how good it is. [20:46] If your problem satisfies that... [20:50] that setup, [20:51] then you can use Alphabal. [20:53] Because unlike a human sort of programmer who can try 10 things or 100 things or a thousand things, [21:01] Alpha Evolve does not, like it does not, it does not sort of, it can go on and on and on and on. [21:09] It can come up with very counterintuitive strategies to solve that problem, some things that you might not have ever imagined. [21:20] Can you have humans be the function evaluators? [21:22] Or does that not work? Humans can be the function value. [21:28] It's a question of sort of scale. [21:30] Right? Like how many can you sort of evaluate and whether you can [21:37] evaluate. [21:38] the property of the program effectively. [21:41] So at scale, [21:43] and with the right level of accuracy. [21:46] How do you do that? Do you build that into the application itself so that there's a human in the loop evaluating as it goes? [21:54] Do you do that offline separately before the application is produced? I guess, how do you do that or how do you imagine people doing that? Yeah, so I mean, like, so we haven't used a human in the loop for Alphiball. [22:08] Most of our evaluators were programmatic evaluators.

22:12-23:43

[22:12] But imagine a hypothetical scenario where Alpha Evolve was told that you have to solve this math problem and come up with a new algorithm to solve this problem. And suppose it came up with many different types of problems, many different kinds of solutions, which are all equivalent. [22:29] in performance. [22:31] But then which one is the best? The best is the one which is not only sort of very effective on the problem, but is the most elegant. [22:42] according to a mathematician or the most simple to understand. [22:46] And that's a very subjective human thing. [22:50] like simplicity or interpretability. [22:53] like we don't have a sort of crisp definition of it it's it's it depends on it is [22:59] Grounded. [23:00] in the human. [23:02] Observer. [23:03] At what point do you need to pair kind of what's happening in the digital world to any kind of physical world stuff? I think in your blog post, you mentioned that you could see also Evolve being useful for, for example, for material science. [23:15] Do you need to be able to connect to a real world laboratory to kind of [23:19] get any of that feedback? Or do you think all of this can kind of happen in the algorithmic domain? Yeah, that's a very good question. And I think this goes back to how much do you trust the evaluator? [23:30] if you, if, uh, sort of, uh, [23:32] if your evaluation was based on a computational sort of method, [23:39] And the computational method was perfect and you completely trusted it.

23:43-25:18

[23:43] then you don't have to. Then you think, well, I believe the computational model, the computational model says that the solution that AlphaEvol came up with is... [23:54] satisfies these properties. [23:56] job is done, right? But if you don't believe, [23:59] that the computational model is the perfect [24:03] uh, [24:04] characterization of reality, then you want to make sure that you sort of validate that result. [24:12] in the real world. [24:13] And you see whether that assessment of the evaluator was indeed correct. [24:21] As Alpha Evolve becomes more and more successful, as Gemini becomes more and more powerful, what do you think happens to these domains and how will the human scientists and engineers working in them work? [24:32] So, for example, if you take chip design as an example, you mentioned these models are getting very good at generating Verilog, creating new chip designs. Does that mean the role of a chip designer is? [24:45] goes away, changes? Like, how do you think that this changes the world? [24:48] Yeah, so I think that's, again, sort of a very interesting point [24:53] question. I... [24:55] Give you the example of what happened with AlphaFold. [25:00] So we started working on this problem of protein structure prediction. So for those of you who don't know, proteins are the building blocks of life. They are the Lego blocks. [25:09] of life. [25:11] And, uh, [25:13] For many, many decades, scientists have been trying to figure out what is the shape of proteins.

25:19-26:51

[25:19] Because if we understand the shape of proteins, we understand how they function and we can use that to sort of develop new drugs to treating sort of the most sort of challenging diseases on the planet. We can develop better sort of enzymes and so on. [25:36] Now, [25:37] In 2021, as I sort of mentioned, we released AlphaPole too. [25:43] Before that, you used to take... [25:45] or even a single protein, sometimes... [25:48] one to five years. [25:50] to find the structure of a single protein and it might take a million dollars. [25:54] And there were some proteins which were so notoriously hard that people had been trying to study them for almost one or two decades and had not found the solution. [26:05] And which is why only 37%, roughly 37% of the human proteins, [26:11] their structure was known. [26:13] So after we released AlphaFold 2, I went to a biology conference. [26:18] And because with alpha fold 2, we could find the structure of all proteins, not just human proteins, all proteins on the planet. [26:26] And we made the structures available to everyone on the planet. [26:29] So I went to a biology conference and after I gave my talk, a biologist approached me. [26:35] And he said... [26:38] Pashmeet, I have been working on this protein [26:43] for the last 10 years. [26:46] And I had collected so much lab data,

26:51-28:22

[26:51] to characterize this protein [26:53] to figure out its structure. But somehow this has eluded, this has evaded all kinds of investigation and we still didn't know the structure. [27:02] But we had all this data. [27:05] If we knew the structure, we could sort of validate it very quickly. [27:09] Iron, alpha, fold, do. [27:11] It gave me the structure, it perfectly fit the answer. [27:15] I've been working on this for 10 years. [27:17] What do I do next? [27:21] So what has happened? [27:24] after alpha fold 2. [27:26] What happened is basically suddenly... [27:29] It... [27:29] There are three things. [27:32] It first advanced structural biology. [27:35] What was not possible earlier, it would take a synchrotron and six months and a million dollars is now done in a second. [27:44] Right? [27:45] So it really advanced what was possible. [27:48] Secondly, it accelerated it. [27:50] And thirdly, it democratized it. [27:53] Like that particular scientist working in Latin America or South Asia or Africa on some neglected tropical disease had no chance to sort of figure out the structure of the protein. They did not have the funds or have access to instruments that could find them the structure. [28:09] Now they have access to those things, to like any sort of parasite that they're working on. [28:15] So what do they do? [28:17] They are now working in this new model where structures of proteins are not

28:22-29:53

[28:22] hard to get, they are everywhere. [28:26] And so they are working on the next set of things. Like how do you now use that knowledge [28:30] to treat [28:32] diseases and design better drugs. [28:34] And I think the same thing will happen with Alpha Evolved. When you have these agents which can go beyond... [28:42] human abilities in solving these problems, then the question becomes which problems do we solve? [28:49] What are the important characteristics of a chip? [28:52] that we need to improve on. [28:55] We want to make it much more efficient, [28:59] much more sort of [29:01] So that it requires less cooling. It requires sort of less expensive construction mechanism. It's more fault tolerant. Many other things. You can make the problem more and more sophisticated because now you have more sophisticated systems to sort of optimize them. [29:17] While I have you, something I've always wondered, the AlphaFold results are... [29:20] are phenomenal. And the story you shared with us is really impactful. Um, [29:26] Do you think that it's caused an inflection point in the... [29:29] kind of [29:30] availability of new drugs? Or are there other bottlenecks now that are just, we're faster at one part, but unfortunately, everything else is just hard. So we're still slow overall. [29:42] No, it's... [29:43] So it has speeded things up, but I think there's a... [29:48] We want us to understand that... [29:51] Drug discovery is a long...

29:53-31:27

[29:53] process. Now, what are the roadblocks for drug discovery? First, you have to understand the target. You have to understand the [29:59] here's a protein in the body that I need to bind because this protein is somehow involved in... [30:06] the disease. So, if I can somehow [30:08] bind something to this protein and change its function, [30:12] It will... [30:13] have an effect that can [30:14] treat the disease. First, you have to come up with that conjecture. [30:18] Then you have to say, okay, now I have a target protein. [30:22] how do I develop a drug? How do I develop a small molecule? [30:27] or another protein that binds to it. [30:30] So for that, you needed to understand the structure of the protein. [30:33] which other proteins that it interacted with. How did it interact with this molecule? This would take... [30:39] a significant amount of time, sometimes two years. [30:42] Now that process is [30:43] dramatically sort of [30:45] accelerated. Now you can do it in sort of a few weeks or a month, a few months that took you [30:52] multiple years sometimes. [30:54] But that's not... [30:55] the end of the story. After that, [30:57] you need to now clinically validate it. [31:00] So you have to go through phase one trials, phase two trials, phase three trials. You have to think about toxicity, all these other sort of things. So what AlphaFold did was... [31:09] Take 1. [31:11] blocker away made the overall timeline faster, but there are other sort of blockers which [31:16] Our new generation of AI for biology models are hoping to. [31:20] accelerate and make much faster. So we have taken a big step, but we need to take

31:27-32:59

[31:27] A few more big steps. [31:29] What domains do you think will be most lucrative for this family of models? [31:34] I think... [31:36] The question is, what is, I mean, the answer to your question is basically what domains do you think are important for society? [31:45] Because AI is going to accelerate everything. [31:49] It's going to accelerate healthcare. It's going to accelerate sort of the ability for us to develop more smart systems, [31:58] Uh, [32:00] from healthcare to sort of material science. Like if you think about the history of our civilization, [32:07] We even describe our civilization in the sense of there was [32:11] First, we were sort of cave dwellers and then we went into the Stone Age and then we sort of went to the Iron Age and then the Bronze Age. And now, depending on... [32:22] who you talk to, you're either in the silicon age or in the plastic age. [32:25] whether you're optimistic or feeling a bit sort of sad. Um, [32:30] But... [32:31] If you take a step back and you think about what has... [32:36] humanity achieved. [32:37] What we have achieved compared to any other species is the ability to transform energy, to leverage energy. [32:44] We have been able to leverage energy and do big things with that power. [32:50] Thank you. [32:50] Now, if you can come up with, say, a new room temperature superconductor, [32:56] That completely transforms your ability to handle energy.

33:00-34:31

[33:00] Hmm. [33:00] Right? [33:01] What changes will it bring about in society? [33:05] They are hard to predict. If you can [33:08] if you can deal with energy in that way, [33:10] Right, if we can unlock fusion. [33:14] Right? [33:14] and energy becomes so cheap. If you think about geopolitics, if you think about economy, a lot of it is about energy. [33:22] And suddenly, if energy sort of goes down to zero, [33:27] What will be the impact on [33:29] On. [33:30] on the economics of the whole thing. [33:32] Similarly, [33:33] Like if you think about like, [33:36] Good day. [33:37] And if, [33:37] you have these agents which can code, [33:41] What does that mean? If everyone can sort of code... [33:45] like intelligence sort of is completely ubiquitous. Everyone has access to all these different things. [33:52] So there will be dramatic changes and everything will be impacted. [33:56] from materials to energy to sort of coding to healthcare. [34:01] Really cool. [34:02] Do you think we're going to have a fast takeoff moment for scientific discoveries? Do you think we're... [34:07] At the ramp of one, you think we're already there? I think we are living through the middle of it. We just, we don't, like when you're in the middle, you don't really see it. [34:16] But I think we are already in that [34:19] era of AI accelerated scientific discovery. [34:22] What do you see as the biggest bottlenecks going forward? [34:24] I think... [34:26] sort of two [34:27] Elements. [34:29] One is

34:32-36:05

[34:32] Validation. Bringing... [34:35] bridging the gap between the digital and the real world. [34:39] Right? How do you validate [34:41] Some of that. [34:42] So that is one sort of key point [34:44] idea, right? And really sort of capturing what is important for the problem, right? And the second is sort of the other bottleneck is [34:54] Uh, [34:55] How do you... [34:56] make this technology accessible. [35:00] You can build the most sophisticated technology if people don't know how to use it. [35:05] then [35:07] then you will not have the impact that you want. [35:10] right? Alpha fold 2 [35:13] the [35:14] It was not [35:16] Just... [35:17] impactful and transformative because it had very high accuracy. [35:22] Because even if it was quite accurate, it was not perfect, right? And suppose it was accurate on 99% of the... [35:30] of the things that it predicted. It's definitely not at 99%, probably at the 90 or 95% mark. But suppose even it was accurate at 99%. [35:42] The one person who got unlucky... [35:44] with their prediction. [35:45] and then spend the next sort of one or two years [35:48] chasing a wrong prediction, [35:51] would then... [35:52] sort of say that I should not use it. [35:55] I should not sort of use the predictions. [35:58] So why is everyone using AlphaPort? [36:01] They're using AlphaFold because not only is AlphaFold good at

36:06-37:37

[36:06] Making these predictions, [36:08] which are accurate, but it's also very good [36:10] in understanding [36:12] the limits of its predictions. [36:15] when it makes mistakes it basically holds its hand and says [36:19] I've made a mistake. [36:21] So now, if it is making your prediction and saying I'm very confident, like most of the time, it's correct. [36:27] And that's great. This is something that the LLNs of today don't have. They don't have calibrated uncertainty. [36:34] Should we close out with some rapid fire... [36:36] Questions? [36:37] Yeah, sure. [36:38] Must read paper of the year. [36:41] Must read paper of the year. Oh, uh... [36:45] I would say alpha evolved or co-scientists. I like, like, like, uh, [36:50] Yeah. Favorite algorithm nobody talks about? Oh, the wake-sleep algorithm. And very few people know about it. But it's essentially the idea-- it's a paper from MIT [37:04] from Kevin Ellis and Josh Tendenbaum [37:07] which sort of talks about [37:10] it's a way of sort of... [37:13] doing training where you find some exploration and then you somehow build the gist of it. [37:20] So think about sort of library construction. [37:24] An analogy is library construction. You don't just want to write programs, but you want to also create the libraries that have common modules [37:32] That will make all your future programs much easier to write.

37:37-39:11

[37:37] Hmm. [37:38] Very cool. Agree or disagree, inference time computes will be the next major lag of compute scaling. [37:44] Um, [37:47] Somewhat agree. [37:48] Okay, say more. So I think interest time compute will be very, very important. [37:55] I think also test time, sort of training time compute. [37:59] will be [38:00] equally important, right? We also, like if you look at distillation, [38:03] how powerful distillation sort of has been, right? So if these models have a [38:10] have an ability to sort of understand and conceptualize. [38:15] what these models are able to do. [38:18] and come up with better inherent representations. [38:22] then they just become much more effective. [38:26] in making predictions, maybe their sort of uncertainty improves and so on. [38:31] They become more efficient even. [38:33] Robotics, bullish or bearish? [38:36] Um, [38:38] I'm bullish about everything. So I have to say bullish. I think everything will have an impact. But the question is basically near term or longer term, right? In the near term, getting robotics to work is challenging, but in the medium to long term, I think I'm bullish. [38:59] Humanoid robots, bullish or bearish? [39:02] We have constructed our world. [39:05] for humans. [39:07] right? We like the human form. A lot of the

39:11-40:41

[39:11] the non-natural world around us. [39:15] is made for humans, has been designed for humans, like from an architecture perspective. [39:20] Right. [39:21] Now, humanoids have the same form. [39:25] as humans. [39:27] So they will fit in [39:30] in all these different architectures that we have built. Now, whether they are the most optimal thing, [39:37] That is not clear, but they certainly sort of have... [39:44] an advantage that we designed everything for the human form. [39:49] and now humanoids have the same form. [39:52] Hmm. [39:53] Future Nobel Prizes in the Sciences. [39:57] uh, [39:57] Will all of them be one? [40:00] by teams working with AI. [40:02] No, I think we would be getting there, but I think like humans are still winning Nobel Prizes in the sciences. So I think... [40:11] Um, [40:13] I think they will come. [40:14] a point where [40:16] AI will be indispensable. [40:18] So it will be sort of humans and AI teams working together. [40:22] to achieving these amazing breakthroughs. [40:26] Then. [40:26] Pushmeet, thank you so much for joining us today. These are really fundamental, really general results that you're pushing forward at DeepMind. And we appreciate you joining us to share more about how you managed to do all this so far and what's ahead. Thank you. [40:41] Thank you.

40:42-41:08

[40:42] *music* [41:06] .

Want to learn more?

Ask about this episode