Training Data ยท 2026-06-11

Google DeepMind's Logan Kilpatrick on Agentic AI, Coding Agents, and Omni Models

Hosts: Unknown

Guests: Logan Kilpatrick

agentic AIcoding agentsGemini APIAI StudioOmni modelgenerative mediaworld modelsAI product strategyDeepMind culture

Why it matters

Google's agentic AI era is powered by the 'anti gravity' agent harness, providing a unified framework across Google products for autonomous agent capabilities.

Key claims

  • Google's agentic AI era is powered by the 'anti gravity' agent harness, providing a unified framework across Google products for autonomous agent capabilities.
  • Coding agents are a key use case and represent a form of narrow superintelligence, significantly accelerating software development and research progress.
  • AI Studio has enabled the creation of over 350,000 Android apps recently, democratizing app development and unlocking personal productivity use cases.
  • Omni is a new single, multi-modal model integrating text, audio, video, and other modalities, enabling advanced generative media capabilities including real-time video editing.

Briefing memo

Summary

In this episode, Logan Kilpatrick, head of Google AI Studio and the Gemini API, discusses the evolution of Google's AI strategy centered around agentic AI and the 'anti gravity' agent harness that powers a growing number of Google products. He explains how this harness enables agentic capabilities across coding, search, and consumer applications, emphasizing a shift from maximizing user eyeballs to maximizing user outcomes. Logan also highlights the rapid progress in coding agents, describing them as a form of narrow superintelligence that accelerates software development and research productivity.

The conversation covers Google's approach to integrating AI into developer tools like AI Studio, where hundreds of thousands of Android apps have been generated, and the challenges and opportunities in generative media and video game creation using coding agents combined with world models. Logan elaborates on the concept of 'the model eats the harness,' describing how AI models increasingly internalize external scaffolding and agent frameworks, which may shift where competitive advantage lies in the future. Finally, he reflects on DeepMind's unique culture within Google, balancing deep scientific research with applied product deployment across Google's ecosystem.

  • Google's agentic AI era is powered by the 'anti gravity' agent harness, providing a unified framework across Google products for autonomous agent capabilities.
  • Coding agents are a key use case and represent a form of narrow superintelligence, significantly accelerating software development and research progress.
  • AI Studio has enabled the creation of over 350,000 Android apps recently, democratizing app development and unlocking personal productivity use cases.
  • Omni is a new single, multi-modal model integrating text, audio, video, and other modalities, enabling advanced generative media capabilities including real-time video editing.
  • The concept 'model eats the harness' describes how AI models increasingly internalize external agent scaffolding, potentially reducing the need for separate agent frameworks over time.
  • Google DeepMind maintains a culture focused on scientific rigor and mission-driven impact, while collaborating closely with Google's broad product ecosystem.
  • Agentic AI is still in the 'crawl' phase across most Google products, with more advanced 'walk' stage examples like the Gemini app and autonomous coding agents.
  • There is strong belief in the continued opportunity for startups and independent companies to innovate in vertical domains despite increasing AI model capabilities.

Source material

Transcript

So we could edit this set so it looks like we should.

Yeah, yeah, I want this where we were talking off camera like we should do that for the intro because I think it just like makes all this stuff more capable.

I've seen these examples of like such subtle nuance that like make me appreciate that it's like the world understanding playing out.

I was I was giving a talk and was on stage with with my friend Tulsi who leads the model team.

I had mentioned to someone in the crowd to like edit the video and they literally took the picture edited it with Omni in real time.

And this like dog came on the stage in the edited version.

The other guests sort of like look down and see the dog.

They like chuckle a little bit.

This is wall.

I'm like, oh, pining about whatever.

Yeah, it was not my joke.

They laugh at the dog coming up.

It jumps onto my lap.

I sort of like acknowledge the dog.

I keep talking.

I'm like petting it or whatever.

Just like there's like so much subtle subtlety in getting that right and the model crushed it.

And it's just it's very interesting and like still trying to like absorb and digest like what that means for you know the way we make content and all these other things.

That's so interesting.

I'm delighted to have Logan on the show.

Logan runs Google AI Studio in the Gemini API.

You spend a lot of your time thinking and building for the next generation of builders.

Yes.

So I'm excited to talk to you about everything from magenta AI to AI coding world models and more today and right off the heels of Google I.

So what better timing?

Yeah, I'm super excited.

Thank you for having me.

Wonderful.

Let's start with agentic AI.

So send our open IO by calling this the agentic Gemini era.

What does agentic AI mean for Google?

Yeah.

It's a good question.

I think and we were sort of if you if you followed closely we did sort of mention some of these things back with like Gemini 2.0 which I think was like a little bit early.

And so I think this era this like Gemini 3.5 era feels like it's actually becoming true now and we're in the era of agentic coding or agentic products and everything agents as far as Gemini.

I guess as far as Gemini goes I think for us this agentic layer and I think we announced this actually at IO sort of being powered by the anti gravity agent harness is this like additional through line for Google that sort of connects all of our products that they're sort of like based on now.

And so historically like prior to Gemini they're actually like wasn't a through line for the you know probably sub 100 number of Google products that we have the 50 Google products we have there wasn't a through line.

There wasn't a through line we had Gemini became this through line everything is now sort of using Gemini in some way.

That's now becoming true for antigravity a sort of all of the products rebase to become sort of like agentic native products and like actually taking action on behalf of users and helping them get things done.

You see this like new through line emerging which I think is actually really really interesting.

I'm sorry help me with antigravity is the idea right.

Yeah antigravity is a lot of things which I think is is sort of again is an opportunity for us you have sort of a core IDE you have sort of like the agent first experience if you want it on the web.

You have a CLI you have an SDK so I actually think and I don't know how much we framed it this way but like it really isn't ecosystem of stuff that we built.

It's designed to sort of like meet developers wherever they are so you could use it through the Gemini API if you want to and you want to manage agent that you don't have to do any of the sort of infrastructure work for.

And then the most interesting bit is like it's not just the ecosystem of antigravity stuff it's also power like literally it's the the same harness is actually powering all the other Google products.

Antigravity will be powering a bunch of agent stuff and search in the Gemini app across like cloud and and as you do which is really exciting.

I see so it used to be the Gemini API so like the language model was a through line in terms of how I get baked in every Google product.

Yeah.

And now it's not only the API it's the the coding harness.

Exactly.

That's being used in each of these products and therefore it's the coding agent itself that's driving more agentic properties.

Yeah.

So the products.

Yeah.

Fair description.

Fair description.

I think more generically too is just like it is the agent harness.

I think like coding as sort of like a specialized use case of the agent harness.

I think is is obviously powerful but it is like coding has proved to be the general purpose agent harness in addition to also working really well for coding.

Our agent harness and coding harness synonymous or not.

There's definitely nuance.

I think there's like optimization that you can squeeze out of like specializing and actually you see this for like the you know technically the agent harness that gets used for the way that a studio uses it is like a little bit specialized for you know the vibe coding use case.

And the the way that the Gemini app is using the agent harness is a little bit specialized for the sort of consumer always on 24 by seven agent.

So I think you have that base harness that like probably has like 80% of the same stuff and then you specialize for for coding or for whatever you see.

Interesting.

How do you think about the cannibalization of the existing business especially now that you are you know going much more aggressively into agent tick properties because I could see.

For example, if all you're doing is search or summarization there's you know not as much of a cannibalization fear whereas if you're actually going through my emails reply.

Reflying to them for me like am I even going to my email anymore and so I could imagine that there's actually just fewer human eyeball hours.

On your products as a result of having more agent to capabilities is that fair or how do you think about the cannibalization.

Yeah, it's interesting I think.

One sort of observation I have is that like at the beginning and I think soon or it's done a great job of sort of talking through this is at the beginning of the sort of current AI era like everyone assumed that.

AI being able to answer questions for you was going to be like negative some for for search and actually what's ended up happening has been incredibly positive some for search like people are searching more people are doing more.

And so I think.

Yeah, and agent actually again there's like this whole market that spawned at the same time that agents are doing more at the same time that humans are also searching more and so I think it will be and obviously there's a finite amount of like human time in the world.

But from from like my early feelings of how a lot of this is playing out it does feel like it's it's very positive so I'm from like an ecosystem value creation like how the human behavior aspect of it turns out.

I think is like somewhat clear in the next one to two years much less clear you know three to five years from now when the technology is improved and the products probably look a little bit different than the way that they do but.

But ultimately like that is the success of product I think like we have a bunch of conversations with demos all the time and it's like the point of building.

The technology is so that it can go and do stuff for you like the point like success for Google like probably doesn't look like.

You know maximizing eyeball time in front of our products it's like maximizing outcome for customers to like do the thing that they want to do so that they can go and live their life and do what they want and so I feel like.

You'll you'll probably see us go down the route of like maximizing outcomes for customers and like not maximizing eyeballs yeah.

I've this term second I had agent live growth like it seems to me so I'm using using coding agents lot in my personal time and you know I just let the agent make all the infrastructure choices for me I'm like I don't care what's database you tell me.

Yeah and and so and the reason I ask is you know it's true encoding today I would imagine it's maybe going to be generally true for a lot of things let's say shopping down the line.

How do you think that's kind of change how advertising works how value capture works for for the aggregators.

It feels like it's a very similar trend this isn't perfectly true but a lot of these things are just like proxies every other like the way that SEO works I think like is directly correlated with like the way that.

Like I forgot what the term now for it's like GEO is like the generative engine optimization or whatever it's called.

And so it does feel like there's a lot of correlation between between the things my guess is it looks like much less of a radical shift than than the then I think maybe what we assume right now just because these things compound on top of each other.

If you were to you know grade the scale of agent thickness in terms of crawl lock run or are we in terms of how agent take the Google suite of products is.

Yeah that's a that's a great question it's definitely like crawl right now and I think some of this is like all of the inherent product tension for Google is like you have what 13 billion plus user products and so like actually think we have some.

More like labs like experiences where you're probably closer to running or walking.

But I think like most of the product experience today is is definitely closer to crawling and I think that's just like the stewardship responsibility we have sort of building a product that's being used by lots of people like I don't think the long tail of customers are like.

Ready to have AI running and just doing all the things like they probably they want to be in the driver's seat they're cautiously taking the first step and I think the the Google team like searches maybe like the most quintessential example this like I think they have a lot of responsibility to actually do that in a way that it.

brings people along and doesn't just like change everything of how they interact with the internet and the way they associate with products and stuff like that.

So yeah which products do you think are closest to the luck that's a good question.

I think Gemini app is definitely closest to walk and so for for spark I think having a 24 seven always on agent like literally going and potentially doing a bunch of actions on your behalf.

is definitely like one of the frontier use cases and I think you'll see I think like anti gravity is another one where it's like you could have autonomous coding agents you know rebuilding operating systems and doing you know billions of tokens and spending thousands of dollars on your behalf and I think those are.

again like more and actually like they're in gdm as well as another angle of this so I think like gdm is taking like very much like a frontier look at this where I think like the rest of Google's products I think are like more incrementally getting there which again makes makes reasonable sense to me.

Do you think that Google ends up with one two three products surfaces for using AI or thousands.

It's tough.

I think a lot of this is actually baked and just like how humans consume products and my sense is that there's something nice about like having this like compartmentalization and this like specialization of products where like it.

becomes if you end up with a product that is like doing everything for you inherently there's more work involved in using that version of the product.

I think I think would be like the default say I think maybe somebody will spend together like the truly magic experience that doesn't make that true but.

I think I think the long tail of folks end up having to spend more mental energy and more time to actually like get the general purpose product to do the thing that they actually want to do versus like there's something nice about.

I click my calendar app it just shows me my calendar like I don't need to worry and deal with anything else.

This is my hot take for my slide decks have existed for so long of just like you know the thing the piece of information you want to be exactly in the same place.

And I think like we as humans are just actually very used used to that the idea of a generative in their face sounds so cool to me but it's like our door brains really isn't just more cognitive overhead for us.

It definitely is uncertain cases and I think somebody needs to again there's there's a lot of incredibly smart people in the world and so maybe somebody will find the experience like makes it feel.

But to me right now I'm I'm maybe not 10,000 is the extreme version I'm guessing it looks more like more products going after sort of like different and maybe the other answer is like I don't know what it looks like for Google for the ecosystem it looks like a lot more products I think like and that that's really exciting.

I think like how Google will end up strategically deciding like do our customers want to deal with us having 10,000 products or would it be better to only have three.

We'll come down to like a strategic decision for us that's only makes sense.

When I talk to companies in the enterprise they say you know everyone's talking about agentic AI but they only place they've seen agents really working is coding agents.

Do you agree or disagree with that take.

Yeah I think it depends what your bar for working is which I think is a lot of the nuance of this like I think if you're if you're truly trying to like offload.

Very complicated tasks for for domains in which like it's the models have an actually cross the threshold of quality then like I think that's definitely true like the it's not going to solve the problem but this is something that I want I wish we could like measure a good example is like open router for example is like measuring you know the total token consumption that's happening and so you can sort of like see these trends play out over time of like how much more intelligence is in the world.

You know now versus a year ago in parallel the thing that I'm actually really interested to measure is like how long is the average like thing.

The average like agent run or the average task actually taking place and it's I don't think it's something that they publish but I feel like they probably have interesting data on sure there's others.

Because because I do think you're like seeing these like new model capability lands or new model drop and and it's like spiking up and maybe the curve is still like very low right now.

But like you're seeing those like early signs of it spiking up where to like long running tasks and all the model labs are talking about like we've released this new model and it did you know three days of autonomous work or whatever it is.

That's the extreme but I think in practice you're seeing that like trickling up like pretty pretty quickly which is really interesting so even if the enterprises haven't felt it outside of coding like they are going to like this year.

As sort of a bunch of those other use cases get get much better as well from like a you know from the deep mind perspective do you think long horizon agents is like a KPI that matters is it they is it the KPI that matters.

It definitely matters.

I think for deep mind like we're doing lots of things which we can talk more about later like there's you know a huge portfolio of different bets that are taking place.

Long running agents obviously matters a lot and I think also like specifically coding agents and that matters a lot like it clearly is an accelerant of like every other part of your business if you have a great coding model.

And so making sure we have that thing is super top of mind.

Got it.

I'd love to shift gears a little bit and talk about coding.

Yeah.

Okay, I ask a hard question.

A lot of my developer friends were using cloud for a long time.

Open AI saw that declared code read codex is now really good.

I would say my friends are maybe split 50 50 now in using cloud and using codex.

I don't hear a ton of them using Gemini which is always kind of puzzled me.

What's going on with that?

Yeah, it's a great question.

I think there's one there's one part of the story that I'll add which is which which makes it even more interesting which is.

December the narrative was that Google had one and when we landed Gemini three I think it was like such such a profound improvement from a model capability perspective.

I think a lot of the narrative was like Google has taken a hugely forward and and made that happen and I think what was interesting to see sort of as a as an ecosystem participant.

It's like how not how quickly that narrative shifted but just like the next wind of the narrative obviously was like all the agent and coding stuff that happened over over the holidays and then into January and beyond.

And that was that was not that long ago.

And so it's a it is a living in warp speed ever since.

Yeah, for sure.

But it is it's a matter of matter of like just how fast things can can change.

I think the observation is is not is not unreasonable.

I do think the what's happening behind the scenes for us is like trying to push the frontier as fast as possible on coding.

And so I think anti gravity actually like isn't important part of that.

I think one of the takeaways is that it's actually really hard to make a great coding model for this like.

For this developer use case of like really long running sweet work.

If you don't actually have a product that does that and so I think like Google realize that that's why the sort of like when surf.

Deal happened it's why those folks came over and then ultimately built anti gravity and sort of we've been using internally actually in soon our show this at IO.

Just like the graph of growth of token consumption inside of Google.

So you sort of like you need that engine to spin and sort of the meta comment again is like the engine is spinning it takes time in order to like actually make model progress.

But I'm super confident I think the the folks the group of folks who we have working on code is like I describe it as like the Avengers of AI internally.

And so like it really is like the some of the best people inside of Google trying to put to rock up the hill on this stuff and taking it super seriously in trying to push and I think three flash.

You know notwithstanding like some of the conversation about like the price and stuff like that like is sort of a step towards actually starting to bring a lot of these capabilities.

And like the fruits of that labor paying off like it's a flash model that's better than any pro model we've ever released from a coding standpoint and the pro models are really good before so there's another threat of this.

Also which is like everyone forgets that there's like pre training windows and I wonder like somebody should like track this online which would be interesting this meaning like the big run like what clusters have been available.

The big the big runs are like are an interesting threat of this and so like like it might look from an external perspective that like oh you're you're super behind in some way and like actually you miss all the context of like where the big runs are and where the large pre training runs are.

So I think that that also like obviously there's pre training is historically been like a massive strength for a deep mind like we have some of the best people in the world.

And so excited to see sort of the fruits of that labor and and everything else that's happened like 3.5 flash was like all post training gains which is really cool.

So a huge huge testament to the team that the work that that team did to actually like make the level of gains and like surpass the previous pro model.

Literally just with post training which is awesome.

How religious are you all about dog fooding internally?

Like our for example our deep mind folks still allowed to use other models or is it like you guys are using the Gemini hardest now and we have to make this really really good.

Yeah, there's I mean I think people it's so healthy to be using other models just because like it's it's so sometimes hard to like actually grow.

What's happening in ecosystem if you're not so like I use all the models I use all the products I think like you know folks across the rest of deep mind are doing the same thing you definitely have to use the Gemini models though.

It's just like great from from a feedback flywheel perspective and it's part of how they get better is like deep mind has and Google more broadly has like a hundred thousand plus incredible engineers who are using the models and giving feedback and like it should be a competitive advantage for Google.

Because we have that scale of sort of engineering resources and like the depth of the talent and can run you know AB tests and live experiments and all that stuff so.

I think you have to use all the models but I think for for the majority of folks it's like Gemini as the daily driver which is great.

Do you believe in this narrative around like a like a soft take off of like once you have a good enough agency coding model.

Then it accelerates the pace of research progress and like it's a self reinforcing cycle.

It seems obvious that that's true but I don't maybe I'm I'm too I've drank in too much cool laid that that's the best the case.

Are you seeing the signs of it yet?

Yeah, I mean you definitely see some signs of this I think the signs that are like still early is doing this from a model perspective and I think part of the context of that is like the resource allocation.

For some of these like larger training runs is just like significant and so like you definitely still have like a human in the driver seat of making those decisions because like you're not going to.

Accidentally you know take 10,000 TPUs to go kick off some job that like actually doesn't make that much sense.

But from a product perspective you for sure see it like I think we're seeing this on our team like we've built mobile apps using anti gravity and like we'll want to them to the world like faster than I think any team at Google has ever built a mobile app.

Josh's team did this with the Gemini macOS app and sort of like end and delivered an app sort of faster than any team had ever delivered a mac app at Google.

And it's because of it's because of a data coding and so it's great for my product perspective.

I think you said in the past that if you could have a system that could build anything with code humans can't compete on the same level and that's narrow super intelligence.

Do you think we've reached that point?

It is interesting I think this like narrow super intelligence example is interesting to see how it obviously kind of feels that way for coding right now we're like coding is like just so good.

That it does kind of feel like narrow super intelligence I don't know it depends how you actually end up the details of quantifying this but I think the important thing is like to your point earlier it works incredibly well for code.

And so it would be great if it did a bunch of other things but it's actually just like so impactful that it can be great at code.

And so I spent a lot of time just like letting that that fact sort of just like wash over me because I think it's like obviously building a GI super important and very interesting but like building a GI if it sort of like takes away from the story of like the current present capability of the technology I think is actually like kind of a bad bad sort of like trade off and so I'm trying to like always hold these two things in my head equal at the same time which is we need to build general purpose technology but obviously it's so impactful to have this thing.

And it feels like it hasn't taken away sort of it's been one of the best positive outcomes is that.

I feel like it hasn't taken away from like human developers it really does feel like an accelerant of what human developed like I as a human developer feel like I have more agency in the world I feel like I can tackles my personal experience I feel like I can tackle more ambitious problems I feel like.

I used to kick around ideas and they were like slightly out of reach and I would just be like I wouldn't be nice and now I have the opposite problem which is I'm kicking around an idea and I'm like I could probably make this even more ambitious and sort of it does it adds a different layer of sort of.

Responsible or like some different layer of burden actually because I'm like oh I can't just like do the the sort of MVP of this like I actually need to like go 10 steps further because the technology enables me.

And like resetting my my level of ambition I think is something that I've also spent a much of time thinking about but I think that will happen in other these like vertical super intelligence domains.

Which will be interesting and it feels like we're going to get a bunch of those before we've like solved like it's almost like jagged like jagged super intelligence I think is what wound up with.

What vertical do you think will get super intelligence at next.

That's a great question.

I do spend a lot of my time too much time probably thinking about coding these days so I'll think for a second of like the other the other domains.

I think part of this is like things that have like better verifiability obviously are like the ones where you'll you'll see the gains happen more quickly.

So like things with like math and finance actually like science could be really interesting one like it would be fascinating to see like some of these domains where.

There's some level of verifiability like actually like really start to take off.

Which would be cool and I also think like an important thing in this like broader narrative about just like what what impact AI is having on the world like you almost like want that to be the case in the sequencing of like things that work you want a lot of these like really really good impactful positive things for the world to happen.

As early on as humanly possible so that like folks understand what the potential positive impact of the technology is so I think science could be really interesting one.

Yeah, obviously there's all the stuff happening right now with like math proves and stuff like that which I'm not a mathematician so it's it's somewhat over my head but.

I saw a great sweep in a day why did there's have so many problems exactly that's a good one I like that is a good like to share.

So funny okay I but speaking with whether I went through your Twitter before this so I'm going to read back another tweet that you look the good thing on Twitter is there's a public record of all your predictions.

I need to turn on that auto to do anything to be sure whatever.

Last October you tweeted everyone is going to be able to vibe code video games by the end of 2020 25.

Yeah.

To the end up being true.

It feels close and I think there's I mean.

It obviously not AAA games like you're not building you know the next call of duty or GTA yet but I think it's it feels closer than it's ever been.

And I think a lot actually a lot of the interesting bit about video games is you actually need to end up building a lot of this like other stuff like models and we were talking off camera before this like three jasses a great example of this like three jasses makes a lot of things possible that weren't before but they're still all these like.

Rough edges that like just a coding agent doesn't solve and so you need like you know sprite generation and like the models aren't very good at doing that natively and so you need like some orchestration layer and tooling in order to make that happen.

There's a bunch of other things like that that like are core to like the gaming video game experience that need to have a high degree of reliability that I think it feels like it's within reach but actually like requires a lot of like product scaffolding work in order to create experiences that are like.

Reusable and replayable and sort of like how to level of depth and requires a little bit of taste in there.

Do you see people making a lot of video games inside and I studio and the other developer surfaces that you have.

Yeah and so this was actually based on like us looking at the early data and there was something like an as studio at the time it was like 20% of all apps that folks were making were actually games like you were trying to build games a lot of it.

Is that the most popular category it's not the most popular category anymore just because I think like the the ecosystem is shifted and like the user base has shifted but it's a lot of a lot of games.

What's the most popular category I think it's like it's like 20% like finance related stuff 20% people like counting their money that much people like I think it's it's something around crypto actually I think is what people are doing a lot of stuff with with finance.

A lot of like personal productivity things and a lot of gen media stuff actually because obviously the Google suite of gen media stuff is yeah has done a great job.

But I also think gdm has sort of like a obviously dumbest cares a ton about games and sort of like started as career and doing AI stuff because of games.

And so I think we'll have some interesting swings at this and our team actually in in Kaggle which is sort of a bunch of the AI benchmarking stuff we do in gdm sort of works with gdm to build this game arena which is sort of our way of sort of like testing.

Progress towards agi like using games is a proxy which again is like very deeply rooted in and gdm's history so.

How close do you think we are to you know Rando off the street with a good idea can vibe code it really fun playable game.

I want to say this year I actually I think it's I think the model capability makes it possible I think this is where like I've gotten excited on the product side and you know again we are also talking off camera about sort of like the startups in this ecosystem because.

It feels like it's possible it doesn't feel like there's a gap in model quality it feels like there's a gap in like you someone who knows what it takes to build a great game actually like putting the scaffolding together in the right way to make that possible I think there are folks who are doing this right now and so.

Some of it is like a discoverability and awareness thing that like people just don't even know that they can do that.

And some of it is just like maybe certain categories of model capabilities are just like slightly off and we're like you know weeks or months away from like that.

Caeson being crossed and then it just like working for most people.

And so this is a good segue into I want to ask about world models next but do you think vibe quoted video games is more likely.

Going to be you know game engine plus coding agent spaced or do you think is more likely to be world model based.

Yeah, I think the well end up happening is the definition of world models world blur which we should talk about with on me and it will still.

I think the like coding agent will look like some sort of world model type system but you actually do need to make world models useful for like real things you need like scaffolding.

And so I think there's actually a bunch of interesting startups like doing work like figuring out what is the scaffolding for world models so that you can take them from.

These like very open ended inherent design of world models very open ended spaces and like do it in a tangible way so that it's like grounded in a use case that like you could use in a reoccurring way.

That could be somebody maybe will figure out the scaffolding for world models to make games possible like the inherent nature of world models right now I think make it so that it's like actually not well suited for like games in the current form but.

Progress has been crazy so who knows maybe in like two years the versions will be able to but at least in the short term it's like coding agent plus some sort of game engine I think is like where you'll see way more alpha from a games perspective.

That makes sense.

Okay, so you said the definitions of world models are blurry can we unpack that.

I think like on me is a is an example of this you know we launched this at IO you can sort of taking any input create any output and I think demos sort of like framed it to the world.

Rightfully so as a world model because of just like the level of understanding that it has of the world I think that like technically looks different than.

And I'm not an architecture expert on like the way that we've done world models before but it is different from an architectural standpoint than what's happened in the past.

Which I think is positive because it's getting closer to like some of the ways in which it might actually be more scalable.

And historically like it's been like super not scalable it's like very very expensive to run.

Yeah.

Traditional like online world models like genie being like yeah.

So if you think of traditional world models is being like an action conditions video model yes.

Then like right now what we're when we when we say world model what we actually mean is a model that has some understanding of the world as opposed to being strictly technically a action condition video model.

Yeah and and so the interesting thing though is like it has understanding of the world but then it also has that like really great.

And that's where like the line is blurry to me where it's like it can do a lot of those same use case it's not real time right now.

But like it can do a lot of those same use cases that you could describe or like visually could create with that same exact world model which I think is what's most interesting to me so I do feel like this like world model video model thing is going to is going to.

Change and play out in a different way than was obvious before and how does it work on the hood like whatever you're able to share like is it Gem and I plus video models is it.

Something different entirely.

It is it is a single model which I think is the important part like this was actually part of the original desire was like you were training like eight different models to do all of those things historically it's like you have a text model with the baseline gem and I model you have audio you have.

Music models with Lira you have nano banana you have video video models you have a we have a whole suite of audio models and like it would be great for us our customers.

If you just had a single model to do all those things so it is like a new set up that sort of makes that possible it's not like routing to a bunch of different models which like we you could have imagined we could have done something like that actually before and.

I'm like a Gem and I Omni model but this is like a true Omni model and it's starting with like the use case that works the best right now which is the why it's the one that's available.

Is this like video editing capability.

The technically it's like functional with the other things it's just like the quality isn't isn't like perfect and is not say to the art so we we haven't rolled that out yet it's also just like the first.

Crank of the model turn on Omni it's the Omni flash model the first iteration and so we'll have like much much more capable.

Powerful versions which will be which will be exciting to see.

So we could edit this set so it looks like we should yeah yeah yeah I want this we're again we're talking off camera like we should do that for the intro because I think it just like makes all this stuff more capable and I've seen these examples of like.

Such subtle nuance that like make me appreciate that it's like the world understanding playing out I was I was giving a talk.

And was on stage with with my friend Tulsi who leaves the model team but I don't know if you've ever had on before but she's amazing I love Tulsi.

And and I mentioned to someone in the crowd to like edit the video and they literally like took the picture edited it with Omni in real time and this like dog came on the stage.

And like the other in the edited version the other guests sort of like look down and see the dog they like chuckle a little bit this is wall I'm like oh pining about whatever.

Yeah yeah that it was not my jokes they laugh at the dog coming up jumps onto my lap I sort of like acknowledge the dog I keep talking I'm like petting it or whatever and just like there's like so much subtle subtlety and getting that right and the model crushed it and it's just it's very interesting and like still trying to like absorb and digest.

Like what that means for you know the way we make content in all these other things.

That's so interesting.

Yeah.

I'm the biggest bull on generative media and what it means and.

I mean one of the things we've thought about for our podcast is the visuals matter as much as the content.

For sure.

That's how you catch people attention in the first place right and and so okay I'm excited to I'm excited to play with Omni.

I'm excited to and I think the and I think you probably feel this way as somebody who makes content but I've historically like then like.

Very for myself personally like I don't use AI to make any content that I produce like it's all my words it's always my voice it's always my image and picture showing up like I just I feel like there's just like so much alpha and authenticity and so like I would much rather it be me than some AI version of me.

What I like so much about Omni is that it's like not changing me it is like changing a bunch of these other bits which are not me like I didn't choose any of the like set around us or the the coffee table it's like so our words can stay the same and like you can change these bits that are like not personal and do something more interesting with them which I think is really really cool and feels.

It feels like the version of what I want sort of like gen media to be which is like not a bunch of like AI avatars.

No fruit island videos exactly true it like it really is like it's the original content it's the person it's like the person hood is there it's just different and amplified super interesting okay I'm excited to play with it.

Yeah we should we should send some prompts right after this and try.

I don't mind the fruit videos though I'm happy for all of us.

On the coding side you launch the ability in AI studio from people to vibe code android apps yeah yeah.

I'd love to you know hear how that's going so far and and where you're going to take that.

Yeah it's super exciting I think one of the strategic things for AI studio and actually this is based on like a lot of the feedback from the ecosystem and actually from developers or mothers it's like.

There's so many Google products there's so many different like ways in which you like touch Google through all these different journeys of building a startup or bringing idea to your life and so.

We have this like first class principle of like how do we bring things into AI studio that make it so that you are exposed to other parts of the Google ecosystem without having to like go through nine different UIs across Google.

So Android is like a great example not only of that but also of enabling people who wouldn't have otherwise built an Android app and so I literally built my first Android app in AI studio very cool to see it's.

What is it?

Yeah I just did like a plan not a criminal app just a plant one I was planting trees in my back.

I'm like a gardening app.

Yeah and so it was just like playing around with the gardening app as I was kicking the tires.

I haven't had my like breakthrough idea yet of what I want for a mobile app but I'm going to come up with something and see go compete on the app store.

Have you seen anything that I've quoted like really flying in the app store yet?

That's a good quote I actually be interesting to like see some analysis I don't know I'm sure it's like accelerating a lot of things on the app store but I don't know how much like I don't know anybody would like personally who's who's done that.

Yeah.

It is interesting and I was going to make the observation too that I think the last time I checked the numbers we were viewing at this morning it was like.

350,000 Android apps built in a as studios since last week which is crazy.

And like excitingly it's like 350,000 apps that like probably no one was going to build before.

A lot of these are personal too and so this is where I think this like maybe genuine is like farther out there but I think like the idea of you building software.

To solve your personal problem is like very real right now and like people are doing that it's like one of the most common use cases of a lot of these products.

And being able to like unlock a bunch of the native capabilities of the phone I think is also really interesting because you just have so much context that's like in different places.

So I'm getting very excited about sort of that opportunity and Android feels like it's becoming the platform for builders.

Does it matter that something is an app versus just like web is so powerful now?

It's also very interesting to see that play out web is definitely powerful there are certain things that the operating systems have that like you just can't unlock.

Like lots of like native richness that actually like make experiences feel so much richer.

I think about this for like text messaging actually that like the text messaging experience and all of the and all the main operating systems feel way richer to me than like any AI chat app that I've ever used.

Like if I could just talk to AI and whatever texting app I use like I would be way happier than having to go to some other app.

Because I think we're also just like conditioned on like the operating systems so.

Yeah, makes sense.

Okay, I want to ask about the model eats the harness or the model eats the scaffolding.

Yeah, I think it's true and I think part of this is like what we have historically thought of as the model is not the model anymore.

Like when I think like two years ago when LMS were popular it was like the model was like actually just a set of weights.

It was a set of weights and it was like really like how can you like a simple as possible sent tokens in and get tokens out and I think we've just like progressively step by step by step.

We still call it the model we still call it you know Gemini 3.5 so call it GPT whatever and and Claude whatever but like it's actually not just the weights anymore.

It's like an entire expanding sprawling system that's built around the weights.

That sort of like enable a lot of these like next generation experiences from a gentle tool calling to tool you know like all these hosted tools search code execution etc.

You know the models are now being spun up in containers and sort of have an agent harness and all that stuff.

So the scaffolding is like often times a couple of steps ahead of like where the act what is like big directly into the model.

And then what ends up happening is like the model eats that scaffolding and it becomes part of like the native model system and there's still value in having sort of the external scaffolding in certain cases and like search maybe is an example of this like there's lots of folks who use different search providers.

And there's different like use cases that you want and so like sure maybe the model can natively use search but you also want something else code execution and other example of that.

But it does feel like like maybe the agent harness is like the quintessential example of this right now we're like everyone's like we got to go build a harness and like the harness is where the alpha is and like I think that perhaps won't be true at least in the way that we think of the harness today.

In 12 months I think the models will have sort of just like digested a bunch of that it'll be upstream into the model and the alpha will be somewhere else now it won't be in sort of trying to spin your own harness because the model just like does it natively.

But I thought that the part of the reason why people are building their own harnesses is because if you use a harness from any given model provider you're locked in right so all the application companies want flexibility which is why they're building their own harnesses.

And I think that's part of the scaffolding story is like that starts out perhaps true but then as the model capability improves like it becomes less true over time actually I think the model the like you don't have.

A generalized model if it can't use another harness and so it is and is important and I mentioned this in another conversation with someone a few weeks ago but we need something like harness bench which is like actually measuring like.

How good are all these different models at adapting to all the different harnesses I feel like that seems like a reasonable thing we should we should measure as a ecosystem.

And I'd be curious to see like what models are actually best but I think over time you expect they they'd be able to use every harness unless you're like.

Completely out of distribution which in that case like you're still going to be completely out of distribution even if you're using an harness so not sure it matters much.

Very enough.

What about the application layer how do you think about where independent companies can you know have a hope of surviving when the model eats the harness and eats you know the stuff around it.

Yeah it feels like there's yeah it's an interesting story that like both of these things feel true both on one hand I ever where I look I'm like there's never been more opportunity to go and build something at the same time obviously the models are doing more than they've never done before.

I think there's like you know there's that thread of capability overhang which I think there's a huge amount of alpha and there's the thread of the model companies are like going after these like very general problems and there's just like so much value in these like verticalized domains.

You have expertise in that domain and you sort of like know the customers you know the ecosystem like it's just you can really like run laps around even the best model laps because like focus is the like super power of startups like if you can focus you can do anything.

And if you look at all of the companies that are big or doing lots of stuff like there's just not a lot of focus and for some for some reasons like rightfully so because you know maybe I'm overly justified you know Google strategy but like we just have a lot of products we have a lot of users we have a lot of different things going on and so like we actually can't focus in one domain we have an obligation to do a bunch of things as a big company.

I think that's not true for startups and so I think like 24 months ago we were all asking ourselves like oh wow it seems like the opportunity space is shifting and maybe it's it's possible one of the outcomes is there's less opportunity for startups in the future that feels like so far in a way not what has ended up playing out which is really positive.

If anything it feels like there's just even more opportunity than there was like now coding has helped you like close the gap on like larger companies that have like established code bases and all this other stuff because you can just like run way faster and right software quicker.

The agentic like primitive is like a new category that you can sort of build products around that like actually in a lot of cases to the conversation about like the risks involved with building like there's risk involved and so like what's your like the risk appetite of different companies is different.

And so if you're willing to take more risk in some domains like you can win a user cohort who's like interested in also taking risk there's so much opportunity.

Awesome.

I'd love to talk about Google DeepMines culture and I'm curious what does it feel like to be inside.

GDM right now, you know what we had demissed that AI sent you so inspiring I've heard Sarah guys back I've guys have known Shazir back like.

Walk me through what it's like to be a GDM right now it's incredible I do try to take it all in because it is like it's like a moment I try to reflect as much as possible and in the chaos of all the things that are happening just because there's like so much cool stuff going on.

GDM's culture is interesting and like maybe three observations one back to this thread of like focus we're doing a lot of things and so I think you see sort of I think about this a lot like from a portfolio perspective I think we have like one of the strongest portfolios which is really exciting.

But you do see these moments were like another lab or another company whatever it is will like pull ahead in a certain area where like we under invested just like hadn't been focused enough in that domain.

And it's cool to see like the the way we go about trying to like close that gap I very much I very much appreciate it I think I've watched the demiss thinking game documentary a few times and like you see sort of like a lot of like details of that like.

original culture and just like the way that strikes work and all this stuff which is actually really similar today is like just get a bunch of smart people together and like go solve the problem and I love that and it's like very cool.

To to be a part of another one is this I think you see the culture permeate from like who the leaders are and as I maybe.

This isn't like a perfect characterization of the ecosystem but like Demis is a Nobel Prize scientist and like the sort of OG of a lot of this stuff and sort of you feel that in the deep mine culture I think like Sam is like the you know maybe one of the world's best business man ever and like you sort of see that in the open AI culture and the way that they go about the world I don't have a strong sense of who Dario is.

But like I think anthropic is a very interesting place and you sort of I least as an external observer like they're he seems like an interesting guy and so.

Someone as a terror and so that seems like they're sort of like that in the DNA and the culture of the company you know all the other labs are interesting.

But I'd like this like very scientific approach to the world and the way that like Demis looks at like.

The reason he's doing this and the reason they started this mission was like literally like solve disease and all these things and it's like so easy to get.

And again, I'm always trying to pull myself out of the moment, but like it's so easy to get lost in this like competitive race of who's.

pushing a number higher on sweet adventure whatever it is it's very easy to lose sight lose sight of like the reason we're doing that is so that like we're.

Consult problems that humans actually have and there's a.

My favorite quote from all Silicon Valley is something like you know we can't let other people make the world the better place more than we can which is like what this moment feels like.

The Gavin Nelson quote and I think about that all the time and it's like we're all fighting over who can make the world better more than the other person.

Which just like when you frame it like that seems really goofy to me and so it's very much not zero some and I think that's like a way of looking out the world.

I think the last thing about deep minds culture is like we're very sort of the engine room of Google which I think is like literally the Twitter bio now of the the deep mine Twitter account, which I love.

You man the deep mindset I don't I don't want any responsibility manning other people's accounts online too much too much responsibility to do that but it does feel like that too so it's like on one hand you have sort of like the deep rooted lab culture.

The other hand you have sort of like all of these partners across the Google ecosystem that we're collaborating with everybody from Android that we talked about earlier to Google Cloud to you know Gmail to workspace et cetera et cetera and so it's an interesting blend of like.

I think there's lots of research work happening but like there's tons of applied work that's happening to like actually like work with some of the like the forefront customers like deploying Gemini to Billy and user products is a problem that like only two companies in the world have and we have 13 of those products and like we the you know Google goes through this all the time now and it's such an interesting.

It plays to like see that happen and see the innovation that takes place in order to make that actually possible and I feel like it's you can only do that inside of inside of Google which is really cool.

Be the place then did they did it give them a lot of heartburn when you joined and we're tweeting a lot.

That's a good question.

To get sign off from.

I'm very one of the silver linings to my Google experience has been just like how great.

That group of like folks across marketing comms are to to work with and I think like a you know their job is.

Protect Google make sure we tell the right story and make sure a bunch of bad things don't happen and so I have a ton of appreciation and partnership with them but.

It's been an incredible experience to like be able to go try to tell the story that resonates with developers in a way that feels authentic and not have a huge amount of you know.

I don't have to get my tweets approved all the time and all this stuff like is very very positive culture and I think hopefully I'm always trying to walk the line of not not burning.

The trust and goodwill that that I've accumulated with those folks but it's been super positive because ultimately I think it's like it's really hard for Google to tell.

This like authentic stories so there's like it's a big company there's a lot of people there's a lot of opinions and so you take the like magic of Google and you water it down through like a lot of people and a lot of process and you actually.

You you miss the beautiful story which is like Google's doing the most interesting technology in the world and like helping our users with some of the hardest problems in the world and.

It feels it's a privilege to like get to help tell that story so it's it's a lot of fun I enjoy it.

I love what you're doing I love what Josh is doing you guys have put a really kind of sincere human touch on.

As you put it the most important problem or time so thank you.

Well wonderful Logan thank you so much for joining me today to say very far ranging conversation everything from agents and coding to world models and.

Hardnesses and GDM culture and lots lots of nuggets here thank you for for joining me today.

This is what I'm a fun thank you for having me and I'm excited to see what the folks cook up where we've been sitting this whole time maybe in front of us.

I don't even know me a dog.

A dog.

I love it awesome thanks Logan course.

Thank you.