Training Data · 2026-04-30

Andrej Karpathy on Agentic Engineering and the Future of AI Programming

Hosts: Unknown

Guests: Andrej Karpathy

agentic engineeringsoftware 3.0verifiabilityAI programmingLLMsjagged intelligenceAI automationfuture of computing

Why it matters

December 2022 marked a turning point where AI coding tools became reliable enough to trust without frequent corrections.

Key claims

December 2022 marked a turning point where AI coding tools became reliable enough to trust without frequent corrections.
Software 3.0 is a new computing paradigm where programming is done via prompts and context given to LLMs acting as interpreters.
Verifiability is key to AI automation progress; tasks with clear verification criteria (e.g., coding, math) see rapid AI capability gains.
AI models exhibit 'jagged intelligence'—highly capable in some areas but inconsistent or flawed in others due to training data and reward focus.

Briefing memo

Summary

Andrej Karpathy discusses the transformative shift in AI programming paradigms, highlighting the transition from traditional coding to what he terms 'software 3.0,' where prompting large language models (LLMs) acts as programming. He reflects on his personal experience of feeling behind as a programmer due to rapid advances in agentic AI tools that can autonomously generate and debug code. Karpathy emphasizes the importance of verifiability in AI automation, noting that domains where outputs can be verified—such as coding and math—are advancing fastest. He introduces the concept of 'agentic engineering,' which focuses on coordinating fallible AI agents to maintain software quality while accelerating development.

December 2022 marked a turning point where AI coding tools became reliable enough to trust without frequent corrections.
Software 3.0 is a new computing paradigm where programming is done via prompts and context given to LLMs acting as interpreters.
Verifiability is key to AI automation progress; tasks with clear verification criteria (e.g., coding, math) see rapid AI capability gains.
AI models exhibit 'jagged intelligence'—highly capable in some areas but inconsistent or flawed in others due to training data and reward focus.
Agentic engineering involves managing multiple AI agents to accelerate software development without sacrificing quality or security.
Future AI systems may shift from classical computing to neural network-centric architectures, with CPUs as co-processors.
Human roles will evolve to focus on oversight, design, aesthetics, and judgment, while AI handles detailed implementation and recall.
Karpathy envisions a future where AI agents autonomously deploy and manage applications, interacting on behalf of users and organizations.

Source material

Transcript

We're so excited for our very first special guest.

He has helped build modern AI, then explain modern AI, and then occasionally rename modern AI.

He actually helped co-found open AI right inside of the office, was the one who actually got autopilot working at Tesla back in the day, and he has a rare gift of making the most complex technical shifts, feel both accessible and inevitable.

You all know him for having coined the term vibe coding last year, but just in the last few months he said something even more startling that he's never felt more behind as a programmer.

That's where we're starting today.

Thank you Andre for joining us.

Yeah, hello.

Excited to be hearing.

To kick us off.

Okay, so just a couple months ago, you said that you've never felt more behind as a programmer.

It's startling to hear from you of all people.

Can you help us unpack that?

Was that feeling exhilarating or unsettling?

Yeah, I'm mixture of both for sure.

Well, first of all, I guess like as many of you, I've been using agentic tools like cloth code, adjacent things for a while, maybe over the last year as it came out, and it was very good at chunks of code, and sometimes it would mess up, and you have to edit them, and it was kind of helpful.

Then I would say December was this clear point where for me, I was on a break, so I had a bit more time.

I think many other people were similar, and I just started to notice that with the latest models, the chunks just came out fine, and then I kept asking for more, and just came out fine.

Then I can't remember the last time I corrected it, and then I was just trusted this system more and more, and then I was about coding.

It was kind of, I do think that it was a very stark transition.

I think that a lot of people actually, I tried to stress this on Twitter and or X, because I think a lot of people experienced AI last year as ChachiPTU Jason thing, but you really had to look again, and you had to look as of December, because things had changed fundamentally, and especially on this like, agentic coherent workflow that really started to actually work.

And so I would say that, yeah, it was just that realization that really had me go down their whole rabbit hole, just, you know, infinity-side project.

My side project's folder is extremely full with lots of random things, and just by coding all the time.

So, yeah, that kind of happened in December, I would say, and I was looking at the repercussions of that since.

You've talked a lot about this idea of LLM's as a new computer, that it isn't just better software, it's a whole new computing paradigm, and software 1.0 was explicit rules, software 2.0 was learned weights, software 3.0 is this.

If that's actually true, what does a team build differently the day they actually believe this?

Right, so yeah, exactly.

So, software 1.0, I'm writing code, software 2.0, actually programming by creating data sets and training neural networks, so the programming is kind of like arranging data sets and maybe some objectives and neural network architectures.

And then what happened is that, basically, if you train one of these GPT models or LLMs on a sufficiently large set of tasks implicitly, implicitly, because by training on the internet, you have to multitask all the things that are in the data set.

These actually become kind of like a programmable computer in a certain sense.

So, software 3.0 is kind of about, you know, your programming now turns to prompting, and what's in the context window is your lever over the interpreter that is the LLM, that is kind of like interpreting your context and performing computation in the digital information space.

So, I guess, yeah, that's kind of the transition.

And I think there's a few examples of that really drove it home for me, maybe that might be instructive.

Also, for example, when you, when open claw came out, when you want to install open claw, you would expect that normally this is a batch script, like a shell script, so around the shell script to run to install open claw.

But the thing is that in order to target lots of different platforms and lots of different types of computers you might run an open claw, these shell scripts usually balloon up and become extremely complex.

But the thing is you're still stuck in a software 1.0 universe of wanting to write the code.

And actually the open claw installation is a copy paste of a bunch of text that you're supposed to give to your agent.

So, basically, it's a little skill of, you know, copy paste this and give it to your agent, and it will install open claw.

And the reason this is a lot more powerful is you're working now in the software 3.0 paradigm where you don't have to precisely spell out all the individual details of that set up.

The agent has its own intelligence that it packages up, and then it kind of follows the instructions.

And it looks at your environment, your computer, and it kind of performs intelligent actions to make things work and debug things in the loop, and it's just like so much more powerful, right?

So, I think that's a very different kind of way of thinking about it.

It's just like, what is the piece of text to copy paste to your agent?

That's the programming paradigm now.

I think one more, maybe example that comes to mind that is even more extreme than that, is when I was building menu gen.

So, menu gen is this idea where you come to a restaurant, they give you a menu, there's no pictures usually.

So, I don't know what any of these things are, usually, like 30% of the things I'd have no idea what they are, 50%.

So, I wanted to take a photo of the restaurant menu, and to get pictures of what those things might look like in a generic sense.

And so, I built, I've put it this app that basically lets you upload a photo, and it does all the stuff, and it runs on Versel, and it basically re-renders the menu, and it gives you like all the items, and it gives you a picture that it uses an image, you know, generator 4, so basically OCR all the different titles, use the image generated to get pictures of them, and then shows it to you.

And then I saw the software 3.0 version of this, which is, which blew my mind, which is literally just take your photo, give it to Gemini, and say, use Nanobanena to overlay the things onto the menu, and Nanobanena basically returned an image that is exactly the picture of the menu that I took, but it actually put into the pixels, it rendered, the different things in the menu, and this blew my mind, because actually all of my menu gen is perious.

It's working in the old paradigm that apps shouldn't exist, and yeah, the software 3.0 paradigm is a lot more kind of raw.

It just, your neural network is doing more and more of the work, and your prompt or context is just the image, and the output is an image, and there's no need to have any of the app in between.

So I think that people have to kind of like reframe, not to work in the existing paradigm of what things existed, and just think about it as a speed up of what exists.

It's actually like new things are available now, and going back to your programming question, it's not even, I think that's also an example of working in the old mindset, because it's not just about programming and programming becoming faster.

This is more general information processing that is automated now, so it's not just even about code.

Previous code worked over kind of like structured data, right?

And you write code over structured data, but like for example, with my LLan knowledge basis project, basically you get LLans to create wikis for your organization, for you in person, et cetera.

There's not even a program.

This is not something that could exist before, because there was no code that would create knowledge base based on a bunch of facts, but now you can just take these documents, and basically recompile them in a different way, and reorder them and create something that is new and interesting as a reframing of the data.

And so these are new things that weren't possible.

And so I think this is something that I keep trying to get back to as to not only what can we do that existed that is faster now, but I think there's new opportunities of just things that couldn't be possible before, and I almost think that that's more exciting.

I love the menu gen progression and dichotomy that you laid out, and I think even I'm sure many folks here followed your own progression of programming from last October to early January, February this year.

If you extrapolate that further, what is the 26 equivalent for building websites in the 90s, building mobile apps, in the 2010s, building SaaS, in the last cloud era, what will look completely obvious in hindsight that is still mostly unbuilt today?

Well, going with the example of menu gen, I guess, so a lot of this code shouldn't exist, and it's just neural network doing most of the work.

I do think that the extrapolation looks very weird because you could basically imagine, I don't think, yeah, so you could imagine completely neural computers in a circumstance.

You feed a raw videos, like imagine a device, you take raw videos or audio into basically what's a neural net, and uses diffusion to render a UI that is kind of unique for that moment in a circumstance.

I kind of feel like in the early days of computing, actually, people are a little bit confused as to whether computers would look like calculators or computers would look like neural nets.

And in 50s and 60s, it was not really obvious which way would go.

And of course, we went down the calculator path and ended up building classical computing.

And then neural nets are currently running virtualized on existing computers.

But you could imagine, I think, that a lot of this will flip, and that the neural net becomes kind of like the host process, and the CPUs become kind of like the co-processor.

So we saw the diagram of, you know, intelligence compute is going to, of neural networks is going to take over and become the dominant spend of flops.

So you could imagine something really weird and foreign when where neural nets are doing most of the heavily thin, they're using tool use as just like, you know, historical appendage for some kinds of deterministic tasks.

But what's really running the show is these neural nets that are networked in a certain way.

So you could imagine something extremely foreign as the extrapolation.

But I think we're going to probably get there sort of piece by piece.

And I don't, yeah, that that progression is TBD, I would say.

I'd let's talk a little bit about this concept of verifiability.

The fact that AI will automate faster and more easily domains where the output can be verified.

If that framework is right, what work is about to move much faster than people realize, and what professions do we have that people actually think are safe, but they're actually highly verifiable.

Yes, so I spent some time writing about verifiability and basically like traditional computers can easily automate what you can specify in code.

And kind of this latest round of LLMs can easily automate what you can verify in a certain sense.

Because the way this works is that when frontier labs are training these LLMs, these are giant reinforcement learning environments.

So they are given a verification rewards.

And then because of the way that these models are trained, they end up basically progressing and creating these like jagged entities that really peak in capability in kind of like very fabled domains like math and code and adjacent.

And kind of like stagnate and our little bit in a rougher on the edges when things are not kind of like in that space.

So I think the reason I wrote about verifiability, I'm trying to understand why these things are so jagged.

And some of it has to do with how the labs train the models, but I think some of it also has to do with the focus of the labs.

And what they happen to put into the data distribution.

Because some things basically are significantly more valuable in economy and end up creating more environments because the labs wanted to work in those settings.

So I think code is a good example of that.

There's probably lots of verifiable environments that could think about that happen not to make it into the mix because they're just not that useful to have the capability around.

But I think to me the big, I guess like the big mystery is the favorite example for a while was that how many layers are in the strawberry and the models with famously get this wrong.

And it's an example of jaggedness, the models now patch this I think.

But the new one is I want to go to a car wash to wash my car and it's 50 meters away.

Should I drive or should I walk?

And state of the art models today will tell you to walk because it's so close.

How is it possible that state of the art opus 4.7 will simultaneously refactor 100,000 like codebase, a line codebase, or find zero-day vulnerabilities.

And yet tells me to walk to this car wash.

This is insane.

And to whatever extent they these models are remained jagged, it's an indication that number one maybe something slightly off.

Or number two you need to actually be in the loop a little bit and you need to treat them as tools.

And you do have to kind of stay in touch with what they're doing.

And so I think all of my writing wants to worry short about verifiability just trying to understand why these things are jagged, is there any pattern to it?

And I think it's some kind of a combination of verifiable plus labs care.

Maybe one more anecdote that is instructive is from GPT 3.5 to GPT 4 people notice that chests improved a lot.

And I think a lot of people thought oh well it's just a progression of the capabilities.

But actually it's more that I think this is public information I think I said on the internet.

A huge amount of like data of chests made it into the pre-chaining set.

And just because in a data distribution basically the model improved a lot more than it would just by default.

So someone at OpenAI decided to add this data and now you have a capability that just peeked a lot more.

And so that's why I think I'm stressing this dimension of it.

As we are slightly at the mercy of whatever the labs are doing, whatever they happen to put into the mix.

And you have to actually explore this thing that they give you that has no manual.

And it works in certain settings but maybe not in some settings.

And you have to kind of explore it a little bit.

And if you're in the circuits that were part of the RL, you fly.

And if you're in the circuits that are out of the data distribution, you're going to struggle.

And you have to kind of figure out which circuits you're in in your application.

And if you're not in the circuits then you have to really look at fine tuning and doing some of your own work because it's not going to necessarily come out of the LM out of the box.

I'd love to come back to the concept of jagged intelligence in a little bit.

If you were a founder today and thinking about building a company, you are trying to solve a problem that you think is tractable.

Something that is a domain that is verifiable.

But you look around and you think, oh my gosh, well the labs have really, really started getting to escape velocity in the ones that seem most obvious.

Math, coding, and others.

What would your advice be to the founders in the audience?

So I think maybe that comes to the previous question of, I do think that verifiability because it let me think.

So verifiability makes something tractable in a current paradigm because you can throw a huge amount of RL at it.

So maybe one way to see it is that that remains true even if the labs are not focusing on it directly.

So if you are in a verifiable setting where you could create these RL environments or examples, then that actually sets you up to potentially do your own fine tuning and you might benefit from that.

But that is fundamentally technology that just works.

You can pull over.

If you have huge amount of diverse data sets of RL environments etc, you can use your favorite fine tuning framework and pull the lever and get something that actually works pretty well.

So I didn't know what the examples of this might be.

But I do think there are some very valuable reinforcement learning environments that we will think of that.

I think are not part of the, yeah, I don't want to give away the answer, but there is one domain that I think is very, oh, okay, sorry.

I don't mean to vague post on the stage, but there are some examples of this.

On the flip side, what do you think still feels automatable only from a distance?

I do think that ultimately almost everything can be made verifiable to some extent.

Some things easier than others because even for like things like writing or so on, you can imagine having a console of LM judges and probably get to some get something reasonable out of the, from this kind of an approach.

So it's more about what's easy or hard.

So I do think that ultimately, yeah, I think, everything.

Everything is automatable.

Amazing.

Okay.

So last year you coined the term vibe coding and today we're in a world that feels a little bit more serious, more regentic engineering.

What do you think is the difference between the two and what would you actually call what we're in today?

Yeah.

So I would say vibe coding is about raising the floor for everyone in terms of what they can do in software.

So the floor arises, everyone can vibe code anything and that's amazing, incredible.

But then I would say a gigentic engineering is about preserving the quality bar of what existed before in professional software.

So you're not allowed to introduce vulnerabilities due to vibe coding.

You are, you're still responsible for your software just as before.

But can you go faster?

And spoiler is you can.

But how do you, how do you do that properly?

And so to me, a gigentic engineering, I call it that because I do think it's kind of like an engineering discipline.

You have these agents, which are these like spiky entities, they're a bit fallible, a bit stochastic.

But the art extremely powerful is how do you how do you coordinate them to go faster without sacrificing your quality bar?

And doing that well and correctly is the realm of a gentic engineering.

So I kind of see them as different.

Like one is about maybe raising the floor and the other is about extrapolating.

And what I'm seeing, I think is there is a very high ceiling on a gentic engineer capability.

And people used to talk about the 10x engineer previously.

I think that this is magnified a lot more.

10x is not the speed up you gain.

And I think it does seem to me like people who are very good at this.

Beacle a lot more than 10x from my perspective right now.

I really like that framing.

One thing that when Sam Almond came to Ayeson last year, one memorable thing he said was that people have different generations used chatchipiti differently.

So if you're in your 30s use it as a Google search replacement.

But if you're in your teens, chatchipiti is your gateway to the internet.

What is the parallel here in coding today?

If we were to watch two people code using open claw, clawed code code acts.

One, you'd consider mediocre at it.

And one, you would consider fully AI native.

How would you describe the difference?

I mean, I think it's just trying to get the most out of the tools that are available, utilizing all of their features, investing into your own kind of setup.

So just like previously, all the engineers are used to basically getting the most out of the tools you use.

Either it's very more VS code or now it's a plot code or code X sort of so on.

So just investing into your setup and utilizing a lot of the tools that are available to you.

I think it just kind of looks like that.

I do think that maybe related thought is a lot of people are maybe hiring for this because they want to hire a strong, agentic engineers.

I do think that what I'm seeing is that the most people are still not refactored.

They're hiring process for agentic engineer capability, right?

Like if you're giving out puzzles to solve, then this is still the old paradigm.

I would say that hiring have to have to look like, give me a really big project and see someone implement that big project.

Like let's write, say a Twitter clone for agents and then make it really good, make it really secure and then have some agents simulate some activity on this Twitter.

And then I'm going to use 10 code X, 5.4 X high to try to break your break your website that you deployed.

And they're going to try to basically break it and they should not be able to break it.

And so maybe it looks like that, right?

And so yeah, watching people and that's that setting and building bigger projects and utilizing the tooling is maybe what I would look at if it was part.

And as agents do more, what humans skill do you think becomes more valuable, not less?

So yeah, it's a good question.

I think, um, well right now the answer is that the agents are kind of like these intern entities, right?

So it's remarkable, um, you basically still have to be in charge of the aesthetics, the judgment, the taste, and a little bit of oversight.

Maybe one, one of my favorite examples of like the the weirdness of agents is, um, for menu gen, uh, you sign up with a Google Google account, but you, um, purchase credits using a Stripe account, and both of them have email addresses.

And my agent actually tries to basically, um, like when you purchase credits, it is signed it using the email address from Stripe to the Google email address.

Like there wasn't a persistent user ID that that, uh, for people, it was trying to match up the email addresses, but you could use different email address for your Stripe in your Google, and basically would not associate the funds.

And so this is the kind of thing that these agents still will make mistakes about.

It's like, why would you use email addresses to try to cross-correlated the funds?

They can be arbitrary.

You can use different emails, et cetera.

Like this is such a weird thing to do.

So I think people have to be in charge of this spec, this plan, and um, actually don't even like the plan mode.

I, I would, I mean, obviously it's very useful, but I think there's something more general in here where you have to work with your agent to design a spec that is very detailed, and maybe it's, uh, maybe basically the docs, and then get the agents to write them.

And you're in charge of the oversight and the top level categories about the agents are doing a lot of the under the hood.

And, um, so I think you're not caring about some of the details.

So as an example, also with, um, arrays or tensors in neural networks.

Um, there's a ton of details between PyTorch and NumPy and all the different like pandas and so on.

For all the different little API details, and I'll, I already forgot about the keep dims versus keep them or whether it's dim or access or reshape or commute or transpose.

I don't remember this stuff anymore, right?

Because you don't have to.

This is the kind of details that are handled by the intern because they have very good recall.

And, but you still have to know, for example, that, um, you know, there's underlying tensor, there's an underlying view, and then you can manipulate view of the same storage, or you can have different storage, which will be less efficient.

And so you still have to have an understanding of what this stuff is doing, and some of the fundamentals, um, so that you're not copying memory around unnecessarily and so on.

But, uh, the details of the APIs are not handed off.

So it, um, you're in charge of the taste, the engineering, the design, uh, and that it makes sense, and that you're asking for the right things, and that you're saying that, okay, these have to be unique user IDs that we're going to tie everything to.

Um, and so you're doing some of the design and development and the engineers are doing the full end of blanks.

And that's currently kind of like where we are, and I think that's what everyone, of course, is seeing, I think, right now.

Do you think there's chance that this, um, taste and judgment matters less over time, or will the ceiling just keep rising?

Um, yeah, it's a good question.

I would say, um, I mean, I'm hoping that the, that it improves.

I think probably the reason it doesn't improve right now is again, it's not part of the RL.

There's probably no aesthetics, you know, cost or reward, or it's not good enough, or something like that.

Um, I do think that when you actually look at the code, sometimes I get a little bit of a heart attack because it's not like super amazing code, necessarily all the time, it's very bloaty, and there's a lot of copypaste, and there's awkward abstractions that are brittle, and like it works, but it's just really gross.

Um, and I do, I do hope that this can improve in future models.

Um, a good example also is this, uh, you know, a micro GPT project, which where I was trying to simplify, uh, LLM training to be as simple as the models hate this.

They can't do it.

I try to, I keep, I keep trying to prompt an LLM to simplify more, simplify more, and it just can't, you feel like you're outside of the RL circuits.

It feels like you're obviously, you know, you're pulling teeth.

It's not like light speed.

So I think, um, I do think that people are still remain in charge of this, but I do think that there's nothing fundamental again that's preventing it.

It's just the laps haven't done it yet, almost.

Yeah.

So I'd love to come back to this idea of, uh, jagged forms of intelligence.

You wrote a little bit about this with a very thought provoking piece around animals versus ghosts.

Um, and the idea is that we're not building animals.

We are summoning ghosts.

Um, and these are a jagged forms of intelligence that are shaped by data and reward functions, but not by intrinsic motivation or fun or curiosity or empowerment, uh, things that kind of came about via evolution.

Um, why does that framing matter?

And what does it actually change about how you build and deploy an evaluate or even trust them?

Uh, yes.

So yeah, I think the reason I wrote about this is because I'm trying to wrap my head around what these things are, right?

Because if you have a good model of what they are or are not, then you're going to be more competent at, uh, using them.

Um, and I do think that, um, I don't know if it has, I'm not sure if it actually has like real power.

I think it's a little bit of philosophizing, um, but I do think that, um, I think it's just, um, coming to terms with the fact that these things are not, you know, animal intelligence is like if you'll yell at them, they're not going to work better or worse or does that have any impact.

Um, and, uh, it's all just kind of like these, a statistical simulation circuits where the, the substrate is pre-training, so like statistics.

And then, but then there's RL bolting on top.

So it kind of like increases this appendages.

And, um, maybe it's just kind of like a mindset of what I'm coming into or what's likely to work or not likely to work or how to modify it.

But I don't actually, I don't know that I have like, here are the five obvious outcomes of how to make your system better.

It's more just being suspicious of it.

And, um, figuring out over time.

That's where it starts.

Um, okay.

So you are so deep in working with agents that don't just chat.

They have, um, real permissions.

They have local context.

They actually take action on your, your behalf.

What does a world look like when we all start to live in that world?

Uh, yeah, I think, I think a lot of people probably here are excited about what this agent, you know, native, agentic environment looks like.

And everything has to be rewritten.

Everything is still fundamentally written for humans and has to be moved around.

I still use most of the time when I use different frameworks or libraries or things like that.

They still have docs.

There are fundamentally written for humans.

This is my favorite pat peeve.

Like, I don't, uh, why are people still telling me what to do?

Like, I don't want to do anything.

What is the thing I should copy based on my agent?

Like, uh, so it's just, uh, every time I'm told, you know, go to this URL or something like that.

It's just like, oh, you know, so, um, everyone is, I think excited about how do we decompose the workloads that need to happen into fundamentally sensors over the world, actuators over the world?

How do we make it agent native?

Uh, basically, describe it to agents first.

Um, and then I have a lot of automation around, um, you know, the, um, yeah, around data structures that are very legible to the LLMs.

Uh, so I think, um, yeah, I'm hoping that there's a lot of agent first, um, infrastructure out there and that, you know, for menugen, famously, when I wrote the, not, I'm not sure how famously, but when I wrote the blog post about menugen, um, a lot of the work, a lot of the trouble was not even writing the code from menugen.

It was deploying it to personnel because I had to work with all these different services and I just dreamed them up and I had to go to their settings and the menus and, you know, configure my DNS and it was just so annoying.

And so, um, that's a good example of, I would hope that menugen that I could give a prompt to an LLM build menugen and that I didn't have to touch anything and it's deployed in that same way on the internet.

Uh, I think that would be a good kind of, a test for whether or not, uh, a lot of our infrastructure is becoming more and more agent native.

And then ultimately, I would say, yeah, I do think we're going towards a world where, um, there's agent representation for people and for organizations and, um, you know, I'll have my agent talk to your agent to figure out some of the details of our meetings or, or things like that.

So, um, I do think that that roughly where things are going, but, um, yeah, I think everyone here is excited about that.

I really like the visual analogy of sensors and actuators.

I actually had not thought about that.

That's super interesting.

Right.

Um, okay.

I think we have to end on a question about education, um, because you are probably one of the very best in the world at making complex technical concepts simple and deeply thoughtful about how we design education around it.

What still remains worth learning deeply when intelligence gets cheap as we move into the next era of AI.

Yeah.

Uh, there was a tweet that blew my mind recently and I keep thinking about it like every other day.

It was something along the lines of, um, you can outsource your thinking, but you can't outsource your understanding.

And, um, I think that's really nice.

So, yeah, because I still, I'm still part of the system and I still, I still have to somehow information still has to make it into my brain.

And I feel like I'm becoming a bottleneck of just even knowing, what are we trying to build?

Why is it worth doing?

Uh, how do I direct, you know, how do I direct my agents and so on?

So, I do still think that ultimately something has to direct the thinking and the processing and so on.

And, um, that's still kind of fundamentally constrained somehow by understanding.

And this is one reason I also was very excited about all the Ellen knowledge basis, because I feel like that's, that's a way for me to process information.

And anytime I see a different projection onto information, I always, like feel like I gain insight.

So, it's really just, uh, a lot of prompts for me to disintegrate a generation kind of over, over some fixed data.

Uh, so I really enjoy, uh, whenever I read an article, I have my, uh, you know, my wiki that's being built up from these articles and I love asking questions about things or, um, and I, I think that ultimately these are tools to enhance understanding in a certain way.

And this is still kind of like a bit of a bottleneck because then you can't direct the, uh, you can't be a good director if you still, uh, because the Ellen certainly don't excel at understanding.

You still are uniquely in charge of that.

So, uh, yeah, I think, uh, tools to that effect.

I think are incredibly interesting and exciting.

I'm excited to be back here in a couple of years and to see if we've been fully automated out of the loop and they actually take care of understanding as well.

Uh, thank you so much for joining us.

And I'm trying to really appreciate it.