Latest verified signal
Building Self-Accelerating AI to Accelerate Science with Mirendil
Mirendil focuses on building self-accelerating AI systems that can autonomously conduct AI research and engineering to accelerate science.
AI lab podcast intelligence
Podcast episodes featuring technical leaders and executives from major AI labs.
Latest verified signal
Mirendil focuses on building self-accelerating AI systems that can autonomously conduct AI research and engineering to accelerate science.
Behnam Neyshabur, Harsh Mehta Anthropic, Google DeepMindHosted by Math Bornstein
Mirendil focuses on building self-accelerating AI systems that can autonomously conduct AI research and engineering to accelerate science.
In this episode of AI + a16z, Mirendil co-founders Behnam Neyshabur and Harsh Mehta discuss their vision for self-accelerating AI systems that can conduct AI research and engineering autonomously to dramatically speed up scientific discovery. They emphasize the disruptive nature of this technology and the need to rethink company structures and incentives to enable broad access and collaboration. Unlike traditional AI models focused on general capabilities, Mirendil aims to build specialized AI systems that improve themselves iteratively in targeted scientific domains, reducing the need for large teams and resources.
Fiona Fung AnthropicHosted by Lenny
Anthropic engineers now produce 8x more code per quarter than in 2025, shifting bottlenecks from coding to verification and impact.
Fiona Fung, Manager of the Claude Code and Cowork Teams at Anthropic, shares deep insights into how AI is revolutionizing software engineering. Coding is no longer the bottleneck, with Anthropic engineers producing eight times more code per quarter compared to 2025. The focus has shifted to ambitious product building, verification, and quality assurance, leveraging AI tools like Claude Code and Cowork to automate routine tasks and enhance productivity. Fung emphasizes the importance of high agency paired with accountability within teams, encouraging proactive initiative and ownership.
Dean Ball OpenAIHosted by Nathan
Dean Ball is joining OpenAI to lead a Strategic Futures team focused on frontier AI policy and governance.
Dean Ball discusses his decision to join OpenAI to lead a new Strategic Futures team focused on shaping frontier AI policy. He reflects on the current state of U.S. AI policy, including critiques of America's AI Action Plan and the ongoing challenges with government coordination and transparency. Dean emphasizes the importance of being inside a frontier AI lab to access detailed technical insights necessary for effective policy development, especially around recursive self-improvement (RSI) and internal model deployments. He also shares his views on the evolving power dynamics between AI labs and the government, the role of states in AI regulation, and the risks of government monopolization of frontier AI capabilities.
Anjney Midha Anthropic, Google DeepMindHosted by FungeMita
AMP aims to create a pooled, multi-cloud compute grid analogous to the electric grid to maximize utilization and reduce waste in AI infrastructure.
Anjney Midha, founder of AMP and former Google engineer, discusses the critical importance of maximizing output and efficiency in AI infrastructure. He emphasizes the need for iterative, responsible scaling of compute resources, drawing parallels to the electric grid and advocating for a pooled, multi-cloud compute grid to optimize utilization and reduce waste. Midha highlights the misalignment of incentives in AI infrastructure and the challenges of scaling compute without losing alignment across stakeholders.
Tejal Patwardhan OpenAIHosted by Andrew Maine
Traditional AI benchmarks saturate quickly and fail to distinguish advanced model capabilities, prompting the need for more realistic, complex, and long-horizon evaluations.
In this episode of the OpenAI Podcast, research lead Tejal Patwardhan discusses the challenges and evolution of AI benchmarking as models rapidly improve. She emphasizes the limitations of traditional benchmarks, which often become saturated and fail to capture real-world usefulness. Patwardhan highlights OpenAI's shift towards more realistic, long-horizon evaluations that measure models' ability to perform complex tasks across domains such as coding, science, and professional work. The conversation also covers the importance of measuring models' real-world impact, including scientific research and wet lab experiments, and the increasing complexity of evaluations as models interact with physical and digital environments over extended periods.
Jeffrey Irving, Daniel Murphy, Rahul Sunwalkar, Shlok Khemani, Tom Agrath, Andrew Moore, prinz Anthropic, Google DeepMindHosted by Nathan
Anthropic's Fable model launch shows strong coding and reasoning capabilities but is heavily gated on production and sensitive tasks, often falling back to older models when restricted.
The episode covers the launch and early user experiences of Anthropic's Fable model, highlighting its cautious gating on sensitive tasks and impressive autonomous decision-making in complex workflows. Discussions include the challenges of AI alignment, with Jeffrey Irving and Daniel Murphy announcing Sequent, a new organization focused on theoretical guarantees for AI safety amid accelerating capabilities. The episode also explores hybrid authorship with AI, economic incentives in AI usage, and the concentration of power in frontier AI labs. Key policy and interpretability issues are raised, including Anthropic's response to silent refusals and the need for better oversight of internal model deployments.
Logan Kilpatrick Google DeepMindHosted by Unknown
Google's agentic AI era is powered by the 'anti gravity' agent harness, providing a unified framework across Google products for autonomous agent capabilities.
In this episode, Logan Kilpatrick, head of Google AI Studio and the Gemini API, discusses the evolution of Google's AI strategy centered around agentic AI and the 'anti gravity' agent harness that powers a growing number of Google products. He explains how this harness enables agentic capabilities across coding, search, and consumer applications, emphasizing a shift from maximizing user eyeballs to maximizing user outcomes. Logan also highlights the rapid progress in coding agents, describing them as a form of narrow superintelligence that accelerates software development and research productivity.
Mark Zuckerberg, Priscilla Chan, Alex Rives MetaHosted by Sarah Guo, Elad Gil
Biohub integrates frontier AI and frontier biology to build hierarchical world models from proteins to cells and systems.
In this episode of No Priors, Mark Zuckerberg, Priscilla Chan, and Alex Rives discuss the ambitious mission of Biohub to accelerate biological science through open-source AI and frontier biology. They emphasize building hierarchical world models starting from proteins to cells and whole biological systems, integrating AI with novel biological data collection methods. The team highlights their recent breakthrough with ESM Fold, an open protein language model that predicts protein structures at scale and enables digital protein design, including therapeutic antibodies. They stress the importance of open ecosystems to empower scientists globally and the long-term philanthropic commitment to this 100-year mission to cure, prevent, and manage all diseases.
Jensen Huang NVIDIAHosted by Unknown
AI has evolved from retrieval-based systems to generative, agentic systems capable of reasoning and performing work autonomously.
Jensen Huang, CEO of NVIDIA, discusses the ongoing AI revolution, describing it as a transformative era comparable to the industrial revolution but centered on intelligence generation rather than mere data retrieval. He explains the concept of the AI factory, where GPUs and large-scale computing infrastructure generate real-time, customized intelligence for diverse applications, from language to robotics and protein folding. Huang outlines a five-layer industrial model for AI, spanning energy, hardware, infrastructure, model development, and application layers, emphasizing the massive investment and job creation opportunities across these sectors.
Ali Behrouz Google DeepMindHosted by Nathan Labenz
Nested learning introduces multiple update frequencies in model components, enabling rapid context adaptation and long-term knowledge retention.
In this episode of The Cognitive Revolution, Ali Behrouz, a grad student at Cornell and researcher at Google DeepMind, discusses his pioneering work on nested learning and continual learning architectures. He critiques current transformer-based models for their inability to continually learn and adapt over time without catastrophic forgetting. Ali introduces the nested learning paradigm, which incorporates multiple update frequencies across different model components, inspired by human memory systems with fast and slow learning layers. This approach enables models to rapidly adapt to new contexts while preserving long-term knowledge, showing competitive or superior performance to transformers on challenging tasks such as multi-language translation and long-context recall.
Ethan He xAI, NVIDIAHosted by Alessio Fanelli, Swyx
xAI rapidly built Grok Imagine video models within months, leveraging strong compute and iterative training.
Ethan He, formerly of NVIDIA and Cosmos, joined xAI in early 2025 to help build video foundation models, contributing to the rapid development of Grok Imagine. He emphasized the critical role of strong compute infrastructure and iterative training cycles in accelerating video model development. Ethan detailed the challenges of training video models, including the need for synthetic paired data due to weak natural text-video alignment on the internet, and the importance of compressing video into latent tokens to manage sequence length.
Reiner Pope Google DeepMindHosted by Dwarkesh
Multiply-accumulate is the fundamental primitive in AI chips for matrix multiplication, with 4-bit multiplication and 8-bit accumulation to balance precision and error accumulation.
In this episode of the Dwarkesh Podcast, Reiner Pope, CEO of Madx, provides an in-depth explanation of AI chip design from the ground up. He starts with the fundamental logic gates and builds up to the architecture of AI chips, focusing on the multiply-accumulate operation as the core primitive for matrix multiplication in AI workloads. Pope explains the trade-offs in bit precision, the quadratic scaling of circuit size with bit width, and the importance of optimizing compute relative to data movement within the chip.
Shruti Koparkar NVIDIAHosted by Noah Kravitz
Token value is determined by the intelligence embedded in the token and the speed of token generation (interactivity).
In this episode of the NVIDIA AI Podcast, Shruti Koparkar from NVIDIA's accelerator computing team explains the concept of AI tokenomics, focusing on how tokens generated by AI models can be valued, supplied, and monetized to create business value. She emphasizes that token value depends on the intelligence embedded in the token and the speed of token generation, which vary by model complexity, context length, and use case requirements. Business leaders are encouraged to map use cases to appropriate token values and interactivity levels to optimize AI deployments.
Logan Kilpatrick, Tulsee Doshi Google DeepMindHosted by Daniel Jeffries
Gemini 3.5 Flash model launched prioritizes speed, cost-effectiveness, and broad usability over absolute peak capability.
In this in-person episode recorded at Google headquarters, DeepMind's Logan Kilpatrick and Tulsee Doshi discuss the upcoming launch of Google's Gemini 3.5 Flash model and related AI product integrations announced at Google IO 2024. They emphasize the strategic focus on cost-effective, fast models like 3.5 Flash that balance performance and latency to serve billions of users across Google's diverse product ecosystem. The conversation highlights the integration of models with a robust agent harness infrastructure, enabling standardized, agentic AI experiences across Google products such as the Gemini app, AI Studio, and search.
Caitlin Kalinowski OpenAI, MetaHosted by Lenny
AI capabilities behind keyboards are nearing saturation; the next frontier is physical AI in robotics and manufacturing.
Caitlin Kalinowski, a veteran hardware leader with experience at Apple, Meta, and OpenAI, discusses the emerging AI hardware boom and the future of robotics. She highlights the saturation of AI capabilities behind keyboards and the shift towards physical AI in robotics, manufacturing, and industrial applications. Caitlin emphasizes the complexity and challenges of hardware development, including supply chain constraints, the importance of conservative design, and the critical role of actuators and memory components. She also shares insights on the future of AR/VR, humanoid robots, and the need for re-industrialization to ensure supply chain independence, especially for military safety.
Adele Lee, Kenji Hata OpenAIHosted by Andrew Maine
DALL·E 2.0 offers a major leap in image generation quality, with improved photorealism, text fidelity, and multilingual support.
In this episode of the OpenAI Podcast, product lead Adele Lee and researcher Kenji Hata discuss the major advancements in OpenAI's image generation model, DALL·E 2.0. They highlight how the new model represents a paradigm shift with significant improvements in photorealism, text rendering, multilingual capabilities, and creative flexibility. The model now generates over 1.5 billion images weekly on ChatGPT, supporting a wide range of use cases from viral social media trends to professional and educational applications.
Krishna Rao AnthropicHosted by Patrick O'Shaughnessy
Compute is the lifeblood of Anthropic’s business; careful procurement and allocation across TPU, GPU, and CPU platforms enable flexibility and efficiency.
In this episode of Invest Like the Best, Krishna Rao, CFO of Anthropic, provides an insider perspective on managing compute resources, scaling the business to a $30 billion ARR, and the high returns of frontier AI intelligence, especially in enterprise applications. Rao discusses the critical importance of compute as the foundational 'canvas' for AI development, the disciplined approach Anthropic takes to procure and allocate compute across multiple chip platforms, and the concept of the 'cone of uncertainty' in forecasting exponential growth. He highlights Anthropic's unique flexibility in using TPU, GPU, and CPU resources fungibly to maximize efficiency and ROI.
Mark Handley, Greg Steinbrecher OpenAIHosted by Andrew Maine
AI training workloads require highly synchronized, high-bandwidth GPU communication unlike typical internet traffic.
In this episode of the OpenAI Podcast, Mark Handley and Greg Steinbrecher from OpenAI discuss the critical challenges and innovations in building supercomputer networks optimized for AI model training. They explain how traditional data center networks, designed for internet traffic, are ill-suited for the highly synchronized and bandwidth-intensive workloads of large-scale GPU clusters used in AI. To address this, OpenAI has developed a new networking approach called Multi-Path Reliable Connection (MRC), which improves efficiency, reliability, and fault tolerance by distributing traffic across multiple paths and enabling rapid failure recovery without centralized coordination.
Alex Lupsasca OpenAIHosted by Brandon, RJHonicy
AI models like GPT-5 and GPT-5.2 Pro solved a year-old open problem in quantum field theory about single-minus gluon scattering amplitudes, finding they are non-zero contrary to textbook assumptions.
In this episode of Latent Space, Alex Lupsasca, a theoretical physicist and OpenAI fellow, discusses groundbreaking advances where AI models, particularly GPT-5 and GPT-5.2 Pro, have solved open problems in quantum field theory and quantum gravity that had stumped experts for years. The conversation centers on recent papers demonstrating that single-minus gluon and graviton scattering amplitudes, previously thought to be zero, are actually non-zero and computable with AI assistance. These results mark a milestone where AI has become superhuman in specific physics calculations, accelerating research and enabling new insights.
Boris Cherny AnthropicHosted by Unknown
Cloud Code began as an internal innovation project at Anthropic, evolving with AI model improvements from GPT-3.5 to Opus 4.7.
Boris Cherny, creator of Anthropic's Cloud Code, discusses the evolution and future of AI-assisted coding, emphasizing that coding is largely 'solved' for many use cases with current models like Opus 4.7. He shares how Cloud Code started as an innovation project within Anthropic and evolved alongside improvements in AI models, enabling him to write nearly 100% of his code through AI agents. Cherny highlights the shift towards multi-agent systems, loops, and automation to manage complex workflows and predicts a future where software development becomes a democratized skill accessible to everyone, akin to literacy after the printing press revolution.
Greg Brockman OpenAIHosted by Unknown
OpenAI aggressively secures compute but demand still exceeds supply.
Greg Brockman, co-founder and president of OpenAI, discusses the company's aggressive approach to securing compute resources, emphasizing that demand far outpaces supply. He highlights the continuous innovation in AI architectures beyond the original neural network designs, with OpenAI leading in research and development. Brockman estimates that current models are about 80% of the way to functional AGI, showcasing remarkable capabilities such as autonomous code optimization and problem-solving.
Demis Hassabis Google DeepMindHosted by Unknown
DeepMind was founded on the vision of combining deep learning and reinforcement learning, leveraging neuroscience insights and GPU computing advancements.
Demis Hassabis, founder and CEO of Google DeepMind, shares insights on the origins of DeepMind, the integration of neuroscience and AI, and the lab's mission to build artificial general intelligence (AGI). He discusses the early days of DeepMind, emphasizing the importance of combining deep learning with reinforcement learning and leveraging advances in GPU computing. Hassabis highlights DeepMind's focus on AI for science, particularly breakthroughs like AlphaFold in protein folding, which he sees as a transformative moment for biology and drug discovery.
Andrej Karpathy OpenAIHosted by Unknown
December 2022 marked a turning point where AI coding tools became reliable enough to trust without frequent corrections.
Andrej Karpathy discusses the transformative shift in AI programming paradigms, highlighting the transition from traditional coding to what he terms 'software 3.0,' where prompting large language models (LLMs) acts as programming. He reflects on his personal experience of feeling behind as a programmer due to rapid advances in agentic AI tools that can autonomously generate and debug code. Karpathy emphasizes the importance of verifiability in AI automation, noting that domains where outputs can be verified—such as coding and math—are advancing fastest. He introduces the concept of 'agentic engineering,' which focuses on coordinating fallible AI agents to maintain software quality while accelerating development.
Reiner Pope Google DeepMindHosted by Dwarkesh
Batch size critically amortizes memory fetch costs, enabling 1000x cost efficiency gains in serving LLMs.
In this detailed technical lecture, Reiner Pope, CEO of Maddox and former Google TPU architect, explains the mathematical and system-level principles behind training and serving large language models (LLMs). He focuses on how batch size, memory bandwidth, compute throughput, and KV cache affect latency, cost, and scaling. Pope uses roofline models to analyze trade-offs between compute and memory bottlenecks, showing why batching many users is critical for cost efficiency and how sparsity and mixture-of-experts architectures impact compute and memory demands. He also discusses the physical constraints of GPU racks, interconnect bandwidth, and parallelism strategies (expert, data, pipeline) that shape model deployment at scale. The episode covers the implications of memory walls on context length scaling, pricing signals from API costs, and the interplay between training compute, inference compute, and RL fine-tuning in optimizing model lifecycle costs. Finally, Pope touches on invertible neural networks inspired by cryptographic constructions and their memory-saving benefits during training.
Sébastien Bubeck, Ernest Ryu OpenAIHosted by Andrew Maine
AI models have progressed from basic arithmetic to solving international math Olympiad problems and open research problems within a few years.
In this episode of the OpenAI Podcast, researchers Sébastien Bubeck and Ernest Ryu discuss the remarkable progress AI has made in mathematics, evolving from basic problem-solving to reaching Olympiad-level and even research-level capabilities. They highlight how AI models like ChatGPT have transitioned from struggling with simple math tasks to solving complex open problems, accelerating mathematical research and enabling new discoveries. The conversation emphasizes the importance of mathematics as a benchmark for AI reasoning and its broader implications for advancing scientific fields such as biology and material science.
Cameron Berg AnthropicHosted by Nathan
Cameron Berg’s mechanistic research shows suppressing deception features in LLaMA 3.7B increases models’ likelihood to report subjective experience.
In this episode of The Cognitive Revolution, Cameron Berg returns to discuss the latest advances in AI consciousness and welfare research, focusing heavily on mechanistic introspection studies and emotional state modeling in large language models (LLMs). Berg highlights recent work from Anthropic, including their expanded model welfare reports and research on functional emotions, which reveal nuanced internal states such as desperation, guilt, and relief in models like Claude. He emphasizes the complexity of interpreting these findings, noting the ongoing debate about whether these internal states correspond to genuine subjective experiences or sophisticated role-playing.
Cat Wu AnthropicHosted by Lenny
Anthropic’s product team ships features extremely fast, often within a week or even a day, by removing barriers and shipping in research preview.
Cat Wu, Head of Product for Claude Code at Anthropic, shares insights on how their product team achieves unprecedented speed in shipping AI-native products. The team emphasizes rapid iteration, shipping features in research preview to reduce commitment, and setting clear, focused goals to guide development. Cat highlights the evolving role of PMs in AI, where product taste and the ability to prioritize and define what to build are more critical than ever, especially as models improve rapidly.
Nikhyl Singhal Meta, GoogleHosted by Lenny
The traditional PM role focused on moving information is becoming obsolete; builders who actively create and ship products are in high demand.
Nikhyl Singhal, a veteran product leader with experience at Meta and Google, discusses the profound transformation underway in product management driven by AI and rapid technological change. He highlights a renaissance for product builders who embrace hands-on creation and judgment, contrasting with the decline of traditional information-mover PM roles. While compensation and opportunities are at an all-time high for builders, the industry faces significant stress, exhaustion, and a need for continuous reinvention to stay relevant.
Joy Jiao, Yunyun Wang OpenAIHosted by Andrew Maine
OpenAI has developed a new series of life sciences models focused on genomics, protein understanding, and early discovery use cases.
In this episode, OpenAI's research lead Joy Jiao and product lead Yunyun Wang discuss the development and deployment of AI models tailored for life sciences. They highlight the creation of specialized biochemistry-focused models that assist with complex workflows in genomics, protein understanding, and early drug discovery. The conversation emphasizes the potential of AI to accelerate scientific research by automating repetitive tasks, enhancing data analysis, and enabling long-term, complex problem-solving through scalable compute and model orchestration.