MLOps Community
Home
/
Collections
/
MLOps Community Podcast

MLOps Community Podcast

Popular topics
# LLMs
# LLM in Production
# AI Agents
# Agents in Production
# AI
# LLM
# Machine Learning
# MLOps
# Rungalileo.io
# MLops
# RAG
# Prosus Group
# Generative AI
# Interview
# Machine learning
# Tecton.ai
# Arize.com
# mckinsey.com/quantumblack
# Redis.io
# Zilliz.com
Video

Sandboxing, Agent Harnesses, and Agent Teamwork

Shahram Anver is the Co-Founder and CEO of Cleric, the autonomous AI SRE that investigates and root-causes production issues like an experienced teammate — often in under two minutes. Before Cleric, Shahram led MLOps, DevOps, and FinOps platform engineering at Gojek, Southeast Asia's super-app. In this conversation, he breaks down why production operations never kept pace with AI-accelerated development, and why the real unlock for an AI SRE isn't faster triage — it's an agent that *learns* and compounds operational memory across your whole org.
# AI SRE
# Site Reliability Engineering
# AI Agents
Shahram Anver
Demetrios Brinkmann
Shahram Anver & Demetrios Brinkmann · Jun 19th, 2026
1:19:53
Video

MCP Servers Are Becoming the UI for AI Agents

Naseem Al-Naji is the co-founder of MCPcat.io and the creator of Opal — a builder with deep roots in privacy-first developer tooling. In this conversation, he breaks down why MCP servers have become a black box in production, and how MCPcat gives teams X-ray vision into how agents and users actually behave.
# MCP
# AI Agents
# Observability
Naseem Al-Naji
Naseem Al-Naji · Jun 16th, 2026
47:22
Video

Agents & the $40M Bet on Multiplayer AI

Stanislas Polu is Co-Founder & CTO of Dust — the enterprise AI agent platform used by 51,000 workers at 3,000+ companies. Before Dust, he spent three years on OpenAI's research team under Ilya Sutskever, working on mathematical reasoning in language models, and prior to that was an engineer at Stripe. He brings a rare combination of frontier AI research and product-building experience to the enterprise agent space.
# Enterprise AI
# AI Agents
# Dust
Stanislas Polu
Demetrios Brinkmann
Stanislas Polu & Demetrios Brinkmann · Jun 12th, 2026
1:20:47
Video

From Single-Player to Multi-Player: Operating AI Agents at Scale

James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.
# Agentic AI
# AI Agents
# AI Engineering
James Everingham
Demetrios Brinkmann
James Everingham & Demetrios Brinkmann · Jun 9th, 2026
55:55
Video

The Control-vs-Magic Spectrum Building Agents

Thiago Cardoso is the Director of Data & AI at iFood and the architect behind iFood Pago's AI agent platform. This fintech system serves millions of restaurants across Brazil through WhatsApp and the iFood app. In this episode, he breaks down what it actually takes to ship agentic AI in production at scale.
# iFood
# AI Agents
# Fintech AI
# Agentic AI
# Prosus Group
Thiago Cardoso
Demetrios Brinkmann
Thiago Cardoso & Demetrios Brinkmann · Jun 5th, 2026
43:19
Video

Logs Are All You Need: Rethinking Observability with AI Agents

Sherwood Callaway is the founder of Sazabi (YC P26), the AI-native observability platform built for engineering teams who ship fast. He previously founded and exited a YC company — now he's back, betting that logs are all you need to replace Datadog.
# AI Observability
# Datadog Alternative
# AI Agents
# Logs
Sherwood Callaway
Demetrios Brinkmann
Sherwood Callaway & Demetrios Brinkmann · Jun 2nd, 2026
46:40
Video

AI Is Fast. AI Projects Are Slow. Let's Fix That.

Joe Maionchi (Co-founder & COO) and Rod Christensen (Co-founder & Chief Architect) of RocketRide join the MLOps Community to walk through AIDE — the AI Integrated Development Environment. RocketRide is an open-source AI pipeline platform that lets developers build, debug, and run production-grade agentic AI workflows directly from their IDE, with support for 13+ LLM providers, 8+ vector databases, and full multi-agent orchestration.
# AI Agents
# Open Source AI
# LLM Pipelines
# RocketRide
Joe Maionchi
Rod Christensen
Demetrios Brinkmann
Joe Maionchi, Rod Christensen & Demetrios Brinkmann · May 29th, 2026
56:48
Video

Inside Just Eat's AI Lab: Voice Agents & Agentic Commerce

Guthrie Cooper (Senior Group Product Manager, AI & Robotics) and Nidhi Sharma (Global Head of Engineering AI & Incubation) from Just Eat Takeaway.com join the MLOps.community to pull back the curtain on how one of Europe's largest food delivery platforms is running an internal innovation engine. From autonomous delivery robots to agentic AI voice assistants, they share what it actually takes to build like a startup inside a 40,000-person company.
# Food Delivery AI
# Corporate Innovation
# Autonomous Delivery
# Just Eat AI
# Prosus Group
Guthrie Cooper
Nidhi Sharma
Demetrios Brinkmann
Guthrie Cooper, Nidhi Sharma & Demetrios Brinkmann · May 26th, 2026
1:18:39
Video

Autonomous Agents at Work: From OpenClaw Hype to Enterprise Reality

Pramod Krishnan is a Managing Director - AI Managed Services at PwC, specializing in enterprise AI transformation — helping large organizations move from AI experimentation to production operating models. In this episode with Demetrios, Pramod breaks down exactly what the OpenClaw wave means for enterprises, and the control frameworks PwC uses before a single agent touches production.
# OpenClaw
# PwC
# Agentic AI
Pramod Krishnan
Demetrios Brinkmann
Pramod Krishnan & Demetrios Brinkmann · May 19th, 2026
42:19
Video

The Latency Goldilocks Zone Explained

Rafael (Head of Innovation, iFood) and Daniel (Data and AI Manager, iFood) pull back the curtain on ILO-Agent — iFood's conversational AI ordering system built for 200 million users across Latin America. Recorded live at AI House Amsterdam, this conversation goes deep on the engineering and product decisions behind building recommendation systems, agentic AI, and why the speed of your AI's response might actually be destroying user trust.
# Conversational AI
# iFood
# AI Agents
# Prosus Group
Rafael Borger
Daniel Wolbert
Demetrios Brinkmann
Rafael Borger, Daniel Wolbert & Demetrios Brinkmann · May 12th, 2026
48:14
Video

Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

Before MCP was a standard and before LangChain was widely adopted, his team had already shipped their own orchestration layer and tool protocol in production. This conversation is a rare look at what it takes to build an agentic system that actually books trips, runs on WhatsApp, and keeps adding capabilities without falling over.
# Agentic AI
# MCP
# Ai agents
Nicolás Alejandro  Bogliolo
Demetrios Brinkmann
Nicolás Alejandro Bogliolo & Demetrios Brinkmann · May 11th, 2026
41:13
Video

Voice Agent Use Cases

Anurag Beniwal (Member of Technical Staff at ElevenLabs) breaks down the real-world challenges of building voice agents—from latency, transcription accuracy, and turn-taking to the tradeoffs between cascaded systems and end-to-end speech models. The conversation explores why production systems rely on “constellations” of models, how to design for non-technical users (especially in customer support), and why voice unlocks richer context—but introduces far more complexity than chat. Ultimately, it’s a deep dive into making voice AI practical, reliable, and usable at scale.
# Voice
# AI Agents
# Customer Support AI
# Amazon
Anurag Beniwal
Demetrios Brinkmann
Anurag Beniwal & Demetrios Brinkmann · May 1st, 2026
51:05
Video

The Creator of Superpowers: Why Real Agentic Engineering Beats Vibe Coding

Jesse Vincent breaks down how modern “agentic” software development is shifting from writing code to managing intelligent systems. He shares how his Superpowers toolkit uses structured workflows, skills, and subagents to turn vague ideas into executable plans—emphasizing that clarity of intent matters more than coding itself. The conversation explores how AI agents can be guided using psychology, why separating roles (planner, implementer, reviewer) leads to better outcomes, and how iteration—not perfection—builds powerful workflows. Ultimately, the future of software isn’t code—it’s specs, judgment, and orchestrating agents to do the work.
# Superpowers
# Claude Code
# Developer Tools
Jesse Vincent
Demetrios Brinkmann
Jesse Vincent & Demetrios Brinkmann · Apr 24th, 2026
1:06:56
Video

It's 2026, and We're Still Talking Evals

Most teams treat evals like a last-minute checkbox—ship first, panic later—but that’s exactly backwards. The real edge comes from treating evals as a continuous, evolving system from day one, not a static test suite. Because here’s the uncomfortable truth: LLMs don’t fail cleanly or consistently, and neither do your users. If you’re not constantly adapting how you evaluate, you’re basically flying blind—just with more features to hide it.
# AI Evals
# LLM Evaluation
# AI Product Management
Maggie Konstanty
Demetrios Brinkmann
Maggie Konstanty & Demetrios Brinkmann · Apr 21st, 2026
40:57
Video

Why Agents are Driving Software Development to the Cloud

# AI Agents
# Cloud Development
# Warp Terminal
Zach Lloyd
Demetrios Brinkmann
Zach Lloyd & Demetrios Brinkmann · Apr 17th, 2026
51:08
Video

The Modern Software Engineer

Conversation with Mihail Eric on how agent-driven development is reshaping engineering work, faster iteration, new failure modes, and shifting team dynamics. Focus on validation, cost tradeoffs, and what breaks when code is mostly generated rather than written.
# Software Engineering
# Coding Agents
# AI Engineering
Mihail  Eric
Demetrios Brinkmann
Mihail Eric & Demetrios Brinkmann · Apr 15th, 2026
53:38
Video

How We Cut LLM Latency 70% With TensorRT in Production

Scaling LLMs in production requires balancing cost, latency, and performance. Through techniques like dynamic GPU scaling and TensorRT optimization, latency was reduced by up to 70%, while iterative learning and tight alignment with business goals ensured strong ROI.
# GPU
# GPU Optimization
# AI Agents
Maher Hanafi
Demetrios Brinkmann
Maher Hanafi & Demetrios Brinkmann · Apr 10th, 2026
1:05:20
Video

Getting Humans Out of the Way: How to Work with Teams of Agents

Most people cripple coding agents by micromanaging them—reviewing every step and becoming the bottleneck. The shift isn’t to better supervise agents, but to design systems where they work well on their own: parallelized, self-validating, and guided by strong processes. Done right, you don’t lose control—you gain leverage. Like paving roads for cars, the real unlock is reshaping the environment so AI can move fast.
# AI Agents
# Parallel Agents
# Broomy
Robert Ennals
Demetrios Brinkmann
Robert Ennals & Demetrios Brinkmann · Apr 7th, 2026
50:31
Video

Fixing GPU Starvation in Large-Scale Distributed Training

Kashish zooms out to discuss a universal industry pattern: how infrastructure—specifically data loading—is almost always the hidden constraint for ML scaling. The conversation dives deep into a recent architectural war story. Kashish walks through the full-stack profiling and detective work required to solve a massive GPU starvation bottleneck. By redesigning the Petastorm caching layer to bypass CPU transformation walls and uncovering hidden distributed race conditions, his team boosted GPU utilization to 60%+ and cut training time by 80%. Kashish also shares his philosophy on the fundamental trade-offs between latency and efficiency in GPU serving.
# GPU Starvation
# Uber ML
# ML Infrastructure
Kashish Mittal
Demetrios Brinkmann
Kashish Mittal & Demetrios Brinkmann · Apr 3rd, 2026
52:49
Video

This One Shift Makes Developers Obsolete

AI agents are shifting the role of developers from writing code to defining intent. This conversation explores why specs are becoming more important than implementation, what breaks in real-world systems, and how engineering teams need to rethink workflows in an agent-driven world.
# AI Agents
# Software Engineering
# AI in Production
Jens Bodal
Demetrios Brinkmann
Jens Bodal & Demetrios Brinkmann · Mar 31st, 2026
59:12
Video

arrowspace: Vector Spaces and Graph Wiring

Meet arrowspace — an open-source library for curating and understanding LLM datasets across the entire lifecycle, from pre-training to inference. Instead of treating embeddings as static vectors, arrowspace turns them into graphs (“graph wiring”) so you can explore structure, not just similarity. That unlocks smarter RAG search (beyond basic semantic matching), dataset fingerprinting, and deeper insights into how different datasets behave. You can compare datasets, predict how changes will affect performance, detect drift early, and even safely mix data sources while measuring outcomes. In short: arrowspace helps you see your data — and make better decisions because of it.
# arrowspace
# Vector Search
# Epipelxity
Lorenzo Moriondo
Demetrios Brinkmann
Lorenzo Moriondo & Demetrios Brinkmann · Mar 27th, 2026
56:01
Video

A New Kind of Marketplace

Marketplaces are about to get weird. With Pedro Chaves and Donné Stevenson: agents picking your house, negotiating deals, even talking to other agents for you. Less browsing. Less choice. More automation. Convenience… or giving up control?
# AI Agents
# Marketplace
# Prosus
# OLX
Donné Stevenson
Pedro Chaves
Demetrios Brinkmann
Donné Stevenson, Pedro Chaves & Demetrios Brinkmann · Mar 20th, 2026
51:26
Video

Durable Execution and Modern Distributed Systems

A new paradigm is emerging for building applications that process large volumes of data, run for long periods of time, and interact with their environment. It’s called Durable Execution and is replacing traditional data pipelines with a more flexible approach. Durable Execution makes regular code reliable and scalable. In the past, reliability and scalability have come from restricted programming models, like SQL or MapReduce, but with Durable Execution this is no longer the case. We can now see data pipelines that include document processing workflows, deep research with LLMs, and other complex and LLM-driven agentic patterns expressed at scale with regular Python programs. In this session, we describe Durable Execution and explain how it fits in with agents and LLMs to enable a new class of machine learning applications.
# AI Agents
# AI Engineer
# AI agents in production
# AI agent usecase
# System Design
Johann Schleier-Smith
Demetrios Brinkmann
Johann Schleier-Smith & Demetrios Brinkmann · Mar 17th, 2026
1:00:37
Video

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

In today’s era of massive generative models, it's important to understand the full scope of AI systems' performance engineering. This talk discusses the new O'Reilly book, AI Systems Performance Engineering, and the accompanying GitHub repo (https://github.com/cfregly/ai-performance-engineering). This talk provides engineers, researchers, and developers with a set of actionable optimization strategies. You'll learn techniques to co-design and co-optimize hardware, software, and algorithms to build resilient, scalable, and cost-effective AI systems for both training and inference.
# NVIDIA GPUs
# CUDA framework
# GitHub repo
Chris Fregly
Demetrios Brinkmann
Chris Fregly & Demetrios Brinkmann · Feb 24th, 2026
1:25:49
Video

The Future of Information Retrieval: From Dense Vectors to Cognitive Search

Information Retrieval is evolving from keyword matching to intelligent, vector-based understanding. In this talk, Rahul Raja explores how dense retrieval, vector databases, and hybrid search systems are redefining how modern AI retrieves, ranks, and reasons over information. He discusses how retrieval now powers large language models through Retrieval-Augmented Generation (RAG) and the new MLOps challenges that arise, embedding drift, continuous evaluation, and large-scale vector maintenance. Looking ahead, the session envisions a future of Cognitive Search, where retrieval systems move beyond recall to genuine reasoning, contextual understanding, and multimodal awareness. Listeners will gain insight into how the next generation of retrieval will bridge semantics, scalability, and intelligence, powering everything from search and recommendations to generative AI.
# AI Agents
# AI Engineer
# AI agents in production
# AI Agents use case
# System Design
Rahul   Raja
Demetrios Brinkmann
Rahul Raja & Demetrios Brinkmann · Feb 17th, 2026
1:02:53
Video

Rethinking Notebooks Powered by AI

Vincent Warmerdam joins Demetrios fresh off marimo’s acquisition by Weights & Biases—and makes a bold claim: notebooks as we know them are outdated. They talk Molab (GPU-backed, cloud-hosted notebooks), LLMs that don’t just chat but actually fix your SQL and debug your code, and why most data folks are consuming tools instead of experimenting. Vincent argues we should stop treating notebooks like static scratchpads and start treating them like dynamic apps powered by AI. It’s a conversation about rethinking workflows, reclaiming creativity, and not outsourcing your brain to the model.
# Vincent D. Warmerdam
# Calmcode
# marimo
# wandb
# Jupiter Notebooks
# Data Science
Vincent D. Warmerdam
Demetrios Brinkmann
Vincent D. Warmerdam & Demetrios Brinkmann · Feb 13th, 2026
26:14
Video

Software Engineering in the Age of Coding Agents: Testing, Evals, and Shipping Safely at Scale

A conversation on how AI coding agents are changing the way we build and operate production systems. We explore the practical boundaries between agentic and deterministic code, strategies for shared responsibility across models, engineering teams, and customers, and how to evaluate agent performance at scale. Topics include production quality gates, safety and cost tradeoffs, managing long-tail failures, and deployment patterns that let you ship agents with confidence.
# AI Agents
# AI Engineer
# AI agents in production
# AI Agents use case
# System Design
Ereli Eran
Demetrios Brinkmann
Ereli Eran & Demetrios Brinkmann · Feb 10th, 2026
57:24
Video

Physical AI: Teaching Machines to Understand the Real World

As AI moves beyond the cloud and simulation, the next frontier is Physical AI: systems that can perceive, understand, and act within real-world environments in real time. In this conversation, Nick Gillian, Co-Founder and CTO of Archetype AI, explores what it actually takes to turn raw sensor and video data into reliable, deployable intelligence. Drawing on his experience building Google’s Soli and Jacquard and now leading development of Newton, a foundational model for Physical AI, Nick discusses how real-time physical understanding changes what’s possible across safety monitoring, infrastructure, and human–machine interaction. He’ll share lessons learned translating advanced research into products that operate safely in dynamic environments, and why many organizations underestimate the challenges and opportunities of AI in the physical world.
# AI Agents
# AI Engineer
# AI agents in production
# AI Agents use case
# System Design
Nick Gillian
Demetrios Brinkmann
Nick Gillian & Demetrios Brinkmann · Feb 6th, 2026
52:04
Video

Speed and Scale: How Today's AI Datacenters Are Operating Through Hypergrowth

Hundreds of neocloud operators and "AI Factory" builders have emerged to serve the insatiable demand for AI infrastructure. These teams are compressing the design, build, deploy, operate, scale cycle of their infrastructures down to months, while managing massive footprints with lean teams. How? By applying modern intent driven infrastructure automation principles to greenfield deployments. We'll explore how these teams carry design intent through to production, and how operating and automating around consistent infrastructure data is compressing "time to first train".
# AI Agents
# AI Engineer
# AI agents in production
# AI Agents use case
# System Design
Kris Beevers
Demetrios Brinkmann
Kris Beevers & Demetrios Brinkmann · Feb 3rd, 2026
1:07:17
Video

Cracking the Black Box: Real-Time Neuron Monitoring & Causality Traces

As AI models move into high-stakes environments like Defence and Financial Services, standard input/output testing, evals, and monitoring are becoming dangerously insufficient. To achieve true compliance, MLOps teams need to access and analyse the internal reasoning of their models to achieve compliance with the EU AI Act, NIST AI RMF, and other requirements. In this session, Mike introduces the company's patent-pending AI assurance technology that moves beyond statistical proxies. He will break down the architecture of the Synapses Logger, a patent-pending technology that embeds directly into the neural activation flow to capture weights, activations, and activation paths in real-time.
# EU AI Act
# Regulations Compliance
# Tikos
Mike Oaten
Demetrios Brinkmann
Mike Oaten & Demetrios Brinkmann · Jan 27th, 2026
45:46
Code of Conduct
Your Privacy Choices