MLOps Community

Content Collections

Agents in Production - Prosus x MLOps

23:12
Video
Using Agents in Production: Past Present and Future // Euro Beinat
31:41
Video
From Chat Fatigue to Instant Action: Transforming Dealer Engagement Through Intelligent UI // Donné Stevenson

Agents in Production 2025

38:34
Video
The Future of Compute: How AI Agents Are Reshaping Infrastructure // Diego Oppenheimer - Keynote // Agents in Production 2025
28:13
Video
Driving Evaluation-Driven Development with MLflow 3.0 // Yuki Watanabe // Agents in Production 2025
27:17
Video
Beyond Chatbots: How to build Agentic AI systems with Google Gemini // Philipp Schmid // Agents in Production 2025
12:45
Video
Advancing the Cost-Quality Frontier in Agentic AI // Krista Opsahl-Ong // Agents in Production 2025
10:07
Video
How to Build Execution Layers That Don’t Burn Out // Tanmay Tiwari // Agents in Production 2025

AI in Production 2025

33:37
Video
Eval Driven Development: Best Practices and Pitfalls When Building with AI // Raza Habib & Brianna Connelly// AI in Production 2025
34:05
Video
The LLM Guardrails Index: Benchmarking Responsible AI Deployment // Shreya Rajpal // AI in Production 2025
16:50
Video
The AI Developer Experience Sucks so Let's Fix it // Erik Bernhardsson // AI in Production
32:21
Video
Bridging the Gap between Model Development and AI Infrastructure // Mohan Atreya // AI in Production 2025
37:51
Video
Challenges of Working with Voice AI Agents // Panel // AI in Production 2025

All Content

All Tags
All Types
Samuel Partee
Nate Barbettini
Demetrios Brinkmann
Samuel Partee, Nate Barbettini & Demetrios Brinkmann · Jun 23rd, 2026
The Dark Side of MCP Servers
Sam Partee (CTO & co-founder of Arcade.dev) and Nate Barbettini (Founding Engineer at Arcade.dev) sit down at the MCP Dev Summit to unpack what nobody wants to admit about the Model Context Protocol: the security model is still full of sharp edges. From tool poisoning and prompt injection to why OAuth got bolted onto the spec, this is a builder 's-eye view of where MCP breaks — and how to ship agents safely anyway.
# MCP
# AI Agent Security
# Tool Poisoning
# Arcade.dev
Médéric Hurier
Médéric Hurier · Jun 23rd, 2026
Managing subagent personas across multiple AI coding platforms (like Claude Code, Gemini CLI, and Copilot) leads to fragmented configuration files and instruction drift. This article introduces `supagents`, a lightweight, idempotent Python CLI tool that lets developers maintain subagent definitions in a single markdown file with multi-target frontmatter. The tool automatically compiles and distributes these instructions to the correct paths, ensuring a consistent persona across all development environments.
# Artificial Intelligence
# Software Engineering
# Open Source
# Ai agents
# Coding
Shahram Anver
Demetrios Brinkmann
Shahram Anver & Demetrios Brinkmann · Jun 19th, 2026
Shahram Anver is the Co-Founder and CEO of Cleric, the autonomous AI SRE that investigates and root-causes production issues like an experienced teammate — often in under two minutes. Before Cleric, Shahram led MLOps, DevOps, and FinOps platform engineering at Gojek, Southeast Asia's super-app. In this conversation, he breaks down why production operations never kept pace with AI-accelerated development, and why the real unlock for an AI SRE isn't faster triage — it's an agent that *learns* and compounds operational memory across your whole org.
# AI SRE
# Site Reliability Engineering
# AI Agents
Varsha  Prasad Narsing
Francisco Javier Arceo
David DeStefano
+3
Varsha Prasad Narsing, Francisco Javier Arceo, David DeStefano & 3 content:more content:speakers · Jun 17th, 2026
Retrieval-Augmented Generation and agentic AI are increasingly common in enterprise deployments, but real enterprise environments introduce challenges largely absent from academic treatments and consumer-facing APIs: multiple tenants with heterogeneous data, strict access-control requirements, regulatory compliance, and cost pressures that demand shared infrastructure. This paper identifies a fundamental problem underlying existing RAG architectures in these settings. Retrieval systems rank documents by relevance, not by authorization, so a query from one tenant can surface another tenant’s confidential data simply because it scores highest. The authors formalize this relevance-authorization gap alongside related shortcomings (tool-mediated disclosure, context accumulation across turns, client-side orchestration bypass) and introduce a layered isolation architecture combining policy-aware ingestion, retrieval-time gating, and shared inference, enforced through server-side orchestration. They validate it through an open-source implementation in OGX, a vendor-neutral OpenAI-compatible Responses API, showing empirically that ABAC gating eliminates cross-tenant leakage while introducing negligible overhead.
# Agent Security
# AgenticRAG
# RAG
German Krikorian
Raj Katakam
Mick Jermsurawong
+2
German Krikorian, Raj Katakam, Mick Jermsurawong & 2 content:more content:speakers · Jun 17th, 2026
Real-time ML use cases like personalization and risk decisioning come with a unique set of challenges: serving fresh feature values at low latency for inference, generating temporally consistent backfills for training, and building complex chains of on-demand, batch, and streaming transformations. In this roundtable, practitioners from Intuit, CreditKarma, Depop, and OpenAI share how they use Zipline and the OSS Chronon project to solve these challenges and deploy real-time ML use cases in production.
# Real-time ML
# Zipline
# Chronon
Naseem Al-Naji
Naseem Al-Naji · Jun 16th, 2026
Naseem Al-Naji is the co-founder of MCPcat.io and the creator of Opal — a builder with deep roots in privacy-first developer tooling. In this conversation, he breaks down why MCP servers have become a black box in production, and how MCPcat gives teams X-ray vision into how agents and users actually behave.
# MCP
# AI Agents
# Observability
This blog explains how Claude Code Hooks let you automate and control Claude's behavior throughout its lifecycle, turning it from a coding assistant into a programmable part of your engineering workflow. Hooks can run shell commands, call APIs, invoke LLMs, or launch subagents whenever specific events occur.
# Generative AI
# Agentic AI
# AI Agents
# Claude Code
# MCP
Stanislas Polu
Demetrios Brinkmann
Stanislas Polu & Demetrios Brinkmann · Jun 12th, 2026
Stanislas Polu is Co-Founder & CTO of Dust — the enterprise AI agent platform used by 51,000 workers at 3,000+ companies. Before Dust, he spent three years on OpenAI's research team under Ilya Sutskever, working on mathematical reasoning in language models, and prior to that was an engineer at Stripe. He brings a rare combination of frontier AI research and product-building experience to the enterprise agent space.
# Enterprise AI
# AI Agents
# Dust
James Everingham
Demetrios Brinkmann
James Everingham & Demetrios Brinkmann · Jun 9th, 2026
James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.
# Agentic AI
# AI Agents
# AI Engineering
The blog breaks down human memory into functional layers and maps them to the architectural requirements of AI systems. It shows how separating sensing, storage, context, and reasoning leads to more robust agents, and why today’s embedding‑only approaches fall short.
# AI Agents
# Cognition
# Vector / Similarity / Semantic Search
# Knowledge Graph and Graph Databases
# Multimodal / Generative AI
Code of Conduct
Your Privacy Choices