MLOps Community
Agents in Production - MLOps x Prosus
LIVESTREAM

# Prosus AI

The Virtual AI Event That’s Actually... Fun

We know what you’re thinking: another virtual conference. Talking heads, awkward silences, and the constant urge to check your email… we’ve all been there.

Not this one.

100% AI Agents in Prod. BUT This Isn't Just Another Zoom Link.

Welcome to Agents in Production: the latest edition of the MLOps × Prosus AI Virtual Conference.

We’re bringing together the brightest minds building AI agents, with a high-energy format designed to keep you hooked from start to finish.

Last year, companies stopped experimenting with agents and started deploying them in the real world. We heard from the pioneers who turned hype into working systems - and this year, we’re doubling down.

Expect real progress, real lessons, and real breakthroughs shaping the future of agentic AI. You’ll get cutting-edge, actionable insights that move you from experimentation to full-scale deployment.

30+ Talks on AI Agents. I Promise You Won’t Log Off Early

Why Attend:

  1. Talks from top experts – Real-world lessons, practical insights, and breakthroughs defining agentic AI.
  2. Hilarious skits & live music – Because learning should be fun.
  3. High-energy engagement – Interactive moments that make you part of the action.

If You Miss This, You’ll Miss:

  1. Hard-won lessons – How leading companies are successfully deploying agents at scale.
  2. Deep dives – Technical sessions and workshops from the voices shaping the next generation of AI.
  3. Global connections – Network with innovators and practitioners across the ML community.

This is your chance to get up to speed on the global AI scene, connect with innovators, and experience a virtual event you’ll actually enjoy.

See you there! 

Speakers

Chip Huyen
Researcher @ Tep Studio
Aditya Gautam
Machine Learning Technical Lead @ Meta
Teodora Musatoiu
Solutions Architect @ OpenAI
Adel El Hallak
Senior Director Of Product @ NVIDIA
Panos Stravopodis
Co-Founder & CTO @ Elyos
Jiquan Ngiam
CEO and Co-Founder @ MintMCP
Chenyu Zhang
Founder @ GlowingStar Inc.
Rekha Singhal
Head Research @ Tata Consultancy Services
Santoshkalyan Rayadhurgam
Engineering Leader @ Meta
Swati Bhatia
Product Manager @ Google
Arushi Jain
Senior Applied Scientist @ Microsoft
Donné Stevenson
Machine Learning Engineer @ Prosus Group
Mefta Sadat
Staff Software Engineer @ Loblaw Digital
Sam Partee
Co-Founder @ Arcade.dev
Sachi Shah
Product Manager @ Sierra
Jasleen Singh
Staff Solutions Architect, Generative AI @ Google
Sanjana Sharma
AI Strategist @ Distyl AI
Artem Yushkovskiy
Sr ML Engineer @ Delivery Hero SE
Rosemary Nwosu-Ihueze
Founder @ Soteria
Euro Beinat
Global Head AI and Data Science @ Prosus Group
Washington Amolo
Product Developer @ NaviSmart AI
Benjamin Guo
Co Founder @ Zo Computer
Hamed Taheri
CEO & Founder @ Personize.ai
Phil Stafford
Principal Consultant, Cybersecurity & AI @ Singularity Systems
Quinten Rosseel
AI Engineer @ Wobby
Dirk Petzoldt
Co-Founder @ Explai.com
Tom Kaltofen
Engineer @ mloda
Rachitt Shah
Applied AI consultant @ Transfrm Labs
Vitor Balocco
Co-founder @ Runlayer
Frank Wittkampf
VP Applied AI @ Databook
Laurel Orr
AI Staff Software Engineer @ Stacklok
Benjamin Hindman
Founder & CEO @ Reboot
Ben Epstein
Co-Founder & CTO @ GrottoAI
Matt Sharp
AI Strategist and Principal Engineer @ Flexion
Audi Liu
Senior Product Manager @ Inworld AI
Olga Pavlov
Head of Product @ OLX Group
Isabella Piratininga
Director of Technology & Innovation @ iFood
Paul van der Boor
Senior Director Data Science @ Prosus Group
Demetrios Brinkmann
Chief Happiness Engineer @ MLOps Community
Ricky Doar
VP of Solutions @ Cursor
Nishikant Dhanuka
Senior Director of AI @ Prosus Group
Chiara Caratelli
Data Scientist @ Prosus Group
Simba Khadder
Sr. Manager & Software Engineer @ Redis

Agenda

From 2:00 PM to 2:05 PM GMT
Tags: 1:1 networking
Doors Open

From 2:05 PM to 2:30 PM GMT
Tags: Opening / Closing
Using Agents in Production: Past Present and Future

Prosus has shipped over 7949 agents. 15% have worked. The rest have been learning experiences. Let's talk about what we have learned, and where we see things going.

From 2:35 PM to 3:00 PM GMT
Tags: Fireside Chat
Unlocking Enterprise Value: Proven Strategies for Successful Adoption of AI Developer Tools

In this session, Ricky Doar, VP of Solutions at Cursor, shares actionable insights from leading large-scale AI developer tool implementations at the world’s top enterprises. Drawing on field experience with organizations at the forefront of transformation, Ricky highlights key best practices, observed power-user patterns, and deployment strategies that maximize value and ensure smooth rollout. Learn what distinguishes high-performing teams, how tailored onboarding accelerates adoption, and which support resources matter most for driving enterprise-wide success.

From 3:05 PM to 3:30 PM GMT
Tags: Presentation
Shipping AI at Scale: A look back, current Enterprise patterns, and what’s next

2025 has truly been, and still is, the year of agents. We’ll take a pragmatic look at what's happened and what we learned along the way. Then we'll map the next wave. We’ll distill product patterns behind the wins and preview what’s coming next: from capable agents, to out-of-the-box AI building blocks, and how models like GPT-5 simplify shipping novel, long-tail user experiences.

From 3:35 PM to 3:50 PM GMT
Tags: Presentation
When Agents Learn to Feel: Multi-Modal Affective Computing in Production

The next generation of AI agents won’t just respond to what we say—they will sense how we feel. As large language model–powered agents move from research prototypes into production, a critical frontier is the integration of multi-modal affective computing: combining voice, text, facial expressions, and interaction patterns to detect the learner’s or user’s emotional state in real time.

This talk explores the challenges and opportunities of deploying emotion-aware AI tutors in production environments. Drawing from ongoing research at MIT Media Lab and Harvard, and from startup experience building GlowingStar, I will share how multi-modal signals—speech tone, facial micro-expressions, response latency, and even silence—can be fused into affective state estimates that meaningfully improve user experience.

We will unpack the technical lessons learned from moving affective sensing beyond the lab: designing architectures that combine ensemble LLMs with sensor inputs, diagnosing when modalities conflict or sabotage each other, and establishing guardrails for privacy and consent in sensitive domains like education. In parallel, I will highlight multi-agent orchestration patterns—including critic–rewriter loops and role-based ensembles—that make it possible to personalize instruction, generate equitable feedback, and sustain engagement across diverse learners.

By the end of this session, attendees will have a clear picture of what it takes to move multi-modal, affect-sensing agents from demos to durable production systems: the architectures, the pitfalls, and the metrics that matter. More importantly, we will consider how these lessons extend beyond education to any industry where AI agents must not only think, but also feel with and for the human in the loop.

From 3:50 PM to 4:05 PM GMT
Tags: Presentation
Multi-Agent Personalization with Shared Memory: From Email to Website to Proposal

Personalization at scale needs deep understanding of each customer. You must collect data from many sources, read it, reason and infer, plan, decide, act, and write to each person. One agent doing everything gave us poor and inconsistent quality. Multi-agent systems changed that. They deliver mass personalization. They also break in edge cases, contradict each other, and are hard to debug.

I will share how we addressed this with Cortex UCM, a unified customer memory, and Generative Tables. We map noisy data into a clean, structured layer that agents read and write. We began with email for both outbound and inbound communication. Then we personalized websites and product pages for e-commerce at scale. I will also share customer stories: for example, one customer had over 60,000 product pages that required customization for thousands of communities and product offerings.

I will briefly present our decentralized shared-memory orchestration, how it stays transparent and debuggable, and how it opens safe paths for external agents. What failed. What worked. What we are building next.

From 4:05 PM to 4:35 PM GMT
Tags: Presentation
Simulate to Scale: How realistic simulations power reliable agents in production

In this session, we’ll explore how developing and deploying AI-driven agents demands a fundamentally new testing paradigm—and how scalable simulations deliver the reliability, safety and human-feel that production-grade agents require.

You’ll learn how simulations allow you to:

  • Mirror messy real-world user behavior (multiple languages, emotional states, background noise) rather than scripting narrow “happy-path” dialogues.
  • Model full conversation stacks including voice: turn-taking, background noise, accents, and latency – not just text messages.
  • Embed automated simulation suites into your CI/CD pipeline so that every change to your agent is validated before going live.
  • Assess multiple dimensions of agent performance—goal completion, brand-compliance, empathy, edge-case handling—and continuously guard against regressions.
  • Scale from “works in demo” to “works for every customer scenario” and maintain quality as your agent grows in tasks, languages or domains.
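
As a rough illustration of the CI-gated simulation idea in the list above, here is a minimal Python sketch. Everything in it is hypothetical and not taken from the talk: the `Scenario` fields, the `toy_agent` stand-in, the phrase-matching check, and the `min_pass_rate` threshold. A real suite would drive the actual agent and score far richer dimensions such as empathy or brand compliance.

```python
from dataclasses import dataclass

@dataclass
class Scenario:
    """One simulated user conversation (hypothetical structure)."""
    name: str
    user_turns: list[str]
    must_mention: str  # phrase the agent's final reply should contain

def toy_agent(message: str) -> str:
    """Placeholder for the agent under test; a real suite would call your agent."""
    text = message.lower()
    if "refund" in text or "reembolso" in text:
        return "I can help with your refund."
    return "Could you tell me more?"

def run_suite(agent, scenarios, min_pass_rate=0.9):
    """Replay every scenario and gate on the fraction that pass."""
    results = {}
    for scenario in scenarios:
        reply = ""
        for turn in scenario.user_turns:
            reply = agent(turn)  # only the final reply is checked here
        results[scenario.name] = scenario.must_mention in reply.lower()
    pass_rate = sum(results.values()) / len(results)
    return pass_rate >= min_pass_rate, pass_rate, results

# Scenarios that go beyond a single scripted happy path.
scenarios = [
    Scenario("english_multi_turn", ["Hi", "I want a refund for my order"], "refund"),
    Scenario("spanish_request", ["Quiero un reembolso por favor"], "refund"),
    Scenario("angry_caps", ["WHERE IS MY REFUND???"], "refund"),
]

ok, pass_rate, results = run_suite(toy_agent, scenarios)

# An agent that regressed fails the same gate.
broken_ok, broken_rate, _ = run_suite(lambda m: "Could you tell me more?", scenarios)
```

In a pipeline, the build step would exit non-zero whenever the first returned value is `False`, so a change that breaks any of these scenarios is caught before release.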

Whether you’re building chat, voice, or multi-modal agents, you’ll walk away with actionable strategies for incorporating simulations into your workflow—improving reliability, reducing surprises in production, and enabling your agent to behave as thoughtfully and consistently as a human teammate.

From 4:35 PM to 4:45 PM GMT
Tags: Break
Musical Intermission

Live improvised music with lyrical suggestions from the chat

From 4:45 PM to 5:10 PM GMT
Tags: Presentation
Building Alfred, the Orchestration Layer for Agentic Commerce at Loblaws

Developing AI agents for shopping is just the first step; the real challenge is reliably running them in production across complex, mission-critical e-commerce systems—a significant MLOps hurdle.

In this talk, we'll introduce Alfred, our agentic orchestration layer. Built with tools like LangGraph, Langfuse, LiteLLM, and Google Cloud components, Alfred is the critical piece that coordinates LLMs with our entire e-commerce backend, from search and recommendations to cart management. It handles the complete execution graph, secured tool calling, and prompt workflow.

We’ll share our journey in designing a reusable agent architecture that scales across all our digital properties. We’ll discuss the specifics of our tech stack and productionization methodology, including how we leveraged the MCP framework and our existing platform APIs to accelerate development of Alfred.

From 5:15 PM to 5:40 PM GMT
Tags: Presentation
From Chat Fatigue to Instant Action: Transforming Dealer Engagement Through Intelligent UI

This presentation discusses the evolution of AI agent interaction, focusing on transitioning from low-engagement text-based chat to more intuitive, GUI-driven experiences. It outlines critical challenges in creating an intuitive and impactful experience for busy dealers, proposing solutions that include quick actions, efficient data streaming, and agent interactivity to create a great user experience.

From 5:45 PM to 6:10 PM GMT
Tags: Presentation
From Zero to AILO: Lessons learned from building iFood's AI agent

From 6:15 PM to 6:40 PM GMT
Tags: Presentation
Architecting Trust: Multi-Agent Systems for the Misinformation Lifecycle

The rapid spread of digital misinformation requires solutions that address the entire lifecycle and move beyond single-LLM limitations. This talk, based on the author’s ICWSM research paper, offers a practitioner's guide to a novel five-agent system (Classifier, Indexer, Extractor, Corrector, and Verification) designed for maximum scalability, modularity, and explainability. The system aims to automate the work of fact-checkers, traditionally done by teams of experts, saving millions and increasing efficiency while keeping a human in the loop. We will walk through each specialized agent, covering crucial elements like model sizing and fine-tuning: for example, using small, fine-tuned encoder models for the Classifier's high-confidence multi-class labeling versus a strong reasoning LLM for the Corrector Agent. Topics include building an efficient Indexer Agent, retrieval and reranking that combine keyword search with vector embeddings, enabling the Corrector Agent to use external search APIs for cross-validation, and the role of the Verification Agent as the final quality check for high precision. The talk concludes by covering agent coordination protocols, cost, holistic evaluation, offline evaluation, online A/B testing, and post-deployment metrics.
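
The abstract does not say how the keyword and vector signals are combined. One common option is reciprocal rank fusion (RRF), sketched below in Python purely as an illustration. The function name, the document IDs, and the constant `k=60` are assumptions made for this example, not details from the talk or the paper.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several best-first lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in.
    k=60 is a commonly used default, not a value from the talk.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))

# Hypothetical results for one claim from two separate retrievers.
keyword_hits = ["d2", "d1", "d3"]  # e.g. from a keyword index
vector_hits = ["d1", "d3", "d2"]   # e.g. from embedding similarity

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)
```

Because RRF uses only rank positions, it avoids having to normalize keyword scores and embedding similarities onto a shared scale. A separate reranking model could then reorder the top of the fused list.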

From 6:45 PM to 6:55 PM GMT
Tags: Break
Trivia Challenge - Airpods Giveaway

Think you know AI? Put your skills to the test as one lucky winner will be walking away with some new headgear.

From 6:55 PM to 7:20 PM GMT
Tags: Presentation
When AI Agents Argue: Structured Dissent Patterns for Production Reliability

Single-agent LLM systems fail silently in production - they're confidently wrong at scale with no mechanism for self-correction. We've deployed a multi-agent orchestration pattern called "structured dissent" where believer, skeptic, and neutral agents debate decisions before consensus. This isn't theoretical - we'll show production deployment patterns, cost/performance tradeoffs, and measurable reliability improvements. You'll learn when multi-agent architectures justify the overhead, how to orchestrate adversarial agents effectively, and operational patterns for monitoring agent reasoning quality in production.

Our first deployment of the debate swarm revolves around MCP servers - we use a security swarm specially built for MCP servers to analyze findings from open source security tools. This provides more nuanced reasoning and gives a confidence score to evaluate the security of unknown MCP tools.
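
To make the believer, skeptic, and neutral roles concrete, here is a minimal single-round Python sketch. It is an assumption-laden illustration, not the speakers' implementation: the `Opinion` structure, the stubbed agents (which stand in for LLM calls), the spread-based confidence score, and the `max_spread` escalation threshold are all hypothetical.

```python
from dataclasses import dataclass
from statistics import mean

@dataclass
class Opinion:
    role: str
    risk: float      # 0.0 = looks safe, 1.0 = looks dangerous
    rationale: str

# Stub agents: in a real system each would be an LLM call with its own prompt.
def believer(finding: str) -> Opinion:
    return Opinion("believer", 0.25, "Pattern is common in legitimate tools.")

def skeptic(finding: str) -> Opinion:
    return Opinion("skeptic", 0.75, "Tool could read data it does not need.")

def neutral(finding: str) -> Opinion:
    return Opinion("neutral", 0.5, "Evidence is inconclusive either way.")

def debate(finding: str, agents, max_spread: float = 0.6) -> dict:
    """Collect one opinion per role and summarize agreement (hypothetical scheme)."""
    opinions = [agent(finding) for agent in agents]
    risks = [o.risk for o in opinions]
    spread = max(risks) - min(risks)
    return {
        "risk": mean(risks),          # consensus estimate
        "confidence": 1.0 - spread,   # wide disagreement means low confidence
        "escalate": spread > max_spread,  # hand off to a human reviewer
        "opinions": opinions,
    }

# Hypothetical finding text, as a security scanner might report it.
result = debate("Scanner flagged broad file-system access", [believer, skeptic, neutral])
print(result["risk"], result["confidence"], result["escalate"])
```

A fuller version would run several rounds in which agents respond to each other's rationales. This sketch covers only the final aggregation step.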

From 7:25 PM to 7:50 PM GMT
Tags: Panel Discussion
Hardening Agents for E-commerce Scale: From RL Alignment to Reliability

The discussion centers on highly technical yet practical themes, such as the use of advanced post-training techniques like Direct Preference Optimization (DPO) and Parameter-Efficient Fine-Tuning (PEFT) to ensure LLMs maintain stability while specializing for e-commerce domains.

We compare the implementation challenges of Computer-Using Agents in automating legacy enterprise systems versus the stability issues faced by conversational agents when inputs become unpredictable in production.

We will analyze the role of cloud infrastructure in supporting the continuous, iterative training loops required by Reinforcement Learning-based agents for e-commerce!

From 7:55 PM to 8:20 PM GMT
Tags: Presentation
Keynote: Coding with AI

This talk covers an overview of AI coding tools and different levels of coding automation. It also discusses workflow patterns that have emerged and how they will change over time.

Event has finished
November 18, 1:45 PM GMT
Online
Organized by
MLOps Community
Prosus