LIVESTREAM

1:45 PM - 8:30 PM GMT

November 18, 2025

Agents in Production - MLOps x Prosus

# Prosus AI

The Virtual AI Event That’s Actually... Fun

We know what you’re thinking: another virtual conference. Talking heads, awkward silences, and the constant urge to check your email… we’ve all been there.

Not this one.

100% AI Agents in Prod. BUT This Isn't Just Another Zoom Link.

Welcome to Agents in Production: the latest edition of the MLOps × Prosus AI Virtual Conference.

We’re bringing together the brightest minds building AI agents, with a high-energy format designed to keep you hooked from start to finish.

Last year, companies stopped experimenting with agents and started deploying them in the real world. We heard from the pioneers who turned hype into working systems - and this year, we’re doubling down.

Expect real progress, real lessons, and real breakthroughs shaping the future of agentic AI. You’ll get cutting-edge, actionable insights that move you from experimentation to full-scale deployment.

30+ Talks on AI Agents. I Promise You Won’t Log Off Early

Why Attend:

Talks from top experts – Real-world lessons, practical insights, and breakthroughs defining agentic AI.
Hilarious skits & live music – Because learning should be fun.
High-energy engagement – Interactive moments that make you part of the action.

If You Miss This, You’ll Miss:

Hard-won lessons – How leading companies are successfully deploying agents at scale.
Deep dives – Technical sessions and workshops from the voices shaping the next generation of AI.
Global connections – Network with innovators and practitioners across the ML community.

This is your chance to get up to speed on the global AI scene, connect with innovators, and experience a virtual event you’ll actually enjoy.

See you there!

Speakers

Chip Huyen

Researcher @ Tep Studio

Aditya Gautam

Machine Learning Technical Lead @ Meta

Teodora Musatoiu

Solutions Architect @ OpenAI

Adel El Hallak

Senior Director Of Product @ NVIDIA

Panos Stravopodis

Co-Founder & CTO @ Elyos

Jiquan Ngiam

CEO and Co-Founder @ MintMCP

Chenyu Zhang

Founder @ GlowingStar Inc.

Rekha Singhal

Head Research @ Tata Consultancy Services

Santoshkalyan Rayadhurgam

Engineering Leader @ Meta

Swati Bhatia

Product Manager @ Google

Arushi Jain

Senior Applied Scientist @ Microsoft

Donné Stevenson

Machine Learning Engineer @ Prosus Group

Mefta Sadat

Staff Software Engineer @ Loblaw Digital

Sam Partee

Co-Founder @ Arcade.dev

Sachi Shah

Product Manager @ Sierra

Jasleen Singh

Staff Solutions Architect, Generative AI @ Google

Sanjana Sharma

AI Strategist @ Distyl AI

Artem Yushkovskiy

Sr ML Engineer @ Delivery Hero SE

Rosemary Nwosu-Ihueze

Founder @ Soteria

Euro Beinat

Global Head AI and Data Science @ Prosus Group

Washington Amolo

Product Developer @ NaviSmart AI

Benjamin Guo

Co Founder @ Zo Computer

Hamed Taheri

CEO & Founder @ Personize.ai

Phil Stafford

Principal Consultant, Cybersecurity & AI @ Singularity Systems

Quinten Rosseel

AI Engineer @ Wobby

Dirk Petzoldt

Co-Founder @ Explai.com

Tom Kaltofen

Engineer @ mloda

Vitor Balocco

Co-founder @ Runlayer

Frank Wittkampf

VP Applied AI @ Databook

Laurel Orr

AI Staff Software Engineer @ Stacklok

Benjamin Hindman

Founder & CEO @ Reboot

Ben Epstein

Co-Founder & CTO @ GrottoAI

Matt Sharp

AI Strategist and Principle Engineer @ Flexion

Audi Liu

Senior Product Manager @ Inworld AI

Olga Pavlov

Head of Product @ OLX Group

Isabella Piratininga

Director of Technology & Innovation @ iFood

Paul van der Boor

Senior Director Data Science @ Prosus Group

Demetrios Brinkmann

Chief Happiness Engineer @ MLOps Community

Ricky Doar

VP of Solutions @ Cursor

Nishikant Dhanuka

Senior Director of AI @ Prosus Group

Chiara Caratelli

Data Scientist @ Prosus Group

Simba Khadder

Sr. Manager & Software Engineer @ Redis

Simba Khadder

Founder & CEO @ Featureform

Agenda

Stage 1 - AI & E-Commerce

Stage 2 - AI Workforce

Stage 3 - MCP & Protocols

2:00 PM

2:05 PM

GMT

1:1 networking

Doors Open

2:05 PM

2:30 PM

GMT

Opening / Closing

Using Agents in Production: Past Present and Future

Prosus has shipped over 7949 agents. 15% have worked. The rest have been learning experiences. Let's talk about what we have learned, and where we see things going.

+ Read More

2:35 PM

3:00 PM

GMT

Fireside Chat

Unlocking Enterprise Value: Proven Strategies for Successful Adoption of AI Developer Tools

In this session, Ricky Doar, VP of Solutions at Cursor, shares actionable insights from leading large-scale AI developer tool implementations at the world’s top enterprises. Drawing on field experience with organizations at the forefront of transformation, Ricky highlights key best practices, observed power-user patterns, and deployment strategies that maximize value and ensure smooth rollout. Learn what distinguishes high-performing teams, how tailored onboarding accelerates adoption, and which support resources matter most for driving enterprise-wide success.

+ Read More

3:05 PM

3:30 PM

GMT

Presentation

Shipping AI at Scale: A look back, current Enterprise patterns, and what’s next

2025 has truly been and still is the year of agents. We’ll take a pragmatic look at what's happened and what we learned along the way. Then we'll map the next wave. We’ll distill product patterns behind the wins and preview what’s coming next: from capable agents, to out-of-the-box AI building blocks, and how models like GPT-5 simplify shipping novel, long tail user experiences.

+ Read More

3:35 PM

3:50 PM

GMT

Presentation

When Agents Learn to Feel: Multi-Modal Affective Computing in Production

The next generation of AI agents won’t just respond to what we say—they will sense how we feel. As large language model–powered agents move from research prototypes into production, a critical frontier is the integration of multi-modal affective computing: combining voice, text, facial expressions, and interaction patterns to detect the learner’s or user’s emotional state in real time.

This talk explores the challenges and opportunities of deploying emotion-aware AI tutors in production environments. Drawing from ongoing research at MIT Media Lab and Harvard, and from startup experience building GlowingStar, I will share how multi-modal signals—speech tone, facial micro-expressions, response latency, and even silence—can be fused into affective state estimates that meaningfully improve user experience.

We will unpack the technical lessons learned from moving affective sensing beyond the lab: designing architectures that combine ensemble LLMs with sensor inputs, diagnosing when modalities conflict or sabotage each other, and establishing guardrails for privacy and consent in sensitive domains like education. In parallel, I will highlight multi-agent orchestration patterns—including critic–rewriter loops and role-based ensembles—that make it possible to personalize instruction, generate equitable feedback, and sustain engagement across diverse learners.

By the end of this session, attendees will have a clear picture of what it takes to move multi-modal, affect-sensing agents from demos to durable production systems: the architectures, the pitfalls, and the metrics that matter. More importantly, we will consider how these lessons extend beyond education to any industry where AI agents must not only think, but also feel with and for the human in the loop.

+ Read More

3:50 PM

4:05 PM

GMT

Presentation

Multi-Agent Personalization with Shared Memory: From Email to Website to Proposal

Personalization at scale needs deep understanding of each customer. You must collect data from many sources, read it, reason and infer, plan, decide, act, and write to each person. One agent doing everything gave us poor and inconsistent quality. Multi-agent systems changed that. They deliver mass personalization. They also break in edge cases, contradict each other, and are hard to debug.

I will share how we addressed this with Cortex UCM, a unified customer memory, and Generative Tables. We map noisy data into a clean, structured layer that agents read and write. We began with email for both outbound and inbound communication. Then we personalized websites and product pages for e-commerce at scale. I share customer stories. For example, one customer had over 60,000 product pages that required customization for thousands of communities and product offerings.

I will present our decentralized shared-memory orchestration briefly and how it stays transparent and debuggable. It opens safe paths for external agents. What failed. What worked. What we are building next.

+ Read More

4:05 PM

4:35 PM

GMT

Presentation

Simulate to Scale: How realistic simulations power reliable agents in production

In this session, we’ll explore how developing and deploying AI-driven agents demands a fundamentally new testing paradigm—and how scalable simulations deliver the reliability, safety and human-feel that production-grade agents require.

You’ll learn how simulations allow you to:

Mirror messy real-world user behavior (multiple languages, emotional states, background noise) rather than scripting narrow “happy-path” dialogues.
Model full conversation stacks including voice: turn-taking, background noise, accents, and latency – not just text messages.
Embed automated simulation suites into your CI/CD pipeline so that every change to your agent is validated before going live.
Assess multiple dimensions of agent performance—goal completion, brand-compliance, empathy, edge-case handling—and continuously guard against regressions.
Scale from “works in demo” to “works for every customer scenario” and maintain quality as your agent grows in tasks, languages or domains.

Whether you’re building chat, voice, or multi-modal agents, you’ll walk away with actionable strategies for incorporating simulations into your workflow—improving reliability, reducing surprises in production, and enabling your agent to behave as thoughtfully and consistently as a human teammate.

+ Read More

4:35 PM

4:45 PM

GMT

Break

Musical Intermission

Live improvised music with lyrical suggestions from the chat

+ Read More

4:45 PM

5:10 PM

GMT

Presentation

Building Alfred, the Orchestration Layer for Agentic Commerce at Loblaws

Developing AI agents for shopping is just the first step; the real challenge is reliably running them in production across complex, mission-critical e-commerce systems—a significant MLOps hurdle.

In this talk, we'll talk about Alfred, our agentic orchestration layer. Built with tools like Langgraph, LangFuse, LiteLLM, and Google Cloud components, Alfred is the critical piece that coordinates LLMs with our entire e-commerce backend—from search and recommendations to cart management. It handles the complete execution graph, secured tool calling, and prompt workflow.

We’ll share our journey in designing a reusable agent architecture that scales across all our digital properties. We’ll discuss the specifics of our tech stack and productionization methodology, including how we leveraged the MCP framework and our existing platform APIs to accelerate development of Alfred.

+ Read More

5:15 PM

5:40 PM

GMT

Presentation

From Chat Fatigue to Instant Action: Transforming Dealer Engagement Through Intelligent UI

This presentation discusses the evolution of AI agent interaction, focusing on transitioning from low-engagement text-based chat to more intuitive, GUI-driven experiences. It outlines critical challenges in creating an intuitive and impactful experience for busy dealers, proposing solutions that include quick actions, efficient data streaming, and agent interactivity to create a great user experience.

+ Read More

5:45 PM

6:10 PM

GMT

Presentation

From Zero to AILO: Lessons learned from building iFood's AI agent

In this session, we share the development journey of Ailo, iFood's conversational AI agent. We cover the practical challenges and victories of navigating from concept to production, highlighting how robust MLOps practices and the integration of our proprietary Large Commerce Model (LCM) enable us to interpret complex intents and create the best personalization experience for our users.

+ Read More

6:15 PM

6:40 PM

GMT

Presentation

Architecting Trust: Multi-Agent Systems for the Misinformation Lifecycle

The rapid spread of digital misinformation requires solutions that address the entire lifecycle, moving beyond single-LLM limitations. This talk, based on the author’s ICWSM research paper, offers a practitioner's guide to a novel, five-agent system—Classifier, Indexer, Extractor, Corrector, and Verification—designed for maximum scalability, modularity, and explainability. This paper aims at automating the working of fact-checkers, which is traditionally done through a team of experts, saving millions and increasing efficiency with a human-in-the-loop system. We will get the details for each specialized agent, detailing crucial elements like model sizing and fine-tuning—for example, matching small, fine-tuned encoder models for the Classifier's high-confidence multi-class labeling against the need for a strong reasoning LLM in the Corrector Agent. Topics include building an efficient Indexer Agent and reranking with retrieval through hybrid keyword and vector embeddings, enabling the Corrector Agent to use external search APIs for cross-validation, and the function of the Verification Agent as the final quality check for high precision.. The talk concludes by covering agent coordination protocols, cost, holistic evaluation, offline evaluation and online A/B testing and post-deployment metrics.

+ Read More

6:45 PM

6:55 PM

GMT

Break

Trivia Challenge - Airpods Giveaway

Think you know AI? Put your skills to the test as one lucky winner will be walking away with some new headgear.

+ Read More

6:55 PM

7:20 PM

GMT

Presentation

When AI Agents Argue: Structured Dissent Patterns for Production Reliability

Single-agent LLM systems fail silently in production - they're confidently wrong at scale with no mechanism for self-correction. We've deployed a multi-agent orchestration pattern called ""structured dissent"" where believer, skeptic, and neutral agents debate decisions before consensus. This isn't theoretical - we'll show production deployment patterns, cost/performance tradeoffs, and measurable reliability improvements. You'll learn when multi-agent architectures justify the overhead, how to orchestrate adversarial agents effectively, and operational patterns for monitoring agent reasoning quality in production.

Our first deployment of the debate swarm revolves around MCP servers - we use a security swarm specially built for MCP servers to analyze findings from open source security tools. This provides more nuanced reasoning and gives a confidence score to evaluate the security of unknown MCP tools.

+ Read More

7:25 PM

7:50 PM

GMT

Panel Discussion

Hardening Agents for E-commerce Scale: From RL Alignment to Reliability

The discussion centers on highly technical yet practical themes, such as the use of advanced post-training techniques like Direct Preference Optimization (DPO) and Parameter-Efficient Fine-Tuning (PEFT) to ensure LLMs maintain stability while specializing for e-commerce domains.

We compare the implementation challenges of Computer-Using Agents in automating legacy enterprise systems versus the stability issues faced by conversational agents when inputs become unpredictable in production.

We will analyze the role of cloud infrastructure in supporting the continuous, iterative training loops required by Reinforcement Learning-based agents for e-commerce!

+ Read More

7:55 PM

8:20 PM

GMT

Presentation

Keynote: Coding with AI

This talk covers an overview of AI coding tools and different levels of coding automation. It also discusses workflow patterns that have emerged and how they will change over time.

+ Read More

Event has finished

1:45 PM - 8:30 PM GMT

November 18, 2025

Online

Organized by

MLOps Community

Prosus

Event has finished

1:45 PM - 8:30 PM GMT

November 18, 2025

Online

Organized by

MLOps Community

Prosus