MLOps Community
+00:00 GMT

Collections

All Collections
See all
MLOps Reading Group
8 Items

All Content

Popular topics
# LLMs
# LLM in Production
# AI
# LLM
# Machine Learning
# Rungalileo.io
# MLops
# MLOps
# RAG
# Interview
# Machine learning
# Generative AI
# Tecton.ai
# Arize.com
# AI Agents
# mckinsey.com/quantumblack
# Redis.io
# Zilliz.com
# Humanloop.com
# Snorkel.ai
All
Kostas Pardalis
Yoni Michael
Demetrios Brinkmann
Kostas Pardalis, Yoni Michael & Demetrios Brinkmann · Jun 27th, 2025
ML Engineers Who Ignore LLMs Are Voluntarily Retiring Early
LLMs are reshaping the future of data and AI—and ignoring them might just be career malpractice. Yoni Michael and Kostas Pardalis unpack what’s breaking, what’s emerging, and why inference is becoming the new heartbeat of the data pipeline.
# LLM
# AI infrastructure
# Typedef
1:37:23
Greg Kamradt
Demetrios Brinkmann
Greg Kamradt & Demetrios Brinkmann · Jun 24th, 2025
What makes a good AI benchmark? Greg Kamradt joins Demetrios to break it down—from human-easy, AI-hard puzzles to wild new games that test how fast models can truly learn. They talk hidden datasets, compute tradeoffs, and why benchmarks might be our best bet for tracking progress toward AGI. It’s nerdy, strategic, and surprisingly philosophical.
# AI Benchmark
# ARC AGI
# Data Independent
48:31
Deepti Srivastava
Demetrios Brinkmann
Deepti Srivastava & Demetrios Brinkmann · Jun 20th, 2025
I’m sure the MLOps community is probably aware – it's tough to make AI work in enterprises for many reasons, from data silos, data privacy and security concerns, to going from POCs to production applications. But one of the biggest challenges facing businesses today, that I particularly care about, is how to unlock the true potential of AI by leveraging a company’s operational business data. At Snow Leopard, we aim to bridge the gap between AI systems and critical business data that is locked away in databases, data warehouses, and other API-based systems, so enterprises can use live business data from any data source – whether it's database, warehouse, or APIs – in real time and on demand, natively. In this interview, I'd like to cover Snow Leopard’s intelligent data retrieval approach that can leverage business data directly and on-demand to make AI work.
# AI and Business Data
# LLM
# Snow Leopard AI
57:14
Sebastián Ramírez
Demetrios Brinkmann
Sebastián Ramírez & Demetrios Brinkmann · Jun 17th, 2025
The creator of FastAPI is back with a new chapter—FastAPI Cloud. From building one of the most loved dev tools to launching a company, Sebastián Ramírez shares how open source, developer experience, and a dash of humor are shaping the future of APIs.
# FastAPI
# FastAPI Cloud
# FastAPI Labs
1:09:38
Shreya Shankar
Willem Pienaar
Demetrios Brinkmann
Shreya Shankar, Willem Pienaar & Demetrios Brinkmann · Jun 13th, 2025
Willem Pienaar and Shreya Shankar discuss the challenge of evaluating agents in production where "ground truth" is ambiguous and subjective user feedback isn't enough to improve performance. The discussion breaks down the three "gulfs" of human-AI interaction—Specification, Generalization, and Comprehension—and their impact on agent success. Willem and Shreya cover the necessity of moving the human "out of the loop" for feedback, creating faster learning cycles through implicit signals rather than direct, manual review. The conversation details practical evaluation techniques, including analyzing task failures with heat maps and the trade-offs of using simulated environments for testing. Willem and Shreya address the reality of a "performance ceiling" for AI and the importance of categorizing problems your agent can, can learn to, or will likely never be able to solve.
# Production failure
# AI system
# Observability
47:03
Jukka Remes
Demetrios Brinkmann
Jukka Remes & Demetrios Brinkmann · Jun 10th, 2025
AI is already complex—adding the need for deep engineering expertise to use MLOps tools only makes it harder, especially for SMEs and research teams with limited resources. Yet, good MLOps is essential for managing experiments, sharing GPU compute, tracking models, and meeting AI regulations. While cloud providers offer MLOps tools, many organizations need flexible, open-source setups that work anywhere—from laptops to supercomputers. Shared setups can boost collaboration, productivity, and compute efficiency. In this session, Jukka introduces an open-source MLOps platform from Silo AI, now packaged for easy deployment across environments. With Git-based workflows and CI/CD automation, users can focus on building models while the platform handles the MLOps.
# Open Source Platforms
# AI Act Regulation
# Haaga-Helia UAS
55:31
Michael Del Balso
Demetrios Brinkmann
Michael Del Balso & Demetrios Brinkmann · Jun 6th, 2025
Tecton Founder and CEO Mike Del Balso talks about what ML/AI use cases are core components generating Millions in revenue. Demetrios and Mike go through the maturity curve that predictive Machine Learning use cases have gone through over the past 5 years, and why a feature store is a primary component of an ML stack.
# AI Adoption
# LLMs
# Tecton
48:43
Prateek Chhikara
Arthur Coleman
Nehil Jain
+1
Prateek Chhikara, Arthur Coleman, Nehil Jain & 1 more speaker · Jun 5th, 2025
Memory is one of the thorniest challenges in deploying LLMs. Mem0 introduces a scalable long-term memory architecture that dynamically extracts, consolidates, and retrieves key information from conversations. By using graph-based structures to model relationships between conversational elements, Mem0 enables AI agents that are more accurate, coherent, and production-ready.
# MEM0
# LLMs
# OpenAI memory
58:25
Raza Habib
Demetrios Brinkmann
Raza Habib & Demetrios Brinkmann · Jun 3rd, 2025
Raza Habib, the CEO of LLM Eval platform Humanloop, talks to us about how to make your AI products more accurate and reliable by shortening the feedback loop of your evals. Quickly iterating on prompts and testing what works, along with some of his favorite Dario from Anthropic AI Quotes.
# Generative AI
# LLMs
# Humanloop
53:07
An analysis of the May 2025 incident where xAI's Grok chatbot began inappropriately referencing 'white genocide' in South Africa. This post-mortem delves into the probable cause—a flawed post-processing prompt—framing it as a critical MLOps failure. It underscores the necessity of treating prompts as key artifacts, implementing progressive deployment strategies, and using appropriate metrics for AI safety and reliability.
# LLMs
# Prompt Deployment
# White Genocide
Popular