MLOps Community

Home

MLOps Community

The MLOps Community is where machine learning practitioners come together to define and implement MLOps. Our global community is the default hub for MLOps practitioners to meet other MLOps industry professionals, share their real-world experience and challenges, learn skills and best practices, and collaborate on projects and employment opportunities. We are the world's largest community dedicated to addressing the unique technical and operational challenges of production machine learning systems.

Events

4:00 PM - 5:00 PM, Feb 26 GMT
MLOps Reading Group February: Advancing Open-source World Models
user's Avatar
user's Avatar
user's Avatar
Learn More
8:00 PM - 9:00 PM, Feb 27 GMT
Coding Agents Lunch & Learn, Session 3: Working on an AI-First Team
8:00 PM - 9:00 PM, Feb 20 GMT
Coding Agents Lunch & Learn s.2

Content

Video
In today’s era of massive generative models, it's important to understand the full scope of AI systems' performance engineering. This talk discusses the new O'Reilly book, AI Systems Performance Engineering, and the accompanying GitHub repo (https://github.com/cfregly/ai-performance-engineering). This talk provides engineers, researchers, and developers with a set of actionable optimization strategies. You'll learn techniques to co-design and co-optimize hardware, software, and algorithms to build resilient, scalable, and cost-effective AI systems for both training and inference.
Feb 24th, 2026 | Views 18
Blog
A hands-on beginner roadmap for learning Kubernetes, designed to walk you through core concepts (like clusters, pods, services, deployments, storage, RBAC, autoscaling, etc.) with simple explanations, CLI examples, and practical exercises. By following it you build real experience and are prepared to use Kubernetes locally or on cloud platforms like GKE or EKS.
Feb 24th, 2026 | Views 15
Video
Experimenting with LLMs is easy. Running them reliably and cost-effectively in production is where things break. Most AI teams never make it past demos and proofs of concept. A smaller group is pushing real workloads to production—and running into very real challenges around infrastructure efficiency, runaway cloud costs, and reliability at scale. This session is for engineers and platform teams moving beyond experimentation and building AI systems that actually hold up in production.
Feb 19th, 2026 | Views 48
Code of Conduct