MLOps Community

Home

MLOps Community

The MLOps Community is where machine learning practitioners come together to define and implement MLOps. Our global community is the default hub for MLOps practitioners to meet other MLOps industry professionals, share their real-world experience and challenges, learn skills and best practices, and collaborate on projects and employment opportunities. We are the world's largest community dedicated to addressing the unique technical and operational challenges of production machine learning systems.

Events

4:00 PM - 5:00 PM, Jun 25 GMT
The Current State of Agentic Retrieval
user's Avatar
user's Avatar
user's Avatar
user's Avatar
user's Avatar
Learn More
4:00 PM - 5:00 PM, Jun 26 GMT
Coding Agents Lunch & Learn Session 16: Real-World Coding Agent Workflows

Content

Blog
Managing subagent personas across multiple AI coding platforms (like Claude Code, Gemini CLI, and Copilot) leads to fragmented configuration files and instruction drift. This article introduces `supagents`, a lightweight, idempotent Python CLI tool that lets developers maintain subagent definitions in a single markdown file with multi-target frontmatter. The tool automatically compiles and distributes these instructions to the correct paths, ensuring a consistent persona across all development environments.
Jun 23rd, 2026 | Views 11
Video
Shahram Anver is the Co-Founder and CEO of Cleric, the autonomous AI SRE that investigates and root-causes production issues like an experienced teammate — often in under two minutes. Before Cleric, Shahram led MLOps, DevOps, and FinOps platform engineering at Gojek, Southeast Asia's super-app. In this conversation, he breaks down why production operations never kept pace with AI-accelerated development, and why the real unlock for an AI SRE isn't faster triage — it's an agent that *learns* and compounds operational memory across your whole org.
Jun 19th, 2026 | Views 24
Video
Retrieval-Augmented Generation and agentic AI are increasingly common in enterprise deployments, but real enterprise environments introduce challenges largely absent from academic treatments and consumer-facing APIs: multiple tenants with heterogeneous data, strict access-control requirements, regulatory compliance, and cost pressures that demand shared infrastructure. This paper identifies a fundamental problem underlying existing RAG architectures in these settings. Retrieval systems rank documents by relevance, not by authorization, so a query from one tenant can surface another tenant’s confidential data simply because it scores highest. The authors formalize this relevance-authorization gap alongside related shortcomings (tool-mediated disclosure, context accumulation across turns, client-side orchestration bypass) and introduce a layered isolation architecture combining policy-aware ingestion, retrieval-time gating, and shared inference, enforced through server-side orchestration. They validate it through an open-source implementation in OGX, a vendor-neutral OpenAI-compatible Responses API, showing empirically that ABAC gating eliminates cross-tenant leakage while introducing negligible overhead.
Jun 17th, 2026 | Views 32
Code of Conduct
Your Privacy Choices