MLOps Community
Coding Agents Lunch & Learn: Skill Building Workshop (From Idea to Evaluation)

After several sessions of Coding Agents Lunch & Learn, we’re shifting gears into something more hands-on.

In this interactive workshop, we’ll go beyond discussion and actually build. Together, we’ll walk through how to create practical AI “skills”, from defining the problem and structuring the approach, to testing, evaluating, and improving them in real time.

A big theme we’ll touch on is First Principles Thinking, not just as a concept, but as something we can attempt to structure into usable, testable workflows. We’ll explore why some skills fail, how to scope them effectively, and how to introduce lightweight observability and evaluation so we can measure what’s actually working.

By the end of the session, we’ll have:

  1. Built a couple of simple, purpose-driven skills
  2. Defined evaluation criteria and tested them live
  3. Collected quick feedback signals (success rates, gaps, edge cases)
  4. Identified ways to iterate and improve

This session is designed to be interactive, experimental, and a bit messy, in the best way. If you’ve been thinking about building reusable AI workflows or want to better understand how to evaluate them, this workshop is for you.


Speakers

Leo Walker
AI Engineer @ KaiCare.ai
Rahul Parundekar
Founder @ A.I. Hero, inc.
Demetrios Brinkmann
Chief Happiness Engineer @ MLOps Community

Agenda

From 4:00 PM to 4:05 PM GMT
Tags: Opening / Closing
Kickoff & Context

Quick intro

From 4:05 PM to 4:10 PM GMT
Tags: Keynote
Framing the Problem

  1. Why most skills fail (scope, structure, lack of evaluation)
  2. Intro to First Principles Thinking in this context

From 4:10 PM to 4:35 PM GMT
Tags: Breakout Session
Live Skill Build

  1. Define a simple problem
  2. Break it down (first principles approach)
  3. Draft a scoped, purpose-built skill
  4. Refine structure
  5. Create supporting artifacts (instructions, steps, outputs)
  6. Prepare it for testing

From 4:35 PM to 4:45 PM GMT
Tags: Presentation
Evaluation & Observability Setup

  1. Define success criteria
  2. Quick scoring / grading system
  3. How to capture feedback (lightweight “skillbench” style)
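
To make the scoring idea concrete before the session, here is a minimal sketch of what a quick grading system could look like. It is not the format we will use in the workshop: the `Criterion` class, the `score_run` and `success_rate` helpers, the criterion names, and the 0.75 threshold are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Criterion:
    """One success criterion for a skill (names and weights are illustrative)."""
    name: str
    weight: float  # relative importance of this criterion


def score_run(checks: dict[str, bool], criteria: list[Criterion]) -> float:
    """Return the weighted fraction of criteria that passed, between 0 and 1."""
    total = sum(c.weight for c in criteria)
    if total == 0:
        return 0.0
    # A criterion missing from `checks` counts as failed.
    passed = sum(c.weight for c in criteria if checks.get(c.name, False))
    return passed / total


def success_rate(scores: list[float], threshold: float) -> float:
    """Return the share of runs whose score meets the threshold."""
    if not scores:
        return 0.0
    return sum(s >= threshold for s in scores) / len(scores)


# Hypothetical criteria for a single skill.
criteria = [
    Criterion("follows_instructions", 2.0),
    Criterion("output_format_valid", 1.0),
    Criterion("handles_edge_case", 1.0),
]

# One participant's run: two of the three checks passed.
run = {"follows_instructions": True, "output_format_valid": True}
run_score = score_run(run, criteria)

# Scores collected from several runs, summarized as a success rate.
rate = success_rate([run_score, 1.0, 0.5, 1.0], threshold=0.75)
```

Weighting the criteria keeps the grade to a single number per run while still letting one failed check, such as a missed edge case, show up as a gap rather than disappear into an average.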

From 4:45 PM to 4:55 PM GMT
Tags: Presentation
Live Testing + Audience Participation

  1. Participants test the skill (5–10 min runs)
  2. Collect quick feedback / scores
  3. Identify failure points

From 4:55 PM to 5:00 PM GMT
Tags: Opening / Closing
Wrap-up & Next Steps

  1. What worked vs. what didn’t
  2. How to iterate on skills after the session
  3. Preview of next session


Attendees

Bessie (member)
Arlene (member)
Cody (member)
Colleen (member)
Kathryn (member)
Bessie (member)

April 10, 4:00 PM GMT
Online
Organized by MLOps Community