Coding Agents Lunch & Learn: Skill Building Workshop (From Idea to Evaluation)

After several sessions of Coding Agents Lunch & Learn, we’re shifting gears into something more hands-on.

In this interactive workshop, we’ll go beyond discussion and actually build. Together, we’ll walk through how to create practical AI “skills”, from defining the problem and structuring the approach, to testing, evaluating, and improving them in real time.

A big theme we’ll touch on is First Principles Thinking, not just as a concept, but as something we can attempt to structure into usable, testable workflows. We’ll explore why some skills fail, how to scope them effectively, and how to introduce lightweight observability and evaluation so we can measure what’s actually working.

By the end of the session, we’ll have:

Built a couple of simple, purpose-driven skills
Defined evaluation criteria and tested them live
Collected quick feedback signals (success rates, gaps, edge cases)
Identified ways to iterate and improve

This session is designed to be interactive, experimental, and a bit messy, in the best way. If you’ve been thinking about building reusable AI workflows or want to better understand how to evaluate them, this workshop is for you.

Speakers

Leo Walker

AI Engineer @ KaiCare.ai

Rahul Parundekar

Founder @ A.I. Hero, inc.

Demetrios Brinkmann

Chief Happiness Engineer @ MLOps Community

Agenda

4:00 PM

4:05 PM

GMT

Opening / Closing

Kickoff & Context

Quick intro

+ Read More

4:05 PM

4:10 PM

GMT

Keynote

Framing the Problem

Why most skills fail (scope, structure, lack of evaluation) Intro to First Principles Thinking in this context

+ Read More

4:10 PM

4:35 PM

GMT

Breakout Session

Live Skill Build

Define a simple problem Break it down (first principles approach) Draft a scoped, purpose-built skill Refine structure Create supporting artifacts (instructions, steps, outputs) Prepare it for testing

+ Read More

4:35 PM

4:45 PM

GMT

Presentation

Evaluation & Observability Setup

Define success criteria Quick scoring / grading system How to capture feedback (lightweight “skillbench” style)

+ Read More

4:45 PM

4:55 PM

GMT

Presentation