MLOps Community

From Code Completion to Autonomous Software Engineering Agents // Kilian Lieret // Agent Hour

Posted Apr 02, 2025 | Views 58
# Code Completion
# Autonomous Agents

speaker

Kilian Lieret
Research Software Engineer @ Princeton University

I hold a PhD in high-energy physics, where I specialized in calibrating machine learning algorithms to analyze petabytes of particle collision data. During my postdoc at Princeton University, I developed graph neural networks to reconstruct billions of particle trajectories moving at near-light speed. Since last year, I have been part of the Princeton Language and Intelligence Initiative, focusing on agentic AI, in particular for software engineering purposes.


SUMMARY

As language models have advanced, they have moved beyond code completion and are beginning to tackle software engineering tasks in a more autonomous, agentic way. However, evaluating agentic capabilities is challenging. To address this, we first introduce SWE-bench, a benchmark built from real GitHub issues that has become the standard for assessing AI’s ability to resolve complex software tasks in large codebases. We will discuss the current state of the field, the limitations of today’s models, and how far we still are from truly autonomous AI developers. Next, we will explore the fundamentals of agents through SWE-agent, a simple yet powerful agent framework designed for software engineering but adaptable to a variety of domains. By the end of this talk, you will understand the current frontier of agentic AI in software engineering and the challenges ahead, and you will take away tips and tricks for optimizing AI agents for tool use and for iterative problem solving on reasoning-heavy tasks.
