Agents in Production 2024
# Data Analyst
# Toqan
# Prosus
# Agents in Production
From Agents to Product: Bringing Toqan Data Analyst to Hundreds of Users // Ioannes and Donné
In our journey from concept to production, we focused on delivering consistent behaviors to build user trust. This talk will cover the design and refinement of AI systems using agentic frameworks and deterministic components, emphasizing the integration of continuous learning and human oversight.
Ioannis Zempekakis, Donné Stevenson & Demetrios Brinkmann · Nov 20th, 2024
Popular topics
# MLops
# LLMs
# AI
# Machine learning
# Coding Workshop
# Presentation
# Interview
# Monitoring
# Case Study
# LinkedIn
# LLM in Production
# Generative AI
# Machine Learning
# Model Serving
# Analytics
# CPU
# GPU
# Run:AI
# Observability
# Python
Julia Kiseleva & Skylar Payne · Nov 20th, 2024
The rapid development of Large Language Models (LLMs) has led to a surge in applications that facilitate collaboration among multiple agents, assisting humans in their daily tasks. However, a significant gap remains in assessing to what extent LLM-powered applications genuinely enhance user experience and task execution efficiency. This highlights the need to verify utility of LLM-powered applications, particularly by ensuring alignment between the application's functionality and end-user needs. We introduce AgentEval, a novel framework designed to simplify the utility verification process by automatically proposing a set of criteria tailored to the unique purpose of any given application. This allows for a comprehensive assessment, quantifying the utility of an application against the suggested criteria.
# LLMs
# MultiOn
# Agents in Production
Eno Reyes · Nov 20th, 2024
The field of agentic system design is an exciting and quickly developing area with significant implications for software use across various sectors. These systems differ from conventional software by employing decision-making processes that are non-deterministic and often unpredictable. By drawing insights from diverse fields such as robotics, cybernetics, and biology, we can begin to cultivate an understanding of how to construct systems that are more dependable than their individual probabilistic components would suggest.
# Agentic
# AI Systems
# Factory
+1
Paul van der Boor, Meera Clark, George Robson & 1 more speaker · Nov 18th, 2024
In this segment, the Panel will dive into the evolving landscape of AI, where large language models (LLMs) power the next wave of intelligent agents. In this engaging panel, leading investors Meera (Redpoint), George (Sequoia), and Sandeep (Prosus Ventures) discuss the promise and pitfalls of AI in production. From transformative industry applications to the challenges of scalability, costs, and shifting business models, this session unpacks the metrics and insights shaping GenAI's future. Whether you're excited about AI's potential or wary of its complexities, this is a must-watch for anyone exploring the cutting edge of tech investment.
# VC
# Investor
# AI Agents
+2
Diego Oppenheimer, Jazmia Henry, Rogerio Bonatti & 2 more speakers · Nov 15th, 2024
This panel speaks about the diverse landscape of AI agents, focusing on how they integrate voice interfaces, GUIs, and small language models to enhance user experiences. They'll also examine the roles of these agents in various industries, highlighting their impact on productivity, creativity, and user experience and how these empower developers to build better solutions while addressing challenges like ensuring consistent performance and reliability across different modalities when deploying AI agents in production.
# AI agents landscape
# SLM
# Agents in Production
Samuel Partee · Nov 15th, 2024
Why can't my agent perform actions on my behalf? How is tool calling different from retrieval? How do I test my LLM tools? How do I evaluate different model's ability to call tools? These and many more questions were some of the main points we've been figuring out as we've been building Arcade AI. This talk will focus on the approaches we tried and the approaches we found that worked. If you want to or are building an Agent that can call tools, this will be a good talk for you.
# AI tools
# Agents
# ArcadeAI
Prashanth Chandrasekar · Nov 15th, 2024
The internet and its business models are changing. For the last 16 years, Stack Overflow has been at the forefront, helping to shape the future of the web. In this time of disruption comes a necessary time to reflect, change, and break norms. With the expansion of generative AI, many LLM providers are not allowing the world to operate as it has previously. Instead, a new paradigm has emerged: content created by thousands of creators across the Open Web is now being used to train models without respect or attribution for the original creator.Consequently, in the current transformation, human-centered sources of knowledge are obscured but a key component of the AI stack. Companies and organizations lucky enough to host these engines of knowledge production are at a decision point; how do they continue to grow and invest in their communities when the technological landscapes have changed. A change in strategy will allow these companies, AI tools, and the communities and data sets that power them to thrive.
Daniel Jeffries & Adam Becker · Nov 15th, 2024
If you think AGI is just around the corner, try building an agent. Today's frontier models often feel like idiot children, capable of brilliant superhuman replies and at the same time making absurd logical errors, while lacking any common sense. Even worse, those errors pile up, making them unreliable. AI Agents hold the incredible potential to change how we work, learn and play. But any company working in agents quickly realizes they're incredibly hard to build, to make reliable and to generalize. Instead, most teams have shifted to building narrow agents that they can scope to a particular problem with lots of glue code, heuristics and prompt engineering. In this talk we'll take a look at why agents are so challenging to build, what we can do about it and whether just when they'll live up to the hype of doing complex, open ended tasks in the real world.
# Logical Agents
# Kentauros AI
# Agents in Production
Patrick Marlow & Skylar Payne · Nov 15th, 2024
Generative AI Agents represent the current frontier of LLM technology, enabling dynamic interactions and intelligent workflow automation. Building and deploying production-ready agents, however, requires navigating various complexities and understanding key lessons learned in the field. This session distills over two years of practical experience into actionable best practices for successful agent development. We'll dive into techniques like meta-prompting for prompt optimization, establishing robust safety and guardrails, and implementing comprehensive evaluation frameworks. By addressing these critical areas, you'll be well-equipped to build more effective and reliable Generative AI Agents.
Check out Patrick's recently authored whitepaper published on Agents: https://www.kaggle.com/whitepaper-agents
# Generative AI Agents
# Vertex Applied AI
# Agents in Production
Nathan Benaich & Skylar Payne · Nov 15th, 2024
The State of AI Report analyses the most interesting developments in AI. We aim to trigger an informed conversation about AI's current state and its implications for the future.
# AI Report 2024
# Airstreet Capital
# Agents in Production
Willem Pienaar & Adam Becker · Nov 15th, 2024
Cleric is building an AI-powered Site Reliability Engineer capable of autonomously monitoring, diagnosing, and fixing issues in complex distributed systems. This talk will explore how Cleric's AI agent can handle novel situations, learn continuously from its environment, and respond to issues at speeds far beyond human capabilities. It will cover the technical approach behind creating an AI that can understand system architectures, reason about failures, and take appropriate actions without human intervention. The discussion will also examine how autonomous SRE agents could reshape the future of software operations and reliability.
# SRE Agents
# Cleric AI
# Agents in Production
Popular