LIVESTREAM
Boosting LLMs: Performance, Scaling, and Structured Outputs
# LLMs
# Performance and Scalability
# SAS
Join us for an in-depth exploration of advanced techniques to enhance the performance and scalability of large language models (LLMs). This event features three insightful sessions:
Learn how linguistic rule-based models and traditional text analytics improve LLM accuracy and efficiency in real-world applications like public comment analysis, focusing on data curation, fine-tuning, and prompt-tuning.
Discover strategies for scaling LLMs in high-traffic environments, including the limitations of inference, cost optimization, and the trade-offs of implementing guardrails.
Explore BAML, a new programming language that enhances LLM output quality by optimizing algorithms and enabling cost-effective, smaller models to perform at top-tier levels.
Speakers
Ben Epstein
Founding Software Engineer @ Galileo
Tom Sabo
Advisory Solutions Architect @ SAS
Matt Squire
CTO and Co-founder @ Fuzzy Labs
Vaibhav Gupta
CEO @ Boundary ML
Agenda
4:00 PM, GMT
-
4:05 PM, GMT
Opening / Closing
Introduction
4:05 PM, GMT
-
4:25 PM, GMT
Presentation
Bending the Rules: How to Use Information Extraction Models to Improve the Performance of Large Language Models
+ Read More
4:25 PM, GMT
-
4:40 PM, GMT
Presentation
Scaling Large Language Models in Production
+ Read More
4:40 PM, GMT
-
4:55 PM, GMT
Presentation
BAML: Beating OpenAI's Structured Outputs
+ Read More
4:55 PM, GMT
-
5:00 PM, GMT
Opening / Closing
Q&A and Wrap up