Large Language Models have taken the world by storm. But what are the real use cases? What are the challenges in productionizing them?
In this event, you will hear from practitioners about how they are dealing with challenges such as cost optimization, latency requirements, trust in model output, and debugging.
You will also get the opportunity to join workshops that will teach you how to set up your use cases and skip over all the headaches.
In this talk, we explore the thriving ecosystem of tools and technologies emerging around large language models (LLMs) such as GPT-3. As the LLM landscape enters the "Holy $#@!" phase of exponential growth, a surge of developers are building remarkable product experiences on top of these models, giving rise to a rich collection of DevTools. We delve into the current state of LLM DevTools, their significance, and future prospects. We also examine the challenges and opportunities involved in building intelligent features using LLMs, discussing the role of experimentation, prompting, knowledge retrieval, and vector databases. Moreover, we consider the next set of challenges faced by teams looking to scale their LLM features, such as data labeling, fine-tuning, monitoring, observability, and testing. Drawing parallels with previous waves of machine learning DevTools, we predict the trajectory of this rapidly maturing market and the potential impact on the broader AI landscape. Join us in this exciting discussion to learn about the future of AI-driven applications and the tools that will enable their success.
The rise of LLMs means we're entering an era where intelligent agents with natural language interfaces will invade every kind of software on Earth. But how do we fix them when they hallucinate? How do we put guardrails around them? How do we protect them from giving away our secrets or falling prey to social engineering? We're on the cusp of a brand new era of incredible capabilities, but we've also got new attack vectors and problems that will change how we build and defend our systems. We'll talk about how we can solve some of these problems now and what we can do in the future to solve them better.
Are apps built on large-language models "just a thin wrapper" that others can quickly replicate, or can they be more defensible? This talk explores how to build moats and amazing new products with these "reasoning machines", by using the entire stack around the LLMs as well as the models themselves.
One of the biggest challenges of getting LLMs into production is their sheer size and computational complexity. This talk explores how smaller specialised models can, in most cases, produce equally good results while being significantly cheaper and easier to deploy.
Large Language Models (LLMs) are starting to revolutionize how users search for, interact with, and generate new content. There is one challenge, though: how do users easily apply LLMs to their own data? LLMs are pre-trained on enormous amounts of publicly available natural language data, but they don't inherently know about your personal or organizational data. LlamaIndex solves this by providing a central data interface for your LLMs. In this talk, we cover the tools that LlamaIndex offers (both simple and advanced) to ingest and index your data for LLM use.
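The ingest-and-index pattern the talk describes can be sketched in a few lines. This is a deliberately simplified illustration (a keyword inverted index in plain Python), not LlamaIndex's actual API; real pipelines embed chunks with a model and store the vectors in a vector database:

```python
def chunk(text, size=200):
    """Split a document into fixed-size chunks for indexing."""
    return [text[i:i + size] for i in range(0, len(text), size)]

def build_index(docs):
    """Ingest: map each lowercase token to the chunks containing it."""
    index = {}
    for doc in docs:
        for piece in chunk(doc):
            for token in piece.lower().split():
                index.setdefault(token, []).append(piece)
    return index

def query(index, question):
    """Retrieve candidate chunks to place into the LLM prompt as context."""
    hits = []
    for token in question.lower().split():
        hits.extend(index.get(token, []))
    return hits
```

For example, indexing an internal document and then querying it returns the relevant chunk, which would be prepended to the prompt so the LLM can answer from your data rather than from its pre-training alone.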
Join us on 'Track 1'!
With the fast pace of innovation and the release of Large Language Models like Bard and GPT-4, the role of data scientists and machine learning engineers is rapidly changing. APIs from Google, OpenAI, and other companies democratize access to machine learning but also commoditize some machine learning projects.
In his talk, Hannes will explain the state of the ML world and which machine learning projects are in danger of being replaced by 3rd party APIs. He will walk the audience through a framework to determine if an API could replace your current machine-learning project and how to evaluate Machine Learning APIs in terms of data privacy and AI bias. Furthermore, Hannes will dive deep into how you can hone your machine-learning knowledge for future projects.
This event is for those who want to learn how to quickly prototype and ship AI apps. We're going to share a few cheat codes: you'll learn how to use pre-trained ML models, how to build generative AI apps, chatbots, and conversational agents, and how to use GPUs for maximum productivity. You don't want to miss this!
LLMs have garnered immense attention in a short span of time, with their capabilities usually demonstrated to the world in scenarios that demand little precision, like demos and MVPs. But as we all know, deploying to prod is a whole other ballgame. In this talk, we'll discuss pitfalls to expect when deploying LLMs to production use cases, both at the terminal layer (direct-to-user) and at intermediate layers. We'll approach the topic through both infrastructural and output-focused lenses and explore potential solutions to challenges ranging from foundation-model downtime and latency concerns to output variability and prompt injections.
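One common mitigation for the output-variability problem mentioned above can be sketched as a validate-and-retry loop. This is a minimal sketch with hypothetical function names, not any particular framework's API:

```python
def generate_with_validation(prompt, call_model, validate, retries=3):
    """Call a model client and retry until the output passes validation.

    call_model and validate are caller-supplied: e.g. call_model wraps an
    LLM API request, and validate checks the output parses as expected.
    """
    for _ in range(retries):
        out = call_model(prompt)
        if validate(out):
            return out
    # Fall back explicitly rather than passing malformed output downstream.
    return None
```

In practice the validator might check that the response is parseable JSON, matches a schema, or stays within length limits; returning `None` forces the calling layer to handle failure deliberately.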
In this roundtable, we will share our experiences with LLMs across a number of real-world applications, including what it takes to build systems around LLMs in a rapidly changing landscape. We will discuss the challenges around productionising LLM-based solutions, evaluating output quality, and the implications around risk and compliance.
As the landscape of large language models (LLMs) advances at an unprecedented rate, novel techniques are constantly emerging to make LLMs faster, safer, and more reliable in production. This talk explores some of the latest patterns that builders have adopted when integrating LLMs into their products.
This talk will cover everything related to getting LLMs to use tools. It will discuss why enabling tool use is important, different types of tools, popular prompting strategies for using tools, and what difficulties still exist.
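As a minimal illustration of what "using tools" looks like mechanically, here is a sketch of the dispatch step, assuming a made-up JSON tool-call format rather than any specific framework's convention:

```python
import json

# Hypothetical tool registry: maps tool names to plain Python callables.
TOOLS = {
    "calculator": lambda expr: str(eval(expr, {"__builtins__": {}})),
    "lookup": lambda key: {"capital_of_france": "Paris"}.get(key, "unknown"),
}

def dispatch(model_output: str) -> str:
    """Parse a JSON tool call emitted by the model and run the named tool.

    Assumed format (an illustration, not a standard):
    {"tool": "<name>", "input": "<argument string>"}
    """
    try:
        call = json.loads(model_output)
        tool = TOOLS[call["tool"]]
    except (json.JSONDecodeError, KeyError, TypeError):
        # Model produced something that isn't a well-formed tool call.
        return "error: unrecognized tool call"
    return tool(call["input"])
```

The result string would be fed back into the prompt so the model can continue reasoning with the tool's answer; the hard parts the talk covers (prompting the model to emit valid calls, choosing among many tools) sit upstream of this loop.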
Adam will highlight potential negative user outcomes that can arise when adding LLM-driven capabilities to an existing product. He will also discuss strategies and best practices that can be used to ensure a high-quality user experience for customers.
Chat-based interfaces are the command-line interfaces of modern generative AI systems, and we should be more imaginative and ambitious when thinking about how we'll interact with LLMs in the future. I'll share five concrete big ideas for designing good interactions with LLMs and other generative AI models that I've found helpful in my own work, and hopefully in the process enable you to start thinking outside of the chat box (hah, literally!) when building your own products.
Generative models such as ChatGPT have changed many product roadmaps. Interfaces and user experiences can now be re-imagined and often drastically simplified to something resembling a Google search bar where the input is natural language. However, some models remain behind APIs without the ability to re-train on contextually appropriate data. Even when the model weights are publicly available, re-training or fine-tuning is often expensive, requires expertise, and is ill-suited to problem domains with constant updates. How, then, can such APIs be used when the data needed to generate an accurate output was not present in the training set because it is constantly changing?

Vector embeddings represent the impression a model has of some, likely unstructured, data. When combined with a vector database or search algorithm, embeddings can be used to retrieve information that provides context for a generative model. Such embeddings, linked to specific information, can be updated in real time, providing generative models with a continually up-to-date, external body of knowledge. Suppose you wanted to build a product that could answer questions about internal company documentation as an onboarding tool for new employees. For large enterprises especially, re-training a model on this ever-changing body of knowledge would be untenable in terms of cost-to-benefit ratio. Instead, using a vector database to retrieve context for prompts allows for point-in-time correctness of generated output. This also prevents model "hallucinations", as models can be instructed to provide no answer when the vector search returns results below some confidence threshold.

In this talk we will demonstrate the validity of this approach through examples. We will provide instructions, code, and other assets that are open source and available on GitHub.
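The confidence-threshold idea described above can be sketched with a toy in-memory "database" and hand-made vectors standing in for a real embedding model and vector store (all names and numbers here are illustrative assumptions):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Toy "vector database": (embedding, source text) pairs. A real system
# would compute embeddings with a model and query a vector store.
DOCS = [
    ([0.9, 0.1, 0.0], "VPN setup guide: install the client, then sign in."),
    ([0.1, 0.8, 0.2], "Expense policy: submit receipts within 30 days."),
]

def retrieve_context(query_vec, threshold=0.8):
    """Return the best-matching document, or None below the threshold,
    so the generative model can be instructed to give no answer."""
    best = max(DOCS, key=lambda d: cosine(query_vec, d[0]))
    score = cosine(query_vec, best[0])
    return best[1] if score >= threshold else None
```

When `retrieve_context` returns `None`, the prompt tells the model to say it doesn't know, which is exactly the hallucination guard described in the abstract.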
Join us on 'Track 1'! Prizes and swag for the person who can pull off the nastiest prompt injections.
Today, large language or "foundation" models (FMs) represent one of the most powerful new ways to build AI models; however, they still struggle to achieve production-level accuracy out of the box on complex, high-value, and/or dynamic use cases, often "hallucinating" facts, propagating data biases, and misclassifying domain-specific edge cases. This "last mile" problem is always the hardest part of shipping real AI applications, especially in the enterprise; while FMs provide powerful foundations, they do not "build the house".
Scalable, serverless deployments of LangChain apps on the cloud, without sacrificing the ease and convenience of local development. Streaming experiences without worrying about infrastructure.
LLMs unlock a huge range of new product possibilities but with everyone using the same base models, how can you build something differentiated? In this talk, we'll look at case studies of companies that have and haven't got it right and draw lessons for what you can do.
Building LLMs that work well in production, at scale, can be a slow, iterative, costly, and unpredictable process. While new LLMs emerge each day, much as we saw in the Transformer era, models are getting increasingly commoditized; the differentiator and key ingredient for high-performing models will be the data you feed them.
This talk focuses on how critical it is that data scientists work with high-quality data across the ML workflow, the importance of pre-training, and the common gotchas to avoid along the way.
As a small business, competing with large incumbents can be a daunting challenge. They have more money, more people, and more data, but they can also be inflexible and slow to adopt new technologies. In this talk, we will explore how small businesses can use the power of large language models (LLMs) to compete with large incumbents, particularly in industries like insurance. We will present two examples of how we are using LLMs at Anzen to streamline insurance underwriting and analyze employment agreements and discuss ideas for future applications. By harnessing the power of LLMs, small businesses can level the playing field and compete more effectively with larger companies.
While Large Language Models (LLMs) such as GPT-4 come pre-loaded with loads of useful general knowledge, they're rarely able to be deployed directly due to gaps in quality (lack of specialization) and deployability (too large and expensive). In this workshop, we'll demonstrate how the data development platform Snorkel Flow can be used to distill the relevant knowledge from state-of-the-art LLMs into smaller specialist models that are faster, cheaper, and higher quality.
Despite their advanced capabilities, Large Language Models (LLMs) are often too slow and resource-intensive for use at scale in voice applications, particularly for large-scale audio or low-latency real-time processing. SlimFast addresses this challenge by introducing Domain Specific Language Models (DSLMs) that are distilled from LLMs on specific data domains and tasks. SlimFast provides a practical solution for real-world applications, offering blazingly fast and resource-conscious models while maintaining high performance on speech intelligence tasks. We demo a new ASR-DSLM pipeline that we recently built, which performs summarization on call center audio.