MLOps Community
Home
/
Collections
/
LLMs in Production Conference Part III

LLMs in Production Conference Part III

Popular topics
# LLMs
# LLM in Production
# AI Agents
# Agents in Production
# AI
# LLM
# Machine Learning
# MLOps
# Rungalileo.io
# MLops
# RAG
# Prosus Group
# Generative AI
# Interview
# Machine learning
# Tecton.ai
# Arize.com
# mckinsey.com/quantumblack
# Redis.io
# Zilliz.com
Video

Finetuning Open-Source LLMs // LLMs in Production Conference 3 Keynote 1

This tutorial starts by surveying the different ways we can use LLMs. Then, we will take a deeper dive into various LLM finetuning strategies, such as low-rank adaptation, and learn how we can create custom LLMs using open-source software.
# Finetuning
# Open-Source
# LLMs in Production
# Lightning AI
Sebastian Raschka
Demetrios Brinkmann
Sebastian Raschka & Demetrios Brinkmann · Oct 9th, 2023
29:04
Video

What Drives GenAI Development in the Next 3 Years

The future of Generative AI will be shaped by many factors like scaling laws, the evolution of agents, multi-modality, open-source contributions, etc. However, challenges such as GPU and talent shortages and regulations could pose obstacles. Join us as we delve into the fascinating world of Generative AI and explore the key drivers that will shape its development in the next three years.
# Generative AI
# GPU
# Prosus
Euro Beinat
Demetrios Brinkmann
Euro Beinat & Demetrios Brinkmann · Oct 9th, 2023
18:54
Video

Fireside Chat with LLM Startups

Martian is focused on building a model router to dynamically route every prompt to the best LLM for the highest performance and lowest cost. Corti, the Al Co-Pilot for health care uses Al to improve patient care, demonstrating the potential of Al in healthcare and medical decision-making. They recently raised $60M, with Prosus being one of the lead investors. Transforms is pioneering in synthetic entertainment, showing how Al can transform the way we create and consume media.
# Startups
# LLMs
# Prosus
# prosus.com
# transforms.ai
# withmartian.com
# corti.ai
Paul van der Boor
Sandeep Bakshi
Shriyash Upadhyay
+2
Paul van der Boor, Sandeep Bakshi, Shriyash Upadhyay & 2 content:more content:speakers · Oct 9th, 2023
30:46
Video

Automating Data Annotation with LLMs // LLMs in Production Conference 3 Workshop1

This talk dives into using Large Language Models (LLMs) for data annotation automation. We'll cover techniques and workflows to run automation with LLMs, and discuss how to interpret, validate, and use these results in subsequent machine learning pipelines, such as model finetuning. Join to understand the practical side of LLM-driven data annotation.
# Data Annotation
# LLMs
# HumanSignal
Nikolai Liubimov
Michael Malyuk
Chris Hoge
+1
Nikolai Liubimov, Michael Malyuk, Chris Hoge & 1 content:more content:speaker · Oct 9th, 2023
1:03:04
Video

AI in Education Fireside Chat

Explore the transformative role of AI in EdTech, discussing its potential to enhance learning experiences and personalize education. The panelists share insights on AI use cases, challenges in AI integration, and strategies for building a differentiated business model in the evolving AI landscape. The discussion looks ahead at how the latest wave of GenAI is set to shape the future of education. Join us to understand the exciting prospects and challenges of AI in EdTech.
# AI
# Education
# Prosus
# Prosus.com
# Duolingo.com
# brainly.com
# sololearn.com
Paul van der Boor
Klinton Bicknell
Bill Salak
+1
Paul van der Boor, Klinton Bicknell, Bill Salak & 1 content:more content:speaker · Oct 9th, 2023
31:01
Video

Observability for LLMs

Making LLMs reliable is hard. You can't debug or unit test them, not in the traditional sense at least. Instead, you'll need to turn to the practice of Observability, by instrumenting your feature to produce rich telemetry and analyzing behavior from that data. Observability can also act as a key source of data for evaluations.
# Observability
# LLMs
# Honeycomb
Phillip Carter
Demetrios Brinkmann
Phillip Carter & Demetrios Brinkmann · Oct 9th, 2023
12:26
Video

Unlocking Real-World LLM Use Cases

Unlock the power of real-world LLM use cases and learn how to keep them grounded and deliver accurate results through techniques such as Retrieval Augmented Generation (RAG). Leverage Databases with vector support as a bridge between LLMs and your enterprise Gen AI apps.
# LLM Use Cases
# RAG
# Google Cloud
Hamsa Buvaraghan
Demetrios Brinkmann
Hamsa Buvaraghan & Demetrios Brinkmann · Oct 9th, 2023
12:16
Video

Using Product Analytics to Build Better LLM Applications

So you’ve built your first LLM product. Now what? If you’re a Product person, you need to understand how people are using it, and how it's performing. That’s where product analytics come in. But it's a totally different problem to product analytics for graphical user interfaces - you need to understand mountains of text. This talk will cover the key considerations for building great end-user experiences with LLMs, from a Product Managers perspective.
# Product Analytics
# LLM Applications
# ContextAI
Henry Scott-Green
Demetrios Brinkmann
Henry Scott-Green & Demetrios Brinkmann · Oct 9th, 2023
9:27
Video

Exploring the Latency/Throughput & Cost Space for LLM Inference

Getting the right LLM inference stack means choosing the right model for your task, and running it on the right hardware, with proper inference code. This talk will go through popular inference stacks and set-ups, detailing what makes inference costly. We'll talk about the current generation of open-source models and how to make the best use of them, but we will also touch on features currently missing from the open-source serving stack as well as what the future generations of models will unlock.
# LLM Inference
# Latency
# Mistral.AI
Timothée Lacroix
Demetrios Brinkmann
Timothée Lacroix & Demetrios Brinkmann · Oct 9th, 2023
30:25
Video

Amplifying Impact with Generative AI: Insights from 10,000 Colleagues

Prosus AI, a top-tier applied AI centre, drives rapid experimentation and implementation of AI throughout Prosus's global portfolio, which includes over 80 technology companies with more than 800 AI experts. In this talk, we show how AI is harnessed for discovery within the Prosus network. We will share insights gained from 10,000 colleagues who utilise generative AI daily across the group, significantly enhancing the impact of their work.
# Generative AI
# LLMs
# Prosus AI
Paul van der Boor
Demetrios Brinkmann
Paul van der Boor & Demetrios Brinkmann · Oct 9th, 2023
32:12
Video

The Truth About AI Agents

AutoGPT sparked the imaginations of millions, it’s exciting because you can see what they will be able to do just by talking to them like you would with a human. It blew up because of this, not because of actual use case. First, what is an interact through natural language, performs actions in the real world. RPA. What agents could do in the future. But they suck right now. Why no actual use case yet? Reliability Memory, llm model complexity, architecture, tokens per second But ultimately we need a loss function to do tdd and improve agents. Getting thousands of prs and no way to test them. Don't know how to step if you don't know where you're going. How to get there, related to building language models Performance (benchmark) Safety (monitor) Standardization (agent protocol) Research pedigree is no longer the barrier to making an impact in the space, creativity is. And now there's a clear way to make it happen, mention and some of the work there AutoGPT.
# AI Agents
# AutoGPT
# RPA
# Autogpt.net
Silen Naihin
Demetrios Brinkmann
Silen Naihin & Demetrios Brinkmann · Oct 9th, 2023
31:40
Video

LLMs in Production at GetYourGuide

Discover how GetYourGuide navigates the dynamic landscape of LLMs and delivers products valuable to consumers and business. Will cover decision-making process of when to opt for LLMs over supervised models, offering practical insights into implementation and how these are put into production at GetYourGuide. On top of this will go deeper into strategic LLM prioritisation, streamlining their integration into product processes, and ensuring safe deployment to consumers.
# LLMs
# Production
# GetYourGuide
Meghana Satish
Tina Treimane
Demetrios Brinkmann
Meghana Satish, Tina Treimane & Demetrios Brinkmann · Oct 18th, 2023
29:39
Video

Building Context-Aware Reasoning Applications with LangChain and LangSmith

How can companies best build useful and differentiated applications on top of language models? Many of the products and companies built do this by providing the relevant context to LLMs and asking them to reason appropriately. In this talk, Harrison will discuss the different types of context you should be aware of, the different levels of cognitive architectures that are emerging, and how LangChain and LangSmith are built to help with this journey.
# Context-Aware Reasoning Applications
# LangSmith
# LangChain
Harrison Chase
Demetrios Brinkmann
Harrison Chase & Demetrios Brinkmann · Oct 18th, 2023
9:11
Video

LLM Valley

A video-game themed walkthrough of LLM products in today's markets.
# LLM Products
# Video Game
# Contenda
Lilly Chen
Demetrios Brinkmann
Lilly Chen & Demetrios Brinkmann · Oct 18th, 2023
11:57
Video

Efficient Serving of LLMs for Experimentation and Production with Fireworks.ai

Deploying LLMs to products is no easy feat. It's common to have dozens of model variants when trying things out. As usage scales up, cost-to-serve and latency become primary concerns. In this talk, we will dive into how Fireworks.aI GenAI Platform helps developers on the journey from early experimentation to highly loaded production deployments without breaking the bank.
# LLMs for Experimentation
# LLMs for Production
# Fireworks.ai
Dmytro Dzhulgakov
Demetrios Brinkmann
Dmytro Dzhulgakov & Demetrios Brinkmann · Oct 18th, 2023
11:43
Video

Data Quality’s Impact on Large Language Models

Data quality is the foundation of successful Generative A, traditional ML, and data-driven initiatives. In this talk, I will share our research results on this as there is a tangible impact of poor data quality on model performance and training cost.
# Generative A
# LLMs
# Telmai
Mona Rakibe
Maxim Lukichev
Demetrios Brinkmann
Mona Rakibe, Maxim Lukichev & Demetrios Brinkmann · Oct 18th, 2023
26:41
Video

Addressing Privacy and the GDPR in LLM Applications

In this talk, Pieter covers some key things to keep in mind when building ML applications using LLMs regarding privacy.
# ML Applications
# LLMs Privacy
# Private AI
Pieter Luitjens
Demetrios Brinkmann
Pieter Luitjens & Demetrios Brinkmann · Oct 18th, 2023
9:03
Video

From Building Self-driving Cars to Building LLM Applications

What Effy learned from building tools for self-driving cars, and how might we apply those learnings to building LLM applications?
# LLM Applications
# Self-driving Cars
# Baserun
Effy Zhang
Effy Zhang · Oct 18th, 2023
11:45
Video

Current State of LLMs in Production

The rapid advancements in Natural Language Processing (NLP) have paved the way for the deployment of Large Language Models (LLMs) in real-world production systems. This talk aims to provide a succinct overview of the current state of Large Language Models (LLMs) in production, emphasizing their capabilities, deployment strategies, and the challenges encountered.
# Natural Language Processing
# LLMs
# Truckstop
# Truckstop.com
Apurva Misra
Demetrios Brinkmann
Apurva Misra & Demetrios Brinkmann · Oct 18th, 2023
11:46
Video

Assess the Value and Feasibility of LLM Use Cases with a Checklist

Have you ever had someone suggest "ohhh we could use LLMs for that?" And you knew the idea had some painpoints...but you couldn't put your finger on the sore point? Hopefully my little checklist can help you out. In this talk I will introduce a simple tool for assessing the value and feasibility of an LLM use case. It will make it easier to discuss these. It's not rocket science, but it does help when discussing use cases with stakeholders of different experience levels.
# LLM Value and Feasibility
# Checklist
# Xebia
Eva Bosma
Rens Dimmendaal
Adam Becker
Eva Bosma, Rens Dimmendaal & Adam Becker · Oct 24th, 2023
10:39
Video

Synthetic Data for Robust LLM Application Evaluation

We will start with a brief overview of synthetic data generation. Next, we will inspect its effectiveness by referring to some of the latest works in this space. Then we will move on to see how this can be leveraged in evaluating LLM-driven applications pipelines.
# Synthetic Data
# Application Evaluation
# ExplodingGradients
Shahul Es
Adam Becker
Shahul Es & Adam Becker · Oct 24th, 2023
8:44
Video

False Starts and Dead Ends: Building a Retrieval Augmented Generation System

Discover the highs and lows of building a Retrieval Augmented Generation (RAG) system as we walk through the crucial challenges: data quality, query engines, and contextualization. Gain key insights into the pitfalls and best practices that can help you make informed decisions in your own projects.
# Retrieval Augmented Generation System
# Best Practices
# Train GRC Inc
Wes  Ladd
Adam Becker
Wes Ladd & Adam Becker · Oct 24th, 2023
11:43
Video

Deploying LLMs on Structured Data Tasks: Lessons from the Trenches

Join us for an introduction to NSQL, a new family of open-source foundation models with up to 7B parameters automating SQL generation tasks. We will explore the limitations of existing open and closed-source foundation models for enterprise use, including issues of customization, quality, and privacy. We will highlight how NSQL addresses these challenges with its open-source nature, specialized training for SQL tasks, and a range of model sizes to accommodate diverse hardware configurations. Included in the talk will be NSQL's data generation process and GPU training approach, underlining its advantages over other foundation models for SQL generation. We will demonstrate how the NSQL models outperform existing open source models for SQL generation and, by starting from the newest LLama2 commercially available model, we even beat closed source models.
# Deploying LLMs
# Structured Data Tasks
# Numbers Station
Laurel Orr
Adam Becker
Laurel Orr & Adam Becker · Oct 24th, 2023
31:07
Video

Building RAG-based LLM Applications for Production

In this talk, we will cover how to develop and deploy RAG-based LLM applications for production. We will cover how the major workloads (data loading and preprocessing, embedding, serving) can be scaled on a cluster, how different configurations can be evaluated and how the application can be deployed. We will also give an introduction to Anyscale Endpoints which offers a cost-effective solution for serving popular open-source models.
# LLM Applications
# RAG
# Anyscale
Yifei Feng
Philipp Moritz
Adam Becker
Yifei Feng, Philipp Moritz & Adam Becker · Oct 26th, 2023
30:23
Video

TimeGPT: The First Foundation Model for Time Series

Time series—data ordered chronologically—constitutes the underlying fabric of systems, enterprises, and institutions. Its impact spans from measuring ocean tides to tracking the daily closing value of the Dow Jones. This type of data representation is indispensable in sectors such as finance, healthcare, meteorology, and social sciences. However, the current theoretical and practical understanding of time series hasn't yet achieved a consensus among practitioners that mirrors the widespread acclaim for generative models in other fundamental domains of the human condition, like language and perception. Our field is still divided and highly specialized. Efforts in forecasting science have fallen short of fulfilling the promises of genuinely universal pre-trained models. In this talk, we will introduce TimeGPT, the first pre-trained foundation model for time series forecasting that can produce accurate predictions across various domains and applications without additional training. A general pre-trained model constitutes a groundbreaking innovation that opens the path to a new paradigm for the forecasting practice that is more accessible and accurate, less time-consuming, and drastically reduces computational complexity. We will show how to use TimeGPT in a live demo.
# TimeGPT
# Time Series
# Nixtla
Azul Garza
Adam Becker
Azul Garza & Adam Becker · Oct 26th, 2023
9:56
Video

Finetuning LLMs

An opinionated view of how to build production LLMs.
# Fine-tuning LLMs
# Building Production
# PowerML, Inc
Greg Diamos
Adam Becker
Greg Diamos & Adam Becker · Oct 26th, 2023
8:45
Video

Product Strategy for LLM features when LLM isn’t your Product

The rapid adoption of Large Language Models (LLMs) has swept across various industries, inspiring companies to incorporate them into their products, regardless of the industry's domain. Unlike organizations like OpenAI, Google, or Microsoft, most entities view LLMs as powerful tools rather than standalone products. In this context, it remains paramount to uphold core product principles where customer satisfaction reigns supreme. In this talk, we will see how to pick practical use cases for using LLMs, where they are not essentially user-facing chatbots but valuable tools to build new features into existing products - with examples from the cybersecurity domain. We will also see a sample product strategy for how to plan and roadmap LLM features, along with key performance metrics to gauge success.
# LLM features
# Product Strategy
# LLMs
Harini Kannan
Adam Becker
Harini Kannan & Adam Becker · Oct 26th, 2023
22:59
Video

Product Engineering for LLMs Panel

A product-minded engineering perspective on UX/design patterns, product evaluation, and building with AI.
# UX
# Product-minded Engineering
# Building with AI
Charles Frye
Sahar Mor
Sarah Guo
+3
Charles Frye, Sahar Mor, Sarah Guo & 3 content:more content:speakers · Oct 26th, 2023
31:46
Video

Speed and Sensibility: Balancing Latency and UX in Generative AI

Conversational AI demands low latency for a seamless dialogue between humans and AI. However, engineers face the dilemma that some latency is inherently required in order to process human speech and craft a response. Some incremental wins to shave off milliseconds involve trade-offs against how the AI response could be enriched during the additional processing time. Others simply refactor out inefficiency to obtain more performant results from AI devtools. This talk presents best practices of designing streaming speech-to-text applications, as well as reasons to accept extra latency for the sake of an enhanced product experience.
# Conversational AI
# Humans and AI
# Deepgram
Julia  Kroll
Adam Becker
Julia Kroll & Adam Becker · Oct 26th, 2023
8:58
Video

AI Squared: Breaking LLMs out of the Chat Application

AI Squared is an AI platform designed for product owners, data scientists, and enterprise leaders. We empower you to accelerate both predictive and generative AI projects, measure their benefits, and drive significant revenue growth and cost reduction. The largest gap in the LLM developer stack is creating different experiences for users to leverage LLM results. Many companies have only utilized generative AI inside of chat applications. AI Squared has developed a framework that empowers our customers to harvest context and connect to additional content as well as other AI models from across the organization while integrating these insights directly into currently existing tools and applications.
# AI Platform
# LLM Developer Stack
# AI Squared
Benjamin Harvey
Adam Becker
Benjamin Harvey & Adam Becker · Oct 26th, 2023
52:20
Code of Conduct
Your Privacy Choices