MLOps Community
+00:00 GMT
Blog
# MLOps
# Data Science
# AI
# Machine Learning
# Course

MLOps Coding Course: Mastering Observability for Reliable ML

This article delves into the essential tools and practices for achieving comprehensive observability in your ML projects. We’ll unravel key concepts, showcase practical code examples from the accompanying MLOps Python Package, and explore the benefits of integrating industry-leading solutions like MLflow.This article delves into the essential tools and practices for achieving comprehensive observability in your AI/ML projects.
Médéric Hurier
Médéric Hurier · Aug 5th, 2024
Popular topics
# MLops
# LLMs
# AI
# Interview
# Machine Learning
# Machine learning
# Model Serving
# Observability
# MLOps
# GetYourGuide
# Presentation
# Monitoring
# Cultural Side
# Scaling
# Security
# ML Platform
# Responsible AI
# Run:AI
# Databricks
# Python
All
Sonam Gupta
Sonam Gupta · Jul 29th, 2024
Learn how to build a semantic search and a Q/A chat assistant that lets you ask questions about your podcast transcripts to gain further insights. This is especially useful if you want to save time by avoiding the need to go back and forth through each of the videos.
# Semantic Search
# Podcast
# OpenAI
David Weik
David Weik · Jul 25th, 2024
This blog explores LLMOps, focusing on integrating LLMs into business workflows. It addresses key challenges such as handling unstructured data and ensuring output accuracy and offers insights into maintaining the reliability and effectiveness of LLMs.
# LLMs
# LLMOps
# SAS
This is my story of building KinConnect, a tool designed to help hackathon participants form effective teams using AI-driven participant profiles and matching algorithms. The project was developed during the MongoDB GenAI Hackathon using tools like Google Forms, Pipedream, FireworksAI, Modal Labs, and MongoDB Hybrid Search. Some lessons learnt are the importance of experimentation, prompt engineering, and leveraging synthetic data.
# KinConnect
# Fireworks.ai
# Stealth AI Startup
Xin Huang
Rachna Chadha
Qing Lan
+5
Xin Huang, Rachna Chadha, Qing Lan & 5 more authors · Jul 17th, 2024
AWS introduces Meta Llama 3 models on Trainium and Inferentia in SageMaker JumpStart, reducing deployment costs by up to 50%. This enables scalable, customizable deployments using a no-code approach or the SageMaker JumpStart SDK.
# Meta Llama 3
# AWS Inferentia
# AWS Trainium
After diving into the transcription process in the first part of my series, I'm excited to share the next step in "Semantic Search to Glean Valuable Insights from Podcasts: Part 2." In this post, I walk you through the process of embedding the podcast transcripts using Cohere's embed model and efficiently storing them in ApertureDB. This step is crucial for enhancing search capabilities and ensuring our data is well-organized and ready for semantic querying. If you're interested in how to transform raw podcast data into a searchable database, this post is for you!
# Podcast Transcripts
# Cohere
# ApertureDB
While robust test coverage is critical for high-quality products, the non-deterministic nature and complexity of LLMs often render traditional testing methods inadequate. In addition to LLM evaluation strategies like gold data benchmarks, cross-model evaluations, and probabilistic assertions, this article discusses scenarios where mocking is beneficial. It also provides concrete examples of different mocking techniques to ensure robust testing. By strategically applying these methods, development teams can enhance productivity, control costs, and ensure the reliability of LLM applications in production.
# LLMs
# Software Development
# AGIFlow
This article provides a guide on creating an LLM application using LlamaIndex and Qdrant to interact with GitHub repositories for easy retrieval of code snippets. The application is deployed on Google Kubernetes Engine with Docker and FastAPI and includes a user-friendly Streamlit interface for sending queries. You can code along following the repository and the instructions step by step.
# LLMs
# Autoscale
# Martin Data Solutions
This blog post explores the process of building and deploying a serverless LLM application to perform semantic searches over academic papers using AWS Lambda and Qdrant. The project involves using LangChain and OpenAI’s embeddings for vector representation, Docker for deployment, and Streamlit for the UI. Detailed instructions and the complete code are provided to help you replicate the setup.
# LLMs
# AWS Lambda
# Qdrant
# Martin Data Solutions
This article shares a personal journey of being diagnosed with endometriosis and the subsequent deep dive into understanding and managing the condition. It highlights the societal neglect of endometriosis despite its prevalence and insufficient funding. The author leverages Large Language Models (LLMs) and data from Reddit to analyze common experiences and advice from fellow sufferers. Using LLMs, the study analyzed Reddit posts to understand sentiments and key topics, revealing frequent emotions like frustration and happiness. It underscores the importance of community support and practical advice for managing endometriosis, advocating for greater societal awareness and research funding.
# LLMs
# Sentiment Analysis
# Endometriosis
# Sentick
Kopal Garg
Kopal Garg · Jun 26th, 2024
The article covers several hands-on techniques to ensure the output from language models like GPT-4 and Gemini Pro meets specific requirements. It details how to use regular expressions, JSON schemas, context-free grammars, and templates to shape text outputs precisely. It also highlights the setup of the Vertex AI client, emphasizing safety in generation processes. Practical examples and code snippets are provided throughout, showing how these methods can be practically applied to get consistent and structured results from language models.
# LLMs
# Guided Generation
# Cartography Bio
Popular
Extending AI: From Industry to Innovation
Sophia Rowland, David Weik & Demetrios Brinkmann