MLOps Community

Authoring Interactive, Shareable AI Evaluation Reports with Zeno

Posted Nov 01, 2023 | Views 699
# AI Evaluation Reports
# Foundation Models
# Zeno
speakers
Alex Cabrera
Co-Founder @ Zeno

Alex Cabrera is a Ph.D. candidate at Carnegie Mellon University. He works on human-centered AI, specifically in applying techniques from HCI and visualization to help people better understand and improve their AI systems. He is supported by an NSF Graduate Research Fellowship and has spent time at Apple AI/ML, Microsoft Research, and Google.

Adam Becker
IRL @ MLOps Community

I'm a tech entrepreneur and I spent the last decade founding companies that drive societal change.

I am now building Deep Matter, a startup still in stealth mode...

I was most recently building Telepath, the world's most developer-friendly machine learning platform. Throughout my previous projects, I learned that building machine-learning-powered applications is hard, and especially hard when you don't have a background in data science. I believe that this is choking innovation, particularly in industries that can't support large data teams.

For example, I previously co-founded Call Time AI, where we used artificial intelligence to assemble and study the largest database of political contributions. The company powered progressive campaigns from school board to the Presidency. As of October 2020, we had helped Democrats raise tens of millions of dollars. In April 2021, we sold Call Time to Political Data Inc. Our success is due in large part to our ability to productionize machine learning.

I believe that knowledge is unbounded, and that everything that is not forbidden by laws of nature is achievable, given the right knowledge. This holds immense promise for the future of intelligence and therefore for the future of well-being. I believe that the process of mining knowledge should be done honestly and responsibly, and that wielding it should be done with care. I co-founded Telepath to give more tools to more people to access more knowledge.

I'm fascinated by the relationship between technology, science and history. I graduated from UC Berkeley with degrees in Astrophysics and Classics and have published several papers on those topics. I was previously a researcher at the Getty Villa where I wrote about Ancient Greek math and at the Weizmann Institute, where I researched supernovae.

I currently live in New York City. I enjoy advising startups, thinking about how they can serve as an excellent vehicle for addressing the Israeli-Palestinian conflict, and hearing from random folks who stumble on my LinkedIn profile. Reach out, friend!

SUMMARY

LLMs and foundation models are unlocking thousands of possibilities for AI-driven products, from writing assistants to art platforms. Despite their abilities, these AI systems are complex and can fail in significant ways, such as producing hallucinations and biased outputs. In this talk, I’ll introduce Zeno, an interactive platform for creating and sharing in-depth evaluations of complex AI systems. Zeno lets users explore the inputs and outputs of any AI system, from text to audio and image models, and create interactive reports. We envision Zeno being the go-to tool that both AI developers and auditors use to share reproducible evaluations of AI systems.

