MLOps Community
+00:00 GMT

How I built KinConnect to win 2000$ of Fireworks.ai!

How I built KinConnect to win 2000$ of Fireworks.ai!
# KinConnect
# Fireworks.ai
# Stealth AI Startup

Build a low-cost low-code recommendation system with LLMs

July 22, 2024
Nehil Jain
Nehil Jain
How I built KinConnect to win 2000$ of Fireworks.ai!

Here is my story of building a low-code low-cost recsys for finding teammates for a hackathon!

I write about experimentation with synthetic data, scoping down a solution and getting an end to end RAG application out with multiple OSS LLMs, MongoDB Hybrid Search.

If this sounds interesting, read on…

I started with: Is it possible to turn a hackathon into a melting pot of startup ideas and invaluable connections for future innovators and builders?

Imagine anyone—even your introvert geniuses—being able to thrive in a team of individuals sharing the same goal of wining a hackathon.

My team KinConnect wanted to get to the bottom of it at the MongoDB GenAI Hackathon in Mountain View, CA.

My motivation: I had struggled to find great team members and always felt it was random. I believed there had to be a better way. We all answer the participation request survey with questions like, “What is the idea you want to hack on?”, “What are some projects you are proud of?” … There has to be a better way!

So my team KinConnect embarked on a journey: Build a tool to help organizers and participants make better teams for the hackathons using real signal from the surveys.

While speaking with Gina, who is a community manager herself, we discussed the ingredients which make successful connections at these events. Gen AI alone wont be able to automate it all the way, so we wanted to build a tool for the organizer to manage community members and help in team formation.

This puzzle had two parts: rich participant profiles and a solid matching algorithm.

Because we only had a few hours to build and demo this, we brainstormed and divided the components into digestible chunks. We were going to do: Profile creation and verification, Profile matching and explanation, UI/UX, Data creation and testing.

Of course, given it’s a hackathon, we wanted to scope it down to what is possible in 4-5 hours.

User Flow:

  1. User creates a profile by filling a form. For the hackathon we used this google form.
  2. They get a confirmation email after their profile has been ingested into the database.
  3. They get an email with up to 5 matches.

Our technical architecture:

some-file-6057cbac-da56-47e7-a7a3-e3a92d885767


UI:

In the theme of scoping things down, we decided not to build a custom UI and rather use Google Forms. This gives us the flexibility to easily change the questions and store the responses. They are even editable if people want to add more details or change their previous responses.

To show the matches, we chose email. Simply, get an email from the AI when matches have been found.

some-file-75a3c3af-ec48-4f62-8081-11800f837017


Backend:

We build two components in the backend. Profiler and recommender. The original two pieces of the puzzle, remember! We used pipedream for event driven integration of google forms and our backend. Its a no-code platform which allows you to build automated workflows with apps and custom code. We off-loaded complex compute to Modal Labs, hosting a FastAPI for doing the matching.


Fireworks AI:

We wanted to test how different models perform for our tasks. Fireworks gave us ability to use Llama3 and Mistral with function calling. They were sponsoring the event with credits which made it easier to start hitting the endpoints and experimenting.

Because Fireworks AI made all their models OpenAI compatible, it was easy to integrate with existing tools that don’t support fireworks out of the box.

some-file-5c6a86cc-d53a-45aa-9d00-0b9a453e6969

All of the development costed me less than 1$ 🤯


MongoDB:

Another sponsor of the event is a popular mature database technology. Because each questionnaire can be different document store like mongodb suits the bill perfectly. We wanted to employ Hybrid search to in our matching algorithm. Hybrid = Vector Search + Text Search. MongoDB has a great tutorial on how to do Hybrid Search for RAG all inside the db.

Many experts like @jobergum from Vespa Engine talk about why Hybrid search is important for optimizing retrieval. Here is what he said recently:

some-file-fad86389-2fe8-422e-abb8-82ca5d5f5861

In our reciprocal rank fusion, we found that keyword search has higher signal than semantic search so we weighted it higher.


Experimentation:

I have learned that the key to making a successful LLM app is experimenting and evaluating different prompts, llms and methods for a given task. So from the start I wanted to make sure we can iterate and evaluate the outputs from LLMs quickly. We used promptfoo for it. Some of our team members focused on creating fake but real looking profiles by filling a google form.


So how did it turn out?

You can try it out at https://forms.gle/6Phohd7QP22EchS79. I would love feedback if you do 🙂. Please reach out at @nehiljain.


My highlights and lessons from the day of hacking:

Experiment with Promptfoo to get the prompts right

Using promptfoo we setup prompts with various fireworks AI models to test which works for our use case. We used LLMs for many tasks in our backend.

We generated synthetic profile data, created structured attributes from form submissions, rewrite user requests during our matching algo and generated summary explanation using a mixture of models.

Here is an example set of results: Parsing structured career history from the form submissions.

some-file-ac83a629-eaf8-4f53-b655-6036c87e99cd


What we found with experimentation is that most of the smaller LLMs were not able to parse complex nested data structures but when we break down the task to smaller chunks they excelled at it.

Smaller models have trouble extracting complex nested structured outputs like these from the input text. Look at the ProfileModel class which composes an object with list of Career Entries and Project Entries.

some-file-266a6279-c913-4595-987b-70e3265ec1f4

But if I break it down to individual tasks for Career History, Portfolio and Profile Details like below, it works very well. Here is an example of how we extracted Portfolio.

some-file-fe996630-3413-496f-8b04-3337ee98de1c

All the code for experimentation can be found in https://github.com/nehiljain/kinconnect/tree/main/kinconnect_api/prompts.


Launch Your Hackathon App quickly with Synthetic Data

Imagine building your app with a real looking dataset from the start. You can design a better UX, thoroughly test your matching algorithms, and write solid assertions. Synthetic data turns your vision into a robust reality from day one.

We used mixtral-8x22b-instruct for generating synthetic form submissions.

Our observation?

LLMs still struggle with diversity in synthetic data generation, but they offer a promising start. We had to hack together some pieces together to get it right.

Synthetic data generation at scale is a unsolved problem people are working on. This approach could be promising.


Use Pipedream/Zapier to reduce your code footprint

We wanted to integrate 4 components - Google Forms, Email, MongoDB, APIs together. Writing the boilerplate code and hosting it on some cloud server can be a rabbit hole that can take days. For hackathon, we were not worried about cost at scale, so we relied on low-code workflow automation tools.

This is what my backend looks like:

some-file-75e95c4f-e88b-4ff3-b463-6ecb4dd0f013


Other details:

  1. We used llama-v3-70b-instruct to rewrite queries into detailed, rich queries. Users often give brief descriptions, so detailed queries improve accuracy.
  2. We used excellent docs on MongoDB https://www.mongodb.com/docs/atlas/atlas-vector-search/ai-integrations/langchain/ to quickly integrate with MongoDB as our data store.

All the code for the repository is available at https://github.com/nehiljain/kinconnect


My takeaways for winning:

  1. Doing one thing well can help you win a Hackathon.
  2. If you don't have a UI use a great deck to present the idea and solution clearly
  3. Function calling is a great way to build reliable applications on top of LLM. Fireworks gives you much cheaper models to work with.
  4. Fireworks allows you to interact with their models on OpenAI chat api spec. I love to see the industry converge on 1 standard so that developers can standardize the tooling on top of it.

Code Repo: https://github.com/nehiljain/kinconnect/


Next Steps:

  1. Add data enrichment modules by scraping LinkedIn and Github
  2. Build a robust and flexible matching algorithm
  3. Trial this as a bot in discord and slack for next hackathons we attend

In the end, we learned a lot. There was awesome camaraderie in our team, which helped us push all day till the end. The food was awesome. Thank you Karissa and Team from MongoDB to host us. Winning was the cherry on top of this experience and will help us bring our idea to the next level by taking it to a production ready scale with Fireworks AI LLM credits. This is a great read for my future self. How to optimize LLM inference for my workload

I am bullish that the features and integrations built into MongoDB make it a great one-size-fits-all database for app developers. I know 'Postgres is all you need' is another sentiment floating around in the AI engineering community, but I think MongoDB is well-poised to be a great db for the Gen AI stack.


Dive in
Related
Blog
How to Adapt your LLM for Question Answering with Prompt-Tuning using NVIDIA NeMo and Weights & Biases
By Anish Shah • Feb 13th, 2024 Views 109
Blog
Redis Vector Search Engineering Lab Review
By Samuel Partee • Dec 8th, 2022 Views 143
Blog
How to Build a Knowledge Assistant at Scale
By Mohamed Abusaid • Dec 22nd, 2023 Views 147