LLM Data Engine

This is an open-source tool for collecting AI conversation datasets to fine-tune Large Language Models (LLMs) easily and effectively. Currently a work in progress.

Features

Create Projects: Kick things off by creating a project where you can configure the settings and system prompts for your project.
Link with Langfuse: Connect your projects to Langfuse, an open-source LLM engineering platform that will be used to store conversation/annotation data and user interactions.
Get Contributors: Generate a shareable link for your project for human experts to have mock conversations and annotate AI responses, building you a tailored, high-quality dataset to help train your model!

Demo Video

In this demo, I use the LLM Data Engine to collect data for training a model to behave as a helpful elementary school tutor.

I create a project and link it to Langfuse.
For demonstration, I act as an annotator, using the shareable link to simulate a conversation as a student and annotate/refine the AI responses.
I go to Langfuse to see the collected dataset, which can be used to fine-tune my model for my tutoring use case!

LLM.Data.Engine.Demo.mp4

Join in!

If you have ideas, find bugs, or want to help build something the Data Engine, don’t hesitate to open an issue, submit a pull request, or directly reach out!

Tech Stack

This project uses TypeScript, Next.js, Prisma/PostgreSQL, NextAuth, and Tailwind CSS, along with the OpenAI and Langfuse API's.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.devcontainer		.devcontainer
prisma		prisma
public		public
src		src
.env.example		.env.example
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
README.md		README.md
components.json		components.json
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
plate-components.json		plate-components.json
postcss.config.cjs		postcss.config.cjs
prettier.config.js		prettier.config.js
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Data Engine

Features

Demo Video

Join in!

Tech Stack

About

Releases

Packages

Languages

aakashg00/LLM-Data-Engine

Folders and files

Latest commit

History

Repository files navigation

LLM Data Engine

Features

Demo Video

Join in!

Tech Stack

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages