Minimal RAG NextJS

Overview

Minimal RAG NextJS is a proof-of-concept demonstrating that you can implement Retrieval-Augmented Generation (RAG) without relying on expensive vector database subscriptions. This project leverages the power of pgvector, a simple PostgreSQL extension, combined with OpenAI embeddings to create a cost-effective, local vector search solution.

Features

Uses pgvector extension for PostgreSQL
Integrates OpenAI embeddings for vector creation
Implements a NextJS web application for file upload and search
Chunks and stores vectors in a local database
Provides a simple, intuitive user interface

Prerequisites

Node.js (v14 or later)
Docker and Docker Compose
OpenAI API key

Installation

Clone the repository:

git clone https://github.com/MartinKondor/minimal-rag-nextjs.git
cd minimal-rag-nextjs

Install dependencies:
```
npm install
```

Set up the database:

docker-compose up -d
npx prisma generate
npx prisma db push --force-reset

Modify the given .env file in the root directory (if you wish):

SKIP_ENV_VALIDATION=0
NODE_ENV="development"
POSTGRES_URL="postgres://postgresuser:postgrespassword@localhost:54322"
POSTGRES_URL_NON_POOLING="postgres://postgresuser:postgrespassword@localhost:54322?pool=false"

Usage

Start the development server:
```
npm run dev
```
Open your browser and navigate to http://localhost:3000.
Follow the on-screen instructions:
- Enter your OpenAI API key (this is used client-side and not stored)
- Upload a text file (max 20,000 characters)
- Enter a search query
- View the most relevant chunks from your uploaded file

How It Works

File Upload: The app chunks your uploaded text file and creates embeddings using OpenAI's API.
Vector Storage: These embeddings are stored in the local PostgreSQL database using pgvector.
Search: When you enter a query, it's converted to an embedding and compared against the stored vectors.
Results: The most similar text chunks are retrieved and displayed, ranked by relevance.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Future Improvements

Implement user authentication
Add support for multiple file types
Optimize vector search for larger datasets
Implement more advanced RAG techniques

Acknowledgements

OpenAI for their embedding API
pgvector for enabling vector operations in PostgreSQL
Next.js for the React framework

Contact

Martin Kondor - https://martinkondor.github.io/

Project Link: https://github.com/MartinKondor/minimal-rag-nextjs

License

MIT License. See the LICENSE file for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.vscode		.vscode
app		app
docs		docs
lib		lib
prisma		prisma
sample_source		sample_source
.env		.env
.eslintrc		.eslintrc
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.nvmrc		.nvmrc
.prettierignore		.prettierignore
.prettierrc		.prettierrc
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Minimal RAG NextJS

Overview

Features

Prerequisites

Installation

Usage

How It Works

Contributing

Future Improvements

Acknowledgements

Contact

License

About

Languages

License

MartinKondor/minimal-rag-nextjs

Folders and files

Latest commit

History

Repository files navigation

Minimal RAG NextJS

Overview

Features

Prerequisites

Installation

Usage

How It Works

Contributing

Future Improvements

Acknowledgements

Contact

License

About

Resources

License

Stars

Watchers

Forks

Languages