RAG Based LLM AI Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Container)
RAG Based LLM AI Chatbot is a powerful Streamlit-based application designed to simplify document management. Upload your PDF documents, create embeddings for efficient retrieval, and interact with your documents through an intelligent chatbot interface. 🚀
- 📂 Upload Documents: Easily upload and preview your PDF documents within the app.
- 🧠 Create Embeddings: Generate embeddings for your documents to enable efficient search and retrieval.
- 🤖 Chatbot Interface: Interact with your documents using a smart chatbot that leverages the created embeddings.
- 📧 Contact: Get in touch with the developer or contribute to the project on GitHub.
- 🌟 User-Friendly Interface: Enjoy a sleek and intuitive UI with emojis and responsive design for enhanced user experience.
The Document Buddy App leverages a combination of cutting-edge technologies to deliver a seamless and efficient user experience. Here's a breakdown of the technologies and tools used:
-
LangChain: Utilized as the orchestration framework to manage the flow between different components, including embeddings creation, vector storage, and chatbot interactions.
-
Unstructured: Employed for robust PDF processing, enabling the extraction and preprocessing of text from uploaded PDF documents.
-
BGE Embeddings from HuggingFace: Used to generate high-quality embeddings for the processed documents, facilitating effective semantic search and retrieval.
-
Qdrant: A vector database running locally via Docker, responsible for storing and managing the generated embeddings for fast and scalable retrieval.
-
LLaMA 3.2 via Ollama: Integrated as the local language model to power the chatbot, providing intelligent and context-aware responses based on the document embeddings.
-
Streamlit: The core framework for building the interactive web application, offering an intuitive interface for users to upload documents, create embeddings, and interact with the chatbot.
document_buddy_app/
│── logo.png
├── new.py
├── vectors.py
├── chatbot.py
├── requirements.txt
Follow these instructions to set up and run the Document Buddy App on your local machine.
git clone https://github.com/GURPREETKAURJETHRA/RAG-Based-LLM-Chatbot.git
cd RAG-Based-LLM-Chatbot
- Create a Virtual Environment
You can either use Python’s venv or Anaconda to create a virtual environment for managing dependencies.
Option 1: Using venv
On Windows:
python -m venv venv
venv\Scripts\activate
On macOS and Linux:
python3 -m venv venv
source venv/bin/activate
Option 2: Using Anaconda
Follow these steps to create a virtual environment using Anaconda:
- Open the Anaconda Prompt.
- Create a new environment:
conda create --name Chatbot python=3.10
(Replace Chatbot with your preferred environment name if desired).
- Activate the newly created environment:
conda activate Chatbot
- Install Dependencies
Once the environment is set up (whether venv or Conda), install the required dependencies using requirements.txt:
pip install -r requirements.txt
- Run the App
Start the Streamlit app using the following command:
streamlit run new.py
Note: If your main application file is named differently, replace new.py with your actual file name (e.g., app.py).
This command will launch the app in your default web browser. If it doesn’t open automatically, navigate to the URL provided in the terminal (usually http://localhost:8501).
Contributions are welcome! Whether it’s reporting a bug, suggesting a feature, or submitting a pull request, your input is highly appreciated. Follow these steps to contribute:
- Fork the Repository: Click on the “Fork” button at the top-right corner of the repository page.
- Clone Your Fork
- Create a New Branch:
git checkout -b feature/YourFeatureName
- Make Your Changes: Implement your feature or fix.
- Commit Your Changes:
git commit -m "Add Your Feature Description"
- Push to Your Fork:
git push origin feature/YourFeatureName
- Create a Pull Request: Navigate to the original repository and create a pull request from your fork.
• Streamlit Documentation: https://docs.streamlit.io/
• LangChain Documentation: https://langchain.readthedocs.io/
• Qdrant Documentation: https://qdrant.tech/documentation/
• ChatOllama Documentation: https://github.com/langchain-ai/langchain-llms#ollama
Happy coding! 🚀✨
Distributed under the MIT License. See LICENSE
for more information.