NAVATAR-Helper is an AI-driven chatbot developed to assist with health-related NEET (Not in Education, Employment, or Training) queries. It leverages the Retrieval-Augmented Generation (RAG) framework to provide accurate, bilingual answers in Norwegian and English based on verified sources, while minimizing hallucinations.
- Project Goal: Develop a fully working MVP of a RAG-based chatbot that delivers accurate, sourced & verified answers to NEET-related queries while minimizing hallucinations.
- Technology Stack (a minimal setup sketch follows this list):
  - Backend: Python, LangChain, custom LLMs.
  - Frontend: Streamlit for the user interface.
  - Embedding Model: `Alibaba-NLP/gte-multilingual-base` for question vectorization.
  - Vector Database: Milvus for storing and retrieving vectorized data.
  - LLM: `norallm/normistral-7b-warm-instruct` for generating responses.
  - GPU Server: Nvidia GeForce GTX 1080 Ti (11 GB VRAM).
- RAG Framework: Integrates retrieval and generation for precise answers.
- Bilingual Support: Handles both Norwegian and English queries.
- Source Attribution: Provides references for each response.
- Reduced Hallucinations: Strict use of retrieved, relevant data ensures factual outputs.
- Extensive Testing: Both RAG and non-RAG approaches for performance comparison.
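
The sketch below shows, in rough terms, how these components could be wired together in Python. It is illustrative rather than the project's actual code: the Milvus host and port, the collection name `navatar_docs`, and the model-loading options are assumptions.

```python
# Minimal setup sketch (not the project's actual code): load the embedding
# model, connect to Milvus, and load the generator LLM.
import torch
from sentence_transformers import SentenceTransformer
from pymilvus import connections, Collection
from transformers import AutoModelForCausalLM, AutoTokenizer

# Embedding model used for question/document vectorization
embedder = SentenceTransformer("Alibaba-NLP/gte-multilingual-base", trust_remote_code=True)

# Milvus vector database holding the embedded document chunks
connections.connect(host="localhost", port="19530")   # assumed host/port
collection = Collection("navatar_docs")                # assumed collection name
collection.load()

# Generator LLM; device_map="auto" lets Accelerate place (and, if needed,
# offload) layers around the 11 GB VRAM limit of the GTX 1080 Ti
tokenizer = AutoTokenizer.from_pretrained("norallm/normistral-7b-warm-instruct")
model = AutoModelForCausalLM.from_pretrained(
    "norallm/normistral-7b-warm-instruct",
    torch_dtype=torch.float16,
    device_map="auto",
)
```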
- User Input: Processes user questions via a Streamlit-based frontend.
- Language Detection: Determines the input language (Norwegian or English).
- Embedding: Converts questions into vectors using `Alibaba-NLP/gte-multilingual-base`.
- Context Retrieval: Utilizes Milvus to find relevant document chunks.
- Response Generation: Uses `norallm/normistral-7b-warm-instruct` to produce context-informed answers.
- Attribution: Displays sources for user verification (an end-to-end sketch of this flow follows the list).
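
Put together, the query flow can be sketched as below. This is illustrative, not the project's actual `ragMain.py`: the field names (`embedding`, `text`, `source`), search parameters, and prompt wording are assumptions, and `langdetect` stands in for whichever language-detection method the project uses.

```python
from langdetect import detect

def answer_question(question, embedder, collection, tokenizer, model, top_k=5):
    """Sketch of one RAG round trip: detect language, retrieve, generate, attribute."""
    lang = detect(question)  # e.g. "no" for Norwegian, "en" for English

    # Embed the question and retrieve the most similar document chunks from Milvus
    qvec = embedder.encode(question, normalize_embeddings=True).tolist()
    hits = collection.search(
        data=[qvec], anns_field="embedding",
        param={"metric_type": "COSINE", "params": {"nprobe": 10}},
        limit=top_k, output_fields=["text", "source"],
    )[0]
    chunks = [h.entity.get("text") for h in hits]
    sources = [h.entity.get("source") for h in hits]

    # Build a grounded prompt so the model answers only from the retrieved context
    instruction = (
        "Svar kun basert på konteksten nedenfor." if lang == "no"
        else "Answer using only the context below."
    )
    prompt = (
        f"{instruction}\n\nContext:\n" + "\n".join(chunks)
        + f"\n\nQuestion: {question}\nAnswer:"
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=300, do_sample=False)
    answer = tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )

    # Attach the (deduplicated) sources so the user can verify the answer
    return answer + "\n\nSources: " + ", ".join(dict.fromkeys(sources))
```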
- Clone the repository: `git clone https://github.com/sirin-koca/NAVATAR-Helper.git`, then `cd NAVATAR-Helper`.
- Create a virtual environment: `python3 -m venv venv`
- Activate it:
  - Linux/macOS: `source venv/bin/activate`
  - Windows: `venv\Scripts\activate`
- Install dependencies: `pip install -r requirements.txt`
- Run the services (a hypothetical client-side sketch follows this list):
  - HTTP Server: `python3 -m http.server 8503` (in the Server folder)
  - Client: `streamlit run Client.py --server.port 8080` (in the Client folder)
  - Backend: `python3 Server.py` (in the Server folder)
- Access:
  - Open rag2.cs.oslomet.no (requires the OsloMet network).
  - Ensure all three services run simultaneously.
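
As a rough illustration of how the client and backend fit together, the hypothetical Streamlit client below forwards a question to a backend HTTP endpoint and displays the answer with its sources. The endpoint URL, port, and JSON shape are assumptions; the actual transport between `Client.py` and `Server.py` may differ.

```python
# Hypothetical Streamlit client sketch; endpoint and response schema are assumptions.
import requests
import streamlit as st

BACKEND_URL = "http://localhost:5000/ask"  # hypothetical backend endpoint

st.title("NAVATAR-Helper")
question = st.text_input("Ask a question (Norwegian or English):")

if st.button("Submit") and question:
    # Forward the question to the backend and render the grounded answer
    resp = requests.post(BACKEND_URL, json={"question": question}, timeout=120)
    data = resp.json()  # assumed shape: {"answer": "...", "sources": ["..."]}
    st.write(data["answer"])
    st.caption("Sources: " + ", ".join(data.get("sources", [])))
```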
- `popDB.py`: Prepares the vector database with embeddings (see the illustrative sketch after this list).
- `setupLLM.py`: Initializes the LLM for use.
- `ragMain.py`: Main interaction handler for user queries.
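
As an illustration of what `popDB.py`-style population might involve (this is not the actual script), the sketch below chunks source documents, embeds the chunks with the same model, and inserts them into a Milvus collection. The schema, chunking parameters, index settings, file names, and collection name are all assumptions.

```python
# Illustrative database-population sketch; schema and parameters are assumptions.
from langchain_text_splitters import RecursiveCharacterTextSplitter
from pymilvus import Collection, CollectionSchema, DataType, FieldSchema, connections
from sentence_transformers import SentenceTransformer

connections.connect(host="localhost", port="19530")  # assumed host/port
embedder = SentenceTransformer("Alibaba-NLP/gte-multilingual-base", trust_remote_code=True)

# Assumed schema: auto primary key, chunk text, source reference, 768-dim embedding
fields = [
    FieldSchema("id", DataType.INT64, is_primary=True, auto_id=True),
    FieldSchema("text", DataType.VARCHAR, max_length=4096),
    FieldSchema("source", DataType.VARCHAR, max_length=512),
    FieldSchema("embedding", DataType.FLOAT_VECTOR, dim=768),
]
collection = Collection("navatar_docs", CollectionSchema(fields))

# Split source documents into overlapping chunks before embedding
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
documents = {"nav_guide.txt": open("nav_guide.txt", encoding="utf-8").read()}  # hypothetical files

for source, text in documents.items():
    chunks = splitter.split_text(text)
    vectors = embedder.encode(chunks, normalize_embeddings=True).tolist()
    collection.insert([chunks, [source] * len(chunks), vectors])

# Build a similarity index so queries can retrieve the nearest chunks
collection.create_index(
    "embedding",
    {"index_type": "IVF_FLAT", "metric_type": "COSINE", "params": {"nlist": 128}},
)
collection.flush()
```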
- Python 3.8 or higher
- PyCharm or any Python IDE
- A virtual environment for dependency management
- Access to a GPU server (e.g., Nvidia GeForce GTX 1080 Ti or higher)
- Git for version control
DATA3750 Applied AI & Data Science | Group Project | OsloMet H2024 ©