Version: v1.0.0
Release Date: [2024-12-25]
Highlights
The initial release of QA with Your Documents, a Streamlit-based interactive application for querying uploaded documents using OpenAI's GPT-3.5, LangChain, and ChromaDB.
Features
-
Document Upload:
- Upload multiple PDF and CSV files.
- Extract content using LangChain's
PyPDFLoader
andCSVLoader
.
-
Vector Database Integration:
- Store document embeddings in a persistent ChromaDB vector database.
- Automatically associate metadata with documents for better citation.
-
Intelligent Query System:
- Query across all uploaded documents using OpenAI's GPT-3.5.
- Retrieve accurate answers with source citations.
-
Cited Files Download:
- Display files cited in query responses.
- Download cited files directly from the application.
-
State Persistence:
- Session-based state management to keep track of uploaded and cited files.
-
Simple and Interactive Interface:
- Powered by Streamlit for easy deployment and usability.
Known Issues
- Currently supports only PDF and CSV files.
- Query performance may depend on the size and number of uploaded documents.
Getting Started
- Follow the instructions in the README to set up and run the application locally.
Future Improvements
- Add support for more file types (e.g., Word, Excel).
- Implement fine-tuning for domain-specific query optimization.
- Enhance multi-language support for documents and queries.
- Provide query result export options.
Special thanks to the open-source community for their tools and support!