Skip to content

v1.0.0 - Initial Release of QA with Your Documents

Latest
Compare
Choose a tag to compare
@stacksapien stacksapien released this 24 Dec 22:22

Version: v1.0.0

Release Date: [2024-12-25]

Highlights

The initial release of QA with Your Documents, a Streamlit-based interactive application for querying uploaded documents using OpenAI's GPT-3.5, LangChain, and ChromaDB.

Features

  • Document Upload:

    • Upload multiple PDF and CSV files.
    • Extract content using LangChain's PyPDFLoader and CSVLoader.
  • Vector Database Integration:

    • Store document embeddings in a persistent ChromaDB vector database.
    • Automatically associate metadata with documents for better citation.
  • Intelligent Query System:

    • Query across all uploaded documents using OpenAI's GPT-3.5.
    • Retrieve accurate answers with source citations.
  • Cited Files Download:

    • Display files cited in query responses.
    • Download cited files directly from the application.
  • State Persistence:

    • Session-based state management to keep track of uploaded and cited files.
  • Simple and Interactive Interface:

    • Powered by Streamlit for easy deployment and usability.

Known Issues

  • Currently supports only PDF and CSV files.
  • Query performance may depend on the size and number of uploaded documents.

Getting Started

  • Follow the instructions in the README to set up and run the application locally.

Future Improvements

  • Add support for more file types (e.g., Word, Excel).
  • Implement fine-tuning for domain-specific query optimization.
  • Enhance multi-language support for documents and queries.
  • Provide query result export options.

Special thanks to the open-source community for their tools and support!