Skip to content

Latest commit

 

History

History
39 lines (25 loc) · 1.41 KB

README.md

File metadata and controls

39 lines (25 loc) · 1.41 KB

Basic RAG implementation for Wordpress blogs

A basic implementation of RAG for any worpress blog. Leverages the Milvus vector DB. Options for either using OpenAI or Anthropic as your LLM and Sentence BERT for embeddings.

On first run all blog posts will be scrapped. Embeddings are based on sentence parsing of each post.

alt text

Running Locally

Make sure you have Python installed

$ git clone https://github.com/speechwrecko/wordpress-rag.git # or clone your own fork
$ cd wordpress-rag
$ pip install -r requirements.txt
$ python app.py

Your app should now be running on localhost:5000.

Required third party services

This project requires API access to several 3rd party services:

note: the current scraper assumes you are using the Diffbot free version which only allows for 5 API calls per minute. If you have a paid account you can remove the 12 second sleep that exists in the code to rate limit calls to Diffbot.

Credits

Initial basis for flask / bootstrap implementation

Peter Simeth's basic flask pretty youtube downloader (v1.3) https://github.com/petersimeth/basic-flask-template © MIT licensed, 2018-2023