This project takes a list of URLs from filePaths.json and builds an inverted index with TF-IDF scores, cached page content, and metadata. The resulting data must be stored in a database, which is linked in credentials.py (MongoDB is recommended).
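
As a rough illustration of what an inverted index with TF-IDF scores looks like, here is a minimal sketch. It is not the project's actual code: the function names `tokenize` and `build_inverted_index`, the sample URLs, and the weighting (term frequency normalized by document length, multiplied by the natural log of inverse document frequency) are all assumptions made for the example. It also skips fetching pages and writing to the database, and works on an in-memory mapping of URL to page text.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(pages):
    """Map each term to {url: tf-idf score}.

    `pages` is a dict of URL -> page text (hypothetical input shape;
    the real project reads its URLs from filePaths.json).
    """
    # Count term occurrences per page.
    term_counts = {url: Counter(tokenize(text)) for url, text in pages.items()}

    # Document frequency: number of pages that contain each term.
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())

    n_docs = len(pages)
    index = defaultdict(dict)
    for url, counts in term_counts.items():
        total_terms = sum(counts.values())
        for term, count in counts.items():
            tf = count / total_terms                   # assumed TF variant
            idf = math.log(n_docs / doc_freq[term])    # assumed IDF variant
            index[term][url] = tf * idf
    return dict(index)


# Hypothetical sample data, for illustration only.
sample_pages = {
    "https://example.com/a": "cat dog",
    "https://example.com/b": "cat fish",
}
sample_index = build_inverted_index(sample_pages)
```

With this weighting, a term that appears on every page (here `cat`) gets a score of zero, while terms unique to one page score higher. Each entry of the resulting dictionary could then be stored as one document in the database.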

This was a collaborative project with equal contributions from Danielle, Gayatri, and Greta.