G1 WEBSCRAPER

What it does?

A headline finder from the G1 site, basically takes as data the headline, link, date and respective image.

The software utilizes SQLAlchemy for database interaction and FastAPI for web framework. It scrapes news data from G1 website, stores it in a CSV file, and then checks if each news already exists in the database before sending the new data to the database.

$ git clone https://github.com/slocksert/g1_WebScrapper.git

$ docker compose up

$ python3 app/main.py