Welcome to the 545 Group Project repository! This repository is dedicated to the DevOps and Data Engineering aspects of our project. Here, you will find everything you need to set up, manage, and deploy our project infrastructure and data pipelines
This repository is a part of our group project for course 545. Our project involves building a data-driven application that provides insights from a data source. The DevOps and Data Engineering team is responsible for managing the infrastructure, setting up data pipelines, and ensuring smooth data processing.
This script main.yml sets up a Python environment in a Microsoft Server Farm and runs main.py
everyday day at midnight EST.
The backup of our database is run before new records are inserted at midnight EST every day. This is found in the file backup.csv.
The Python script that scrapes data from a website checks the data, and then inserts it into an Aiven database is in the file named main.py.
Meetings notes are contained in the file Meetings.md.
SQL to create our Feature_Store
table DDL_Feature_Store.sql.
Our example API is housed in the file API.py.
This is the requirements file for the main.py
script to run requirements.txt.
Dashboard file housed in Dash.pbix
This project is licensed under the MIT License. Feel free to use, modify, and distribute the code as per the terms of the license.
Thank you for your interest in our 545 Group Project DevOps and Data Engineering repository. If you have any questions or encounter issues, please don't hesitate to reach out to the project maintainers.
Happy coding! 🚀