This project involves scraping data from the Redbus website using Selenium, storing the data in a MySQL database, and creating a Streamlit app for data filtration.
- Selenium: Used for web scraping to extract data from the Redbus website.
- MySQL: Database to store the scraped data.
- Streamlit: Web application framework to create an interactive data filtration interface.
- Python 3.x
- Selenium
- MySQL
- Streamlit
- Pandas
- Numpy
-
Clone the repository:
git clone https://github.com/yasararafath-s/Redbus_website_scraping_project.git cd Redbus_website_scraping_project
-
Install dependencies:
pip install -r requirements.txt
-
Configure MySQL:
- Create a database and update the connection details in the script.
-
Run the scraper:
redbus_scraping-sql.ipynb
-
Start the Streamlit app:
streamlit run redbus-data_app.py
- Run the scraper to collect data from the Redbus website.
- Use the Streamlit app to filter and visualize the data.