This repository contains code that utilizes Selenium and ChromeDriver to scrape text data of short blogs with their headings from the Cricinfo website. It provides a simple and efficient way to extract specific information from Cricinfo and store it in a structured format.
To run this code, you will need the following:
-
Install Selenium using the following command:
pip install selenium
-
Download the appropriate version of ChromeDriver for your system and place it in the project directory. Make sure the ChromeDriver version is compatible with your installed Chrome browser.
-
Install Pandas using the following command:
pip install pandas
-
Clone or download this repository to your local machine.
-
Update the
PATH
variable in the code with the path to your ChromeDriver executable. -
Run the script
cricinfo.py
to initiate the web scraping process. -
The extracted data will be saved as a CSV file named
cricinfo.csv
in the project directory.
Special thanks to the developers of Selenium, ChromeDriver, and Pandas for their invaluable contributions.
Happy web scraping!