A repository in which I scrape Indian Premier League (IPL) Hawkeye data to extract relevant ball by ball information.
Cricket, like many other sports, is a complex game where strategic decisions can heavily influence outcomes. Hopefully, data is being used in some capacity to drive these decisions. Unfortunately, access to such detailed data isn't always available to the general public, despite its tremendous potential to unearth new insights.
This project came about because I wanted to dig deeper into IPL matches, gathering ball-by-ball data for completed IPL matches. By scraping this data and storing it neatly in CSV files (can be found in the data directory), I hope to make this dataset accessible to others who share a passion for cricket analysis.
- Scrapes bbb data of completed IPL 2024 matches.
- Stores data in an organized CSV files that can be used in analysis.
Would like to thank Himanish Ganjoo for his support. The ball by ball dataset provided by him here was used in enhancing the quality of this dataset by providing information about certain attributes (extras, dismissals) that Hawkeye surprisingly didn't provide.
- Fix wrong player names? (For example, correct Ganesh to Ruturaj)
- Add info about which batter is run out?