Repository containing files related to MLB data pipeline project
- Statcast_scraper.ipynb : Jupyter notebook with POC to use url with pandas.read_csv to import data from baseballsavant.com
- MLB_2018_Data_Download : Using statcast_scraper.ipynb as a base, this jupyter notebook uses a 2018 schedule csv file that contains the dates of games played in 2018. Then concatenates the url string with specific dates to import data. Next steps will be to look through schedule csv to download all of 2018 data.
- CLean Project Code : clean version of code to be implemented. Currently consisted of function to download data and a basic pitch visualization.
- MLB_CONFIG : contains a stadiums dictionary and baseball savant null values dictionary to be passed to read_csv in data download function.