This workshop covers how to extract data from the web using Python, through both APIs and web scraping. Topics include:
- How the web works
- Accessing databases via RESTful APIs
- HTML / CSS
- Manipulating a webpage with Chrome DevTools
- Web scraping with Beautiful Soup
- Scraping JavaScript-heavy and interactive sites with Selenium
This workshop uses the Python programming language. See the software requirements here.
We assume a basic knowledge of Python. Completing the D-Lab's Python Intensive should be sufficient preparation.
Please note that the materials are still in development and are subject to change.
Rochelle Terman: rterman@gmail.com
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.