Climate Inventory

This repository contains several scrapers that pull relevant data for the Green House Gas Emissions Inventory for the City of Long Beach. We also include the streamlit_app.py file that contains the streamlit app that allows the user to run the scrapers and download the data.

Scrapers

The scrapers are located in the src/extrations folder. There, there are several scrapers that pull data from different sources. The scrapers are:

California Air Resources Board (CARB)

We scrape the data from the following website. There, there are several excel files that contain emissions per facility. The code scrapes the excel that the user desires, and filters then by Long Beach. There is also the option to only get the dataframe with the corresponding urls.

EMFAC

The data from the EMFAC website is scraped, but this is done through an API call (note: this is not a documented API, and it was found by tracking the requests the website makes when a query is called).

This website contains the onroad and offroad emissions per vehicle type for the years and county selected. In the API call, the South LA County is selected, along with the selected year.

Environmental Protection Agency - FLIGHT App (EPAFlight)

The webscraped site is the EPAFlight Website. This website also shows emissions per facility for the selected city. As with the EMFAC code, the scraping is also done by calling their API and using the adequate payload (which includes the city and the year selected). By default, we select the Long Beach city, and the year is up to the user.

Environmental Protection Agency - Hub

The webscraped site is the EPAHub Website This website contains the emission factors for several categories, including electricity, vehicles and waste. The scraper is very similar to the CARB one, where the user selects a year, which is associated with an excel file that is downloaded and parsed.

There is also the option to only get the dataframe with the corresponding urls so the user can download the excel file themselves.

Google - Environmental Insights Explorer (GoogleEIE)

This scraper gets the Long Beach data from the Google EIE Website. This website contains the emissions per sector for Long Beach. The scraper is done by calling their API.

This was the hardest scraper since the API results are very inconsistent, and they do not have all of the information. For example, the CO2 emissions for the transportation vehicles were not available from the API response, but two 'factors' were available in the response. The code then calculates the CO2 emissions by using those two factors and the total distance.

US Census (USCensus)

This scraper gets the Long Beach house heating fuel consumption from the US Census Website.

The US Census API is queried to get the table from this data. The user has the option to choose the year and the type of estimations (5-year or 1-year).

How to run this thing?

Streamlit App

If you want to run the application locally, then you can do the following:

Run the streamlit application from your terminal:

python -m streamlit run streamlit_app.py

Or you can also do it from the pip environment:

python -m pipenv run streamlit run streamlit_app.py

Note that you may need to put the full path to the streamlit_app.py file in any of the above commands.

Scrapers

You can take a look at the streamlit_app.py file to see how the scrapers are called. The scrapers are called from the src/extractions folder and then ran with the scraper.run() method.

For example, to run the CARB scraper to get the 2019 emissions data, you can do the following:

# Import scraper
from extractions.CARB.CARBExtractor import CARBExtractor

# Instantiate scraper and run it
carb_scraper = CARBExtractor()
carb_scraper.run(year=2019, {'option': 'Emissions'})

Future Work

Some future work that could be done include:

Adding more scrapers to get more data
Adding a functionality to the streamlit app to merge as much data as possible into a single dataframe
Adding a functionality to the streamlit app to upload the data either to a database or a sharepoint folder.

Contact Information:

Federico Dominguez Molina, fd.molina@outlook.com

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
src		src
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Climate Inventory

Scrapers

How to run this thing?

Streamlit App

Scrapers

Future Work

About

Releases

Packages

Contributors 2

Languages

City-of-Long-Beach-Public/climate_inventory

Folders and files

Latest commit

History

Repository files navigation

Climate Inventory

Scrapers

How to run this thing?

Streamlit App

Scrapers

Future Work

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages