Firecast.ai - machine learning wildfire risk forecasting

The goal of this project is to build a machine learning model which can predict wildfire ignition risk in California from publicly available meteorology and fire activity data.

-- Project Status: [Active]

Project Description

Wildfires are common, destructive, and deadly natural disasters. Current meteorology-based wildfire risk prediction methods can be improved by:

Applying modern data pipeline automation and machine learning techniques
Using historical wildfire data for model training and validation

This project uses a parallel LSTM neural network to predict geospatially resolved wildfire ignition risk in California. The model was trained on a combined dataset produced from the USDA historical wildfire activity dataset (1) and meteorological data from NOAA's North American Regional Reanalysis (2). This project is currently in the deployment phase. Live prediction data will be available for 7 days into the future via API. For more background information, please see the full project proposal.
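To make the idea of a combined dataset concrete, here is a minimal sketch of one way fire records could be joined to gridded weather records: each ignition is snapped to a grid cell, and every weather record is labeled 1 if its cell had an ignition that day and 0 otherwise. This is not the project's training data pipeline. The regular 0.25 degree latitude/longitude grid, the function names, and the record layout are all assumptions made for the example (NARR itself is distributed on a Lambert conformal grid).

```python
import math
from datetime import date

# Assumption: a simple regular lat/lon grid, chosen only for illustration.
CELL_DEG = 0.25


def cell_index(lat, lon, cell_deg=CELL_DEG):
    """Return the (row, col) index of the grid cell containing a point."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))


def ignition_labels(fires):
    """Build a set of (day, cell) keys with at least one recorded ignition.

    `fires` is an iterable of (day, lat, lon) tuples (hypothetical layout).
    """
    return {(day, cell_index(lat, lon)) for day, lat, lon in fires}


def label_weather(weather_rows, labels):
    """Return copies of the weather rows with an added 0/1 'ignition' field.

    Each row is a dict with 'day', 'lat' and 'lon' keys plus any weather
    variables (hypothetical layout).
    """
    labeled = []
    for row in weather_rows:
        key = (row["day"], cell_index(row["lat"], row["lon"]))
        labeled.append({**row, "ignition": 1 if key in labels else 0})
    return labeled


fires = [(date(2015, 7, 1), 38.51, -122.48)]
weather = [
    {"day": date(2015, 7, 1), "lat": 38.60, "lon": -122.40, "temp_c": 35.0},
    {"day": date(2015, 7, 1), "lat": 34.00, "lon": -118.20, "temp_c": 29.0},
]
labeled = label_weather(weather, ignition_labels(fires))
```

In this toy input the first weather record falls in the same cell and day as the fire, so it is labeled 1; the second is labeled 0.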

Using this repository

First, clone the repo:

git clone https://github.com/gperdrizet/firecast.ai.git

Next, you have two options to install required packages:

A) Conda.

This will install a complete copy of the development environment, including all dependencies.

cd firecast.ai
conda env create -f environment.yml

B) Using pip and venv.

python3 -m venv firecast.ai
source firecast.ai/bin/activate
cd firecast.ai
pip install -r requirements.txt
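After either install route, a quick way to confirm that the environment is usable is to check that the main libraries can be found. The sketch below uses only the standard library. The package list is an illustrative subset of the libraries named under Technologies, and the import names (for example `sklearn` for scikit-learn) are assumptions about what the environment files install.

```python
import importlib.util

# Assumption: an illustrative subset of the project's dependencies,
# listed by import name rather than by distribution name.
PACKAGES = ["numpy", "pandas", "sklearn", "tensorflow", "xarray", "geopandas"]


def missing_packages(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(PACKAGES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages were found.")
```

Run it with the new environment active; any name it reports can then be checked against environment.yml or the pip requirements file.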

Due to size constraints, only the final training dataset and its derivatives are included in this repo. Raw and intermediate data files created by the training data pipeline are not hosted on GitHub, but can be found here. Note: total size on disk is 326G, ~2500 files.
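Before fetching the raw and intermediate files, it may be worth checking that the target drive has room for them. A minimal sketch using the standard library follows. The function name is hypothetical, and reading "326G" as 326 GiB is an assumption.

```python
import shutil

# Assumption: "326G" is interpreted as 326 GiB.
REQUIRED_BYTES = 326 * 1024**3


def has_room(path=".", required=REQUIRED_BYTES):
    """Return True if the filesystem holding `path` has enough free space."""
    return shutil.disk_usage(path).free >= required


if __name__ == "__main__":
    if has_room("."):
        print("Enough free space for the full dataset.")
    else:
        print("Not enough free space for the full dataset.")
```

Pass the directory you plan to download into as `path`; the check applies to whichever filesystem holds that directory.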

Featured notebooks

Data Sources

Historical wildfire activity: United States Department of Agriculture Research Data Archive, Spatial wildfire occurrence data for the United States, 1992-2015¹
Historical meteorology data: National Oceanic and Atmospheric Administration, North American Regional Reanalysis²

Methods Used

Machine Learning
Gradient boosted decision trees
Deep neural networks
Long short-term memory neural networks
Cartographic Projection
Time Series Analysis
Feature Engineering
Hyperparameter optimization
Metaparameter optimization
Gaussian process optimization
Box-Cox quantile normalization
Kolmogorov–Smirnov test
Recursive sample stratification
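Of the methods listed, the Kolmogorov–Smirnov statistic is compact enough to illustrate directly. How this project applies it is not described here; one common use, assumed for this example, is checking that a feature has a similar distribution in two data splits. The sketch below is a plain-Python two-sample version with a hypothetical function name. It returns only the statistic, not a p-value.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic.

    Returns the largest absolute gap between the two empirical CDFs.
    The gap can only change at observed values, so it is enough to
    evaluate both CDFs at every point in either sample.
    """
    a, b = sorted(sample_a), sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    gap = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)  # fraction of a that is <= x
        cdf_b = bisect_right(b, x) / len(b)  # fraction of b that is <= x
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


# Hypothetical feature values from two data splits.
train = [1, 2, 3, 4]
validation = [3, 4, 5, 6]
statistic = ks_statistic(train, validation)
```

A value near 0 means the two empirical distributions are close, and a value of 1 means the samples do not overlap at all.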

Technologies

Python
PySpark
Luigi
Flask
TensorFlow
Keras
scikit-learn
Pandas
NumPy
Shapely
GeoPandas
Xarray
Matplotlib
Seaborn

Contributing Members

Team Lead (Contact): George Perdrizet

References

1. Short, Karen C. 2017. Spatial wildfire occurrence data for the United States, 1992-2015 [FPA_FOD_20170508]. 4th Edition. Fort Collins, CO: Forest Service Research Data Archive. https://doi.org/10.2737/RDS-2013-0009.4
2. NCEP Reanalysis data provided by the NOAA/OAR/ESRL PSD, Boulder, Colorado, USA, from their Web site at https://www.esrl.noaa.gov/psd/
