This is our regular quarterly release for 2025Q1. It includes updates to all the datasets that are published with quarterly or higher frequency, plus initial versions of a few new data sources that have been in the works for a while.
One major change this quarter is that we are now publishing all processed PUDL data as Apache Parquet files, alongside our existing SQLite databases. See Data Access for more on how to access these outputs.
Some potentially breaking changes to be aware of:
- In the EIA Form 930 – Hourly and Daily Balancing Authority Operations Report, a number of new energy sources have been added, and some old energy sources have been split into more granular categories. See Changes in energy source granularity over time.
- We are now running the EPA’s CAMD to EIA unit crosswalk code for each individual year starting from 2018, rather than just 2018 and 2021, resulting in more connections between these two datasets and changes to some sub-plant IDs.
Many thanks to the organizations that make these regular updates possible, especially GridLab, RMI, and the ZERO Lab at Princeton University. If you rely on PUDL and would like to help ensure that the data keeps flowing, please consider joining them as a PUDL Sustainer, as we are still fundraising for 2025.
See the v2025.2.0 release notes in our documentation for more details.
What's Changed
New & Updated Data
- Integrate EIA 861 2023 final release data by @e-belfer in #3911
- Integrate EIA 930 data up through October by @e-belfer in #3946
- Add Q3 2024 CEMS data to ETL by @e-belfer in #3948
- EIA bulk elec q3 2024 integration by @aesharpe in #3951
- Eia860m q3 24 by @aesharpe in #3949
- Eia923 q3 24 by @aesharpe in #3950
- Eia176 wide table by @davidmudrauskas in #3590
- EIA 176 wide tables - follow-up fixes by @jdangerx in #3978
- Add EIA 176 2023 data by @e-belfer in #4004
- add censusfips metadata by @cmgosnell in #4006
- Enable sec10k distribution by @zschira in #4026
- Integrate EIA 860 multifuel table by @jmelot in #3988
- Add Q4 2024 CEMS data to ETL by @cmgosnell in #4052
- Update Q1 2025 EIA bulk electricity data by @e-belfer in #4051
- Eia860m eia923 q1 2025 by @aesharpe in #4047
Other Changes
- Post v2024.10.0 release errata cleanup by @zaneselvans in #3917
- Add some links to the VCE RARE data source page. by @zaneselvans in #3921
- Close out PUDL v2024.11.0 release notes by @zaneselvans in #3961
- Reduce peak memory use of VCE RARE assets by @zschira in #3959
- Clean up some nightly build infrastructure cruft by @zaneselvans in #3962
- Fix nested list formatting in CONTRIBUTING by @crd477 in #3974
- Updates to splink FERC to EIA record linkage notebook by @katie-lamb in #3976
- Correct flipped starting/ending balance column rename. by @zaneselvans in #3979
- Add FERC1 validation tests by @jdangerx in #3860
- Fix pudl.settings class doc strings by @bendnorman in #3986
- Add parquet download link to data dictionary by @bendnorman in #3984
- Refactor entity matching name cleaner to be more efficient by @katie-lamb in #3953
- Fix row count asset check for vcerare by @zschira in #3993
- Make asset checks run in integration test by @jdangerx in #3990
- Make the reconstructable job not take any arguments at all. by @jdangerx in #3996
- Add NREL GHA runner WIF setup by @jdangerx in #3952
- Add boolean column naming standard to naming_conventions doc by @aesharpe in #3998
- Make it possible to pass another sources dict to DataSource by @e-belfer in #4003
- Finish unittest -> pytest conversion by @jdangerx in #4014
- Pass sources to DataSource.from_id() by @e-belfer in #4013
- Acknowledge PUDL Sustainers by @zaneselvans in #4025
- Sec distribution fix by @zschira in #4031
- Add sec10k metadata directly in PUDL by @zschira in #4035
- Update PUDL bibliography by @e-belfer in #4049
- Change SEC 10-K table schemas to fix FK errors and use quarterly naming. by @zaneselvans in #4046
- fix: Normalize VCE RARE spellings of great lakes. by @krivard in #4029
- Update Q1 2025 EIA 930 data by @krivard in #4054
- Update EIA-EPA crosswalk to include multiple years of crosswalk data by @e-belfer in #4056
- Publish yearly detailed FERC1 accounting tables by @cmgosnell in #4057
- Close out release notes for PUDL v2025.2.0 by @zaneselvans in #4062
New Contributors
- @crd477 made their first contribution in #3974
- @jmelot made their first contribution in #3988
- @krivard made their first contribution in #4029
Full Changelog: v2024.10.0...v2025.2.0