-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Transportation Year 2 Updates #35
base: dsgrid-project-IEF-Phase2-2025
Are you sure you want to change the base?
Transportation Year 2 Updates #35
Conversation
Tagging @ahcyip @elainethale @daniel-thom for any feedback. Sorry if I've totally butchered the existing schema. |
I think the idea is that the data on our side should be as raw and detailed as possible and the config should be modified to handle the raw output.
@bborlaug should make the call on whether our data should retain BEV separate from PHEV. On one hand, the data will be very different (magnitude and timing and locations of load per BEV vs. load per PHEV are very different etc.), but on the other, if our pipeline is already aggregating, I don't know if we will be checking the results with distinct BEV and PHEVs or doing any analysis that uses the distinction, so we may not need to go "backwards" and keep the distinction for dsgrid. |
Regarding date format, you may have to change from 0-167 to P.S. @daniel-thom helped me with the dsgrid software last time, but @nreinicke is a software pro, so Nick could probably handle everything discussed in https://dsgrid.github.io/dsgrid/tutorials/create_and_submit_dataset.html (if it is up to date). Also, @nreinicke sorry I forgot to pass this dsgrid documentation to you earlier - this may have covered a lot of what we chatted about. |
The data tables are already in a very good format for dsgrid. Here are the minor changes that need to be made:
I'd be happy to help with the post-processing to convert to a dsgrid format. This would be a simple Spark query. |
I'm very new to this data format and so starting this as a draft to get feedback. I've attempted to update the schema to accommodate the new load data from the evi-grid-national-framework. The new raw data is hourly load profiles spanning once week for each month of the year. We've run these for four scenarios:
The raw data is parquet format on kestrel at (
/projects/evix/evbps/grid-team-deliverables/2025-01-24-IEF/aggregations/load-profiles/month-week-hour
). Here's a sample of what the data looks like:Some questions:
electricity_ev_ldv_work_l2
. Should we run our data through a script that does the mapping from our own format into that format or should that be something we modify in the config so dsgrid can handle that?ldv-car
. Do we need to retain the distinction between BEV and PHEV for this analysis?