feat(Germany): Add Germany solar generation data pipeline #134
Open
Sharkyii wants to merge 2 commits into openclimatefix:main from
Adds a complete end-to-end pipeline for training solar PV forecasting models for Germany, using GFS weather data (NOAA) and PV generation data from the SMARD API.
What's included
Data format
Check out the README.
Next Steps
Train Model: still in development
Pull Request
Description
Adds a complete end-to-end pipeline for Germany solar PV forecasting, including data download scripts for SMARD PV generation and GFS weather data, a processing/validation pipeline, and a baseline model training script, all configurable via YAML configs.
Fixes #121
How Has This Been Tested?
All scripts were validated end-to-end against real data:
PV data: 34,944 time steps, 1 GSP region (Germany 2021)
GFS data: 28 init times, 17 forecast steps, 33×41 spatial grid
Verified data loading, validation (negative value checks, NaN percentage), temporal alignment, and normalization constant calculation
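For reviewers who want a feel for the checks listed above, here is a minimal sketch of what a negative-value check, a NaN-percentage check, and a normalization constant calculation might look like. It is not the code in this PR: the function name `validate_pv`, the `max_nan_pct` threshold, and the sample values are all hypothetical, and only the Python standard library is used so it runs without the project's dependencies.

```python
import math
from statistics import fmean, pstdev


def validate_pv(values, max_nan_pct=5.0):
    """Hypothetical validation helper, not the PR's implementation.

    Reports the NaN percentage, the number of negative readings, and
    mean/std normalization constants computed over the non-NaN values.
    """
    n = len(values)
    if n == 0:
        raise ValueError("empty series")

    # Drop NaNs before counting negatives or computing statistics.
    valid = [v for v in values if not math.isnan(v)]
    nan_pct = 100.0 * (n - len(valid)) / n
    n_negative = sum(1 for v in valid if v < 0)

    report = {
        "n_steps": n,
        "nan_pct": nan_pct,
        "n_negative": n_negative,
        # Population standard deviation; the real pipeline may differ.
        "mean": fmean(valid) if valid else math.nan,
        "std": pstdev(valid) if valid else math.nan,
    }
    # The 5% NaN threshold is an assumed default, not taken from the PR.
    report["ok"] = n_negative == 0 and nan_pct <= max_nan_pct
    return report


# Made-up sample: one NaN and one negative reading out of five steps.
sample = [0.0, 1.5, float("nan"), 3.0, -0.2]
report = validate_pv(sample)
print(report)
```

On the real data, a check like this would be run once over the 34,944 PV time steps before temporal alignment with the GFS init times.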
If your changes affect data processing, have you plotted any changes? i.e. have you done a quick sanity check?
Checklist: