This repository was archived by the owner on Sep 11, 2023. It is now read-only.
Stuff that needs to be finished before we can create a new pre-prepared dataset #393
Closed
31 of 34 tasks
Labels: enhancement (New feature or request)
Comments
I like these summary issues you do. It's nice to get a good overview of things.
Thanks! (Although I was also wondering if this list of issues could actually be done using a GitHub "milestone"... but really not sure what's best!)
I've added #454 to this list.
I've added #483.
I'd be tempted to say this is done? This milestone of getting a 'prepared ML dataset' seems complete. There are a few nice-to-haves left over.
I agree! Great work everyone! Let's mark this as "closed" now. Woo!
Issues that probably need to be done before we can start training ML models "properly":
NWPDataSource with new NWP variable names #371
float16! Converting to float16 results in infinities in dswrf #370
Manager._get_first_batches_to_create() should set first batch to 0 if no batches are on disk #469
prepare_ml_data.py this weekend #471

And some issues that aren't absolutely essential but are pretty important:
prepare_ml_data.py should save separate logs to OutputData.filepath/<SplitName>/<DataSource> #156
Manager use multiprocessing.Pool not ProcessPoolExecutor #325
prepare_ml_data.py #341
hcct channel in it nwp#6
leonardo claim to only have 94.3 GB RAM installed? #486

Nice to have:
start_date and end_date setting in the config YAML #425
prepare_ml_data.py? #317
nowcasting_dataset outputs? #227
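For anyone landing here from #370: the float16 item above can be reproduced in isolation. This is only a minimal sketch of the general NumPy behaviour, not the code from this repository. The largest finite float16 value is 65504, so anything sufficiently larger becomes infinity on conversion. The array values and the helper name `cast_to_float16_checked` are made up for illustration. The issue list does not say which dswrf values triggered the problem.

```python
import numpy as np

# Largest finite value that float16 can represent (65504.0).
FLOAT16_MAX = float(np.finfo(np.float16).max)


def cast_to_float16_checked(values) -> np.ndarray:
    """Cast to float16, raising if a finite input turns into +/-inf.

    Hypothetical helper for illustration; not part of nowcasting_dataset.
    """
    values = np.asarray(values)
    # Suppress NumPy's overflow warning; we detect the overflow ourselves.
    with np.errstate(over="ignore"):
        converted = values.astype(np.float16)
    # An overflow is an element that was finite before the cast and infinite after.
    overflowed = np.isinf(converted) & np.isfinite(values)
    if overflowed.any():
        raise ValueError(
            f"{int(overflowed.sum())} value(s) overflowed float16 "
            f"(max finite value is {FLOAT16_MAX})"
        )
    return converted


# Made-up values, not real dswrf data: the last one is outside the float16 range.
example = np.array([120.0, 65504.0, 70000.0], dtype=np.float32)

# A plain cast silently produces infinity for the out-of-range value.
with np.errstate(over="ignore"):
    naive = example.astype(np.float16)
```

A check like this could sit wherever data is down-cast before saving, so that an overflow fails loudly instead of ending up as inf in a prepared batch. Whether to rescale the variable or keep it as float32 instead is a separate decision.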