Full test set (including all GSPs for all datetimes) #491

peterdudfield · 2021-11-22T13:21:09Z

In order to compare with ESO - do we need a test set, that is ALL GSPs for all datetimes?

Detailed Description

To compare against ESO we might need predictions from models for all gsps and all datetimes. This would be roughly
338 * (8 * 31 * 24 * 12) = 24 million predictions

Context

We want to compare our models to the ESO ones

Possible Implementation

make batches for all timesstamps.
just make 10,000 and comare ESO against them?

peterdudfield · 2021-11-22T13:21:23Z

@JackKelly @jacobbieker thoughts?

JackKelly · 2021-11-22T13:28:13Z

Ah, yes, very good point!

To do an exhaustive comparison against ESO's GSP-level PV forecasts, you're right that we should really run inference for every timestep of the test period (basically most of 2021). And, as you say, nowcasting_dataset doesn't currently do that: It currently produces batches of random examples (and doesn't guarantee full coverage over the test period).

But, during ML model training, we also want a smallish validation set, so we can look for overfitting during training.

It shouldn't be too hard to get Manager to produce a "full test set".

But let's set this as a lower priority that some of the other issues (like fixing NaNs) because, if worse comes to worst, we can still evaluate against ESO's forecasts, even with our current random test set

JackKelly · 2021-11-22T13:28:50Z

I've added this to the "nice to have" list of #393 but feel free to edit!

jacobbieker · 2021-11-22T13:34:49Z

Yeah, I agree with @JackKelly, once we have other stuff finished, then make the whole test set, so we know the data is looking good!

JackKelly · 2021-12-03T07:53:18Z

As Peter pointed out in openclimatefix/nowcasting_utils#72 we need a test set that includes all GSPs for all datetimes in order to compute national PV

peterdudfield · 2021-12-03T11:11:15Z

#515 might be good also only to do up to 2021-09-01 for the moment too

peterdudfield · 2021-12-06T09:26:01Z

Perhaps a different method would be to make sure the test set has all GSP's but doenst cover all datetimes.

Then we could do say 1000 datetimes. Total would be
1000 * 338 / 32 = ~ 10,000 batches

This seems a better way to start than the ~ 1 million batches for all datetimes

peterdudfield added the enhancement New feature or request label Nov 22, 2021

JackKelly added this to Nowcasting Nov 22, 2021

JackKelly moved this to Todo in Nowcasting Nov 22, 2021

JackKelly mentioned this issue Nov 22, 2021

Stuff that needs to be finished before we can create a new pre-prepared dataset #393

Closed

34 tasks

JackKelly changed the title ~~Full test set~~ Full test set (including all GSPs for all datetimes) Dec 3, 2021

peterdudfield added this to the v16 dataset milestone Dec 3, 2021

peterdudfield self-assigned this Dec 6, 2021

peterdudfield linked a pull request Dec 6, 2021 that will close this issue

Issue/393 test all gsp #527

Merged

7 tasks

peterdudfield mentioned this issue Dec 6, 2021

Issue/393 test all gsp #527

Merged

7 tasks

peterdudfield closed this as completed in #527 Dec 8, 2021

Repository owner moved this from Todo to Done in Nowcasting Dec 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Full test set (including all GSPs for all datetimes) #491

Full test set (including all GSPs for all datetimes) #491

peterdudfield commented Nov 22, 2021 •

edited

Loading

peterdudfield commented Nov 22, 2021

JackKelly commented Nov 22, 2021

JackKelly commented Nov 22, 2021 •

edited

Loading

jacobbieker commented Nov 22, 2021

JackKelly commented Dec 3, 2021

peterdudfield commented Dec 3, 2021

peterdudfield commented Dec 6, 2021

Full test set (including all GSPs for all datetimes) #491

Full test set (including all GSPs for all datetimes) #491

Comments

peterdudfield commented Nov 22, 2021 • edited Loading

Detailed Description

Context

Possible Implementation

peterdudfield commented Nov 22, 2021

JackKelly commented Nov 22, 2021

JackKelly commented Nov 22, 2021 • edited Loading

jacobbieker commented Nov 22, 2021

JackKelly commented Dec 3, 2021

peterdudfield commented Dec 3, 2021

peterdudfield commented Dec 6, 2021

peterdudfield commented Nov 22, 2021 •

edited

Loading

JackKelly commented Nov 22, 2021 •

edited

Loading