This repository was archived by the owner on Sep 11, 2023. It is now read-only.
overview](https://github.com/openclimatefix/nowcasting) of the Open
+Climate Fix solar PV nowcasting project, and how our code repositories
+fit together.

-##Installation
+# Installation

-### `conda`
+##Conda

 From within the cloned `nowcasting_dataset` directory:

@@ -35,129 +50,32 @@ and [pytorch_lightning](https://github.com/PyTorchLightning/pytorch-lightning) u
 ```shell
 pip install -e .[torch]
 ```
-but it is only used to create a dataloader for machine learning models, and will not be necessary
-soon (when the dataloader is moved to `nowcasting_dataloader`).
+but it is only used to create a dataloader for machine learning models.

-
-### `pip`
+## Pip

 A (probably older) version is also available through `pip install nowcasting-dataset`

-
-### `RuntimeError: unable to open shared memory object`
+## `RuntimeError: unable to open shared memory object`

 To prevent PyTorch failing with an error like `RuntimeError: unable to open shared memory object </torch_2276740_2849291446> in read-write mode`, edit `/etc/security/limits.conf` as root and add this line: `* soft nofile 512000` then log out and log back in again (see [this issue](https://github.com/openclimatefix/nowcasting_dataset/issues/158) for more details).
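
After logging back in, it may help to confirm that the new open-file limit actually applies to the Python process. The following is a minimal sketch, not part of `nowcasting_dataset`: the helper names `nofile_limits` and `soft_limit_is_sufficient` are hypothetical, the threshold simply mirrors the `nofile` value suggested above, and the standard-library `resource` module is only available on Unix-like systems.

```python
import resource  # Unix-only standard-library module

# Assumption: mirrors the `* soft nofile 512000` line suggested in the text.
SUGGESTED_SOFT_LIMIT = 512_000


def nofile_limits():
    """Return the (soft, hard) limits on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def soft_limit_is_sufficient(soft, wanted=SUGGESTED_SOFT_LIMIT):
    """Return True if the soft limit is unlimited or at least `wanted`."""
    return soft == resource.RLIM_INFINITY or soft >= wanted


if __name__ == "__main__":
    soft, hard = nofile_limits()
    print(f"soft={soft} hard={hard} sufficient={soft_limit_is_sufficient(soft)}")
```

If the reported soft limit is still low, the edit to `/etc/security/limits.conf` has probably not taken effect for the current login session.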

-
-### PV Live API
+## PV Live API
 If you want to also install [PVLive](https://github.com/SheffieldSolar/PV_Live-API) then use `pip install git+https://github.com/SheffieldSolar/PV_Live-API
 `

-###Pre-commit
+## Pre-commit

 A pre commit hook has been installed which makes `black` run with every commit. You need to install
 `black` and `pre-commit` (these will be installed by `conda` or `pip` when installing
 `nowcasting_dataset`) and run `pre-commit install` in this repo.

-
-## Testing
+# Testing

 To test using the small amount of data stored in this repo: `py.test -s`

 To test using the full dataset on Google Cloud, add the `--use_cloud_data` switch.
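
For readers unfamiliar with custom pytest switches, here is a rough sketch of how a flag such as `--use_cloud_data` is typically registered in a `conftest.py`. It is an illustration under assumptions, not the repository's actual code: the help text and the `use_cloud_data` helper are hypothetical. Only `pytest_addoption`, `parser.addoption`, and `config.getoption` are standard pytest interfaces.

```python
# conftest.py (hypothetical sketch; the repository's real conftest may differ)


def pytest_addoption(parser):
    """Register the --use_cloud_data command-line switch with pytest."""
    parser.addoption(
        "--use_cloud_data",
        action="store_true",
        default=False,
        help="Run tests against the full dataset on Google Cloud.",
    )


def use_cloud_data(config):
    """Return True if --use_cloud_data was passed on the command line."""
    return bool(config.getoption("--use_cloud_data"))
```

With a hook like this in place, `py.test -s --use_cloud_data` sets the option to `True`, and tests can read it through `request.config`.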

+# Documentation

-## Downloading data
-
-### Satellite data
-
-Use [Satip](https://github.com/openclimatefix/Satip) to download
-native EUMETSAT SEVIRI RSS data from EUMETSAT's API and then convert
-to an intermediate file format.
-
-
-### PV data from PVOutput.org
-
-Download PV timeseries data from PVOutput.org using
Please see the `data_sources/<modality>/<modality>_model.py` files
-(where `<modality>` is one of {datetime, metadata, gsp, nwp, pv,
-satellite, sun, topographic}) for documentation about the different
-data fields in each example / batch.
-
-
-# History of nowcasting_dataset
-When we first started writing `nowcasting_dataset`, our intention was
-to load and align data from these three datasets on-the-fly during ML
-training. But it just isn't quite fast enough to keep a modern GPU constantly fed
-with data when loading multiple satellite channels and multiple NWP
-parameters. So, now, this code is used to pre-prepare thousands of
-batches, and save these batches to disk, each as a separate NetCDF
-file. These files can then be loaded super-quickly at training time.
-The end result is a 12x speedup in training.
+Please see the [`Example` class](https://github.com/openclimatefix/nowcasting_dataset/blob/main/nowcasting_dataset/dataset/example.py) for documentation about the different data fields in each example / batch.