This study was first submitted to ICLR 2025 and was rejected; its OpenReview record is at https://openreview.net/forum?id=sz7HdeVVHo
After revising the paper based on the ICLR reviewers' advice and extending it with more scientific insights, we are submitting a new version (preprinted on arXiv) to a journal. This repository is provided for reproducing the experiments and reusing the proposed methods.
Informative representations enhance model performance and generalisability in downstream tasks. However, learning self-supervised representations for spatially characterised time series, such as traffic interactions, poses challenges because it requires maintaining fine-grained similarity relations in the latent space. In this study, we incorporate two structure-preserving regularisers for the contrastive learning of spatial time series: one regulariser preserves the topology of similarities between instances, and the other preserves the graph geometry of similarities across spatial and temporal dimensions. To balance contrastive learning and structure preservation, we propose a dynamic mechanism that adaptively weighs the trade-off and stabilises training. We conduct experiments on multivariate time series classification, as well as macroscopic and microscopic traffic prediction. For all three tasks, our approach preserves the structures of similarity relations more effectively and improves on state-of-the-art task performance. The proposed approach can be applied to an arbitrary encoder and is particularly beneficial for time series with spatial or geographical features. Furthermore, this study suggests that stronger preservation of the similarity structure indicates more informative and useful representations. This may help to understand the contribution of representation learning in pattern recognition with neural networks.
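To illustrate the general idea of similarity-structure preservation, the sketch below penalises the discrepancy between normalised pairwise distances in the input space and in the latent space. This is a minimal, hypothetical example only; it is not the paper's actual topology- or graph-geometry-preserving regulariser, and the function names are made up for illustration.

```python
import numpy as np

def pairwise_distances(x):
    # Euclidean distance matrix between the rows of x
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def structure_preserving_penalty(inputs, latents):
    """Generic sketch (NOT the paper's exact regulariser): compare the
    max-normalised pairwise distance matrices of inputs and latents.
    A penalty of 0 means the relative similarity structure is fully kept."""
    d_in = pairwise_distances(inputs.reshape(len(inputs), -1))
    d_lat = pairwise_distances(latents)
    d_in = d_in / (d_in.max() + 1e-12)
    d_lat = d_lat / (d_lat.max() + 1e-12)
    return float(((d_in - d_lat) ** 2).mean())
```

In the proposed method, such a structure-preservation term is combined with the contrastive loss, with the trade-off between the two weighted adaptively during training.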
torch
numba
numpy
scipy
pandas
tqdm
scikit-learn
tslearn
h5py
pytables
zarr
scikit-image
For an encapsulated environment, you may create a virtual environment with Python 3.12.4 and install the dependencies by running the following command:
pip install -r requirements.txt
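For example, on Linux/macOS the environment can be set up as follows (assuming a `python3.12` interpreter is on your PATH; the environment name `.venv` is arbitrary):

```shell
# Create and activate an isolated environment, then install the dependencies
python3.12 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```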
- **Step 0**: Ensure that all dependencies are installed as described above, and put the datasets in the `datasets` directory. The UEA archive datasets are available at https://www.timeseriesclassification.com/dataset.php; the MacroTraffic and MicroTraffic datasets will be provided upon request via email or GitHub Issues.
- **Step 1**: Test the environment by running `environment_test.py`. This script checks the imports and random seeds that will be used to repeat the experiments.
- **Step 2**: Precompute the distance matrices for the UEA datasets using `precompute_distmat.py`. The computed distance matrices are saved in the folders containing the corresponding data.
- **Step 3**: Grid-search the hyperparameters using `ssrl_paramsearch.py`. The hyperparameters are saved in `results/hyper_parameters/`, and we make them openly available for convenience*.
- **Step 4**: Train the various encoders using `ssrl_train.py`. The trained encoders are saved in the `results/pretrain` directory and are also made openly available*.
- **Step 5**: Apply the trained models to the downstream tasks using `tasks/uea_classification.py`, `tasks/macro_progress.py`, and `tasks/micro_prediction.py` for UEA classification, MacroTraffic traffic flow prediction, and MicroTraffic trajectory prediction, respectively. The fully trained models are saved in the `results/finetune` directory; the evaluation results are saved in the `results/evaluation` directory. We also make the models and evaluation results openly available*.
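Assuming the scripts run with their default settings and require no extra flags (check each script's argument parser before running), the pipeline above can be sketched as:

```shell
# Steps 1-5 in order; each step builds on the outputs of the previous ones
python environment_test.py
python precompute_distmat.py
python ssrl_paramsearch.py
python ssrl_train.py
python tasks/uea_classification.py
python tasks/macro_progress.py
python tasks/micro_prediction.py
```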
*Note: The resulting data are too large (21.2 GB) to be provided in this repository. You are welcome to download them from the following link: https://surfdrive.surf.nl/files/index.php/s/2wNdn6MxIAndxrs
To analyse and visualise the results, use `figures/visual.ipynb`. The notebook generates tables and plots for the evaluation results and saves them in the `results/figures` directory.
@article{jiao2025structure,
title = {Structure-preserving contrastive learning for spatial time series},
author = {Yiru Jiao and Sander {van Cranenburgh} and Simeon C. Calvert and Hans {van Lint}},
year = {2025},
journal = {arXiv preprint},
pages = {arXiv:2502.06380}
}
Thanks to GitHub for providing an open environment. This work reuses, learns from, and adapts the following repositories to different extents:
- TS2Vec https://github.com/zhihanyue/ts2vec
- SoftCLT https://github.com/seunghan96/softclt
- TopoAE https://github.com/BorgwardtLab/topological-autoencoders
- GGAE https://github.com/JungbinLim/GGAE-public
- TAM https://github.com/dmfolgado/tam/
We thank the authors for their contributions to open science.