Memorization-Curvature (ICML 2024 Spotlight)

This code is the official implementation for the paper, Memorization Through the Lens of Curvature of Loss Function Around Samples. This code contains the experiments described in the paper.

Accepted at ICML 2024 Spotlight (3.5% Acceptance), if you use this github repo consider citing our work

@inproceedings{
    garg2024memorization,
    title={Memorization Through the Lens of Curvature of Loss Function Around Samples},
    author={Garg, Isha and Ravikumar, Deepak and Roy, Kaushik},
    booktitle={Forty-first International Conference on Machine Learning},
    year={2024},
    url={https://openreview.net/forum?id=WQbDS9RydY}

}

Full paper available paper link. You may also be interested in our work (ICML '24) on Privacy Memorization and Curvature paper, code.

Setup

Create a folder ./pretrained/<dataset name> and ./pretrained/<dataset name>/temp i.e.

mkdir pretrained
mkdir pretrained/imagenet
mkdir pretrained/imagenet/temp

Update the paths for the datasets in ./utils/load_dataset.py, replace the string 'Set path'

Code Flow

Train model using train_<dataset>.py
- Trains the models and saves checkpoint every epoch for analysis in step 2.
Run score_<dataset>_checkpoint.py
- Computes the input loss curvature for the model for each epoch and saves in ./curv_scores
Use the analyze_<dataset>_curv.ipynb for analysis

Training

To train ImageNet model use train_imagenet.py. This code is made to be trained in a distributed manner and makes use of torch run. To train on 1 node and 1 GPU use the following command

OMP_NUM_THREADS=12 torchrun \
--nproc_per_node=1 \
--nnodes=1 \
--node_rank=0 \
--rdzv_id=456 \
--rdzv_backend=c10d \
--rdzv_endpoint=<ip-address-of-machine>:26903 \ 
train_imagenet.py

For multi-gpu single node use

OMP_NUM_THREADS=12 torchrun \
--nproc_per_node=3 \
--nnodes=1 \
--node_rank=0 \
--rdzv_id=456 \
--rdzv_backend=c10d \
--rdzv_endpoint=<ip-address-of-machine>:26903 \ 
train_imagenet.py

This uses 3 GPUs on a single node Output of each node is logged into ./logs folder with a corresponding log file.

Visualizations

See visualizations for additional results and higher quality images from the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
analysis_checkpoints		analysis_checkpoints
images		images
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analyze_imagenet_curv.ipynb		analyze_imagenet_curv.ipynb
score_imagenet_checkpoint.py		score_imagenet_checkpoint.py
train_imagenet.py		train_imagenet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Memorization-Curvature (ICML 2024 Spotlight)

Setup

Code Flow

Training

Visualizations

About

Releases

Packages

Languages

License

purdue-nrl/Memorization-Curvature

Folders and files

Latest commit

History

Repository files navigation

Memorization-Curvature (ICML 2024 Spotlight)

Setup

Code Flow

Training

Visualizations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages