A Few Large Shifts: Layer-Inconsistency Based Minimal Overhead Adversarial Example Detection
Sanggeon Yun, Ryozo Masukawa, Hyunwoo Oh, Nathaniel D. Bastian, Mohsen Imani
arXiv:2505.12586
Figure 1. Illustration of the A Few Large Shifts (AFLS) assumption.
AFLS is a lightweight, plug-in detection framework that leverages the internal layer-wise inconsistencies of a frozen classifier to detect adversarial examples. It is self-sufficient (requires no adversarial examples), model-local (no external pre-trained models such as SSL encoders), and low-overhead (no complex structures to maintain, such as kNN graphs over reference sets, and no excessive augmentations), making it well suited to scalable deployment.
Figure 2. Recovery Testing (RT) and Logit‑layer Testing (LT) are fused into RLT via quantile normalisation.
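RT and LT each produce a scalar detection score, and quantile normalization maps both onto a common [0, 1] scale before they are combined. A minimal sketch of that idea follows; the function names and the simple summation fusion rule are illustrative assumptions, not the repository's exact implementation:

```python
import numpy as np

def quantile_normalize(scores, reference):
    """Map each score to its empirical quantile within a reference score set."""
    reference = np.sort(reference)
    # Fraction of reference scores <= each score, giving values in [0, 1].
    return np.searchsorted(reference, scores, side="right") / len(reference)

def fuse_rlt(rt_scores, lt_scores, rt_reference, lt_reference):
    """Fuse RT and LT scores into a single RLT detection score.

    Quantile normalization puts the two score distributions on a common
    scale; summing the normalized scores is one simple fusion choice
    (an assumption here, not necessarily the paper's exact rule).
    """
    rt_q = quantile_normalize(rt_scores, rt_reference)
    lt_q = quantile_normalize(lt_scores, lt_reference)
    return rt_q + lt_q
```

Because both normalized scores live in [0, 1], neither test dominates the fused score purely due to differing raw-score ranges.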
├── src
│ ├── backbone_trained
│ │ └── ResNet110_CIFAR10.pt # Pre-trained ResNet-110 on CIFAR-10
│ ├── iGAT # AutoAttack (iGAT) implementation
│ ├── attacks.py # Wrappers for FGSM, PGD, AutoAttack, CW, Square
│ ├── backbone.py # Dynamic backbone loader (timm, TorchHub, custom)
│ ├── config.py # MODEL_SETTINGS, BACKBONES, DATASETS, ATTACKS
│ ├── data.py # Dataset loaders & transforms
│ ├── model.py # ModelWrapper (hooks) & Tester (recovery & masks)
│ ├── resnet.py # ResNet-110 implementation
│ ├── train.py # Training, evaluation, robustness & detection loop
│ └── utils.py # Utilities (e.g., compute_auc)
├── requirements.txt # Python dependencies
└── README.md # You are here
git clone https://github.com/c0510gy/AFLS-AED.git
cd AFLS-AED
pip install -r requirements.txt
cd src
python train.py \
--dataset cifar10 \
--arch resnet110 \
--data_dir ./data \
--batch_size 64 \
--epochs 100 \
--lr 1e-4 \
--wd 1e-2 \
--attacks FGSM PGD AutoAttack Square
- Clean train/test accuracy
- For each attack: robust accuracy, RT-AUC, LT-AUC, RLT-AUC
- Cached adversarial samples in attack_samples_{dataset}_{arch}_{attack}.pt
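The cached files follow the naming pattern above. A tiny helper for rebuilding such a path (hypothetical; `cache_path` is not part of the repository):

```python
def cache_path(dataset: str, arch: str, attack: str) -> str:
    """Filename convention for cached adversarial samples."""
    return f"attack_samples_{dataset}_{arch}_{attack}.pt"

# e.g. the PGD cache for ResNet-110 on CIFAR-10:
print(cache_path("cifar10", "resnet110", "PGD"))
# attack_samples_cifar10_resnet110_PGD.pt
```

Reusing these caches avoids regenerating adversarial examples on repeated runs with the same dataset/architecture/attack combination.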
- ResNet-18, ResNet-110, DenseNet-121, ShuffleNetV2_x1_0, MobileNetV2_x0_5, RepVGG_A0
- CIFAR-10, CIFAR-100, ImageNet
CW Attack: CW adversarial examples are generated with an external library and are not included in this repository.
By default, recovery/testing modules use:
MODEL_SETTINGS = {
"num_aug": 4, # number of learned masks (G)
"recov_dim": 128, # hidden dim of each adapter
"recov_depth": 4, # layers in each MLP
}
Override via CLI:
cd src
python train.py \
--num_aug 6 \
--recov_dim 256 \
--recov_depth 5 \
[other args...]
- CIFAR-10 + ResNet-110
python train.py --dataset cifar10 --arch resnet110
- CIFAR-100 + ResNet-18
python train.py --dataset cifar100 --arch resnet18
- CIFAR-100 + ShuffleNetV2
python train.py --dataset cifar100 --arch shufflenetv2_x1_0
- CIFAR-100 + MobileNetV2
python train.py --dataset cifar100 --arch mobilenetv2_x0_5
- CIFAR-100 + RepVGG
python train.py --dataset cifar100 --arch repvgg_a0
- ImageNet + DenseNet-121 (set --data_dir to your ImageNet path)
python train.py --dataset imagenet --arch densenet121 --data_dir /path/to/imagenet --recov_depth 3 --train_break True
- attacks.py: High-level wrappers for FGSM, PGD, CW, Square Attack, and AutoAttack (foolbox, iGAT).
- backbone.py: Unified interface for loading models from timm, TorchHub, or custom implementations.
- model.py
- ModelWrapper: Hooks into intermediate layers to extract activations.
- Tester: Implements Recovery Testing (RT) and Logit-layer Testing (LT).
- train.py: Orchestrates:
- Training of recovery modules & augmentation matrices
- Robustness evaluation
- Detection AUC computation
- utils.py: Helper functions (e.g., compute_auc for ROC-AUC).
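The hook mechanism behind ModelWrapper can be illustrated with PyTorch forward hooks. The sketch below (the class name and layer-selection logic are assumptions, not the repo's actual code) records intermediate activations of a toy classifier:

```python
import torch
import torch.nn as nn

class ActivationRecorder:
    """Minimal sketch of hook-based activation capture, in the spirit of
    ModelWrapper. Registers a forward hook on each named layer and stores
    that layer's output on every forward pass."""

    def __init__(self, model: nn.Module, layer_names):
        self.activations = {}
        self.handles = []
        modules = dict(model.named_modules())
        for name in layer_names:
            handle = modules[name].register_forward_hook(self._make_hook(name))
            self.handles.append(handle)

    def _make_hook(self, name):
        def hook(module, inputs, output):
            # Detach so stored activations don't keep the autograd graph alive.
            self.activations[name] = output.detach()
        return hook

    def remove(self):
        for h in self.handles:
            h.remove()

# Usage on a toy classifier: record the first Linear layer's output.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 4))
recorder = ActivationRecorder(model, ["0"])   # "0" = name of the first Linear
_ = model(torch.randn(2, 8))
print(recorder.activations["0"].shape)  # torch.Size([2, 16])
recorder.remove()
```

Because hooks attach to a frozen backbone without modifying it, the detector can read layer-wise activations while leaving the classifier's weights and predictions untouched.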
If you find our work useful, please cite:
@article{yun2025few,
title={A Few Large Shifts: Layer-Inconsistency Based Minimal Overhead Adversarial Example Detection},
author={Yun, Sanggeon and Masukawa, Ryozo and Oh, Hyunwoo and Bastian, Nathaniel D and Imani, Mohsen},
journal={arXiv preprint arXiv:2505.12586},
year={2025}
}
