PyTorch implementation of the MixMatch semi-supervised algorithm for image classification.
Semi-supervised learning leverages unlabeled images to learn quality representations, allowing training with a much smaller labeled dataset. Results are surprisingly close to fully supervised learning.
$ conda env create --file environment.yml
Change the desired parameters in config.yml and run:
$ python3 main.py
- CIFAR-10: The dataset is downloaded automatically to data/cifar-10-batches-py if it has not been downloaded before.
| Accuracy (%) | 250 labels | 1000 labels | 4000 labels | Fully supervised |
|---|---|---|---|---|
| This code | 86.52 | 90.28 | 93.33 | 94.39 |
| MixMatch paper | 88.92 ± 0.87 | 92.25 ± 0.32 | 93.76 ± 0.06 | 95.87 |
In this project, we attempt to improve MixMatch with pseudo-labels. The idea is to take the most confident predictions of the model trained with MixMatch and use them as one-hot labels, so that unlabeled images can be included in the labeled dataset. Training then resumes with the extended labeled dataset.
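The selection step can be sketched as follows: run the trained model over the unlabeled pool and keep only samples whose top softmax probability exceeds a confidence threshold, using the predicted class as a one-hot label. This is a minimal illustrative sketch; the function name, loader, and threshold default are assumptions, not this repository's actual code.

```python
import torch
import torch.nn.functional as F

def select_pseudo_labels(model, unlabeled_loader, threshold=0.95, device="cpu"):
    """Return (images, hard labels) for unlabeled samples whose predicted
    class probability exceeds `threshold`. Illustrative sketch only."""
    model.eval()
    images, labels = [], []
    with torch.no_grad():
        for x, _ in unlabeled_loader:
            probs = F.softmax(model(x.to(device)), dim=1)
            conf, pred = probs.max(dim=1)          # confidence and predicted class
            mask = conf >= threshold               # keep only confident samples
            if mask.any():
                images.append(x[mask.cpu()])
                labels.append(pred[mask].cpu())
    if images:
        return torch.cat(images), torch.cat(labels)
    return torch.empty(0), torch.empty(0, dtype=torch.long)
```

The selected pairs would then be appended to the labeled dataset before training resumes.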
We do not see a significant improvement from pseudo-labels. An interesting finding is that the model makes wrong guesses even on very confident predictions (>99% confidence), possibly as a side effect of MixMatch's entropy minimization. This confirmation bias is likely hurting performance. Results are after only 200,000 update steps.
| Accuracy (%) | 4000 labels |
|---|---|
| Plain MixMatch | 92.75 |
| Pseudo-labels (threshold = 0.95) | 92.25 |
| Pseudo-labels (threshold = 0.99) | 92.74 |
| Pseudo-labels (top 10% of each class) | 92.63 |
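The per-class variant in the last row can be sketched as below: instead of a global confidence threshold, keep the most confident fraction of samples predicted as each class, which balances the pseudo-labeled set across classes. The helper name and signature are hypothetical, for illustration only.

```python
import torch

def top_fraction_per_class(probs, fraction=0.10):
    """Given softmax outputs `probs` of shape (N, C), return the indices of
    the top `fraction` most confident samples per predicted class.
    Illustrative sketch of "top 10% of each class" selection."""
    conf, pred = probs.max(dim=1)
    keep = []
    for c in range(probs.size(1)):
        idx = (pred == c).nonzero(as_tuple=True)[0]   # samples predicted as class c
        if idx.numel() == 0:
            continue
        k = max(1, int(fraction * idx.numel()))       # at least one sample per class
        order = conf[idx].argsort(descending=True)    # most confident first
        keep.append(idx[order[:k]])
    return torch.cat(keep) if keep else torch.empty(0, dtype=torch.long)
```

Class-balanced selection avoids the failure mode where a global threshold admits many samples from easy classes and almost none from hard ones.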
@article{berthelot2019mixmatch,
title={MixMatch: A Holistic Approach to Semi-Supervised Learning},
author={Berthelot, David and Carlini, Nicholas and Goodfellow, Ian and Papernot, Nicolas and Oliver, Avital and Raffel, Colin},
journal={arXiv preprint arXiv:1905.02249},
year={2019}
}