About the pre-trained model #33

bobo0810 · 2020-07-31T07:21:23Z

Thank you for this work. I want to use the pre-trained model of AttentionNet-IRSE-56/92 from the MODEL_ZOO.md for fine-tuning. Where can I get the pre-trained model?

cavalleria · 2020-07-31T07:33:12Z

The AttentionNet-IRSE pretrained model is not available. I saved the models with the torch.jit.save API, which does not seem to support fine-tuning. You can retrain it and save the model this way: https://github.com/cavalleria/cavaface.pytorch/blob/13182ecc349ca050fa5a877045390a41037313a7/train.py#L352

bobo0810 · 2020-07-31T07:35:32Z

OK, thanks! Does that mean none of the models in MODEL_ZOO.md can be fine-tuned, for example IR-SE50?

cavalleria · 2020-07-31T07:55:10Z

The models in MODEL_ZOO.md are saved with the torch.jit.save API, so they can be evaluated without the Python model definition files, but it seems they cannot be fine-tuned. 😂

xsacha · 2020-08-01T14:23:13Z

You can load the weights with torch.jit.load and copy them into the state_dict of a real model (from BACKBONE_DICT) to support fine-tuning.

bobo0810 · 2020-08-02T02:46:54Z

Oh, that's great. Can you provide the pre-trained AttentionNet-IRSE-56/92 model so that I can fine-tune it? Thank you @cavalleria @xsacha

xsacha · 2020-08-03T00:09:32Z

@bobo0810
It should look like this:

import torch
# AttentionNet_IRSE_92 is the backbone class from this repo (see BACKBONE_DICT)
mymodel = AttentionNet_IRSE_92()
mymodel.load_state_dict(torch.jit.load('AttentionNet_IRSE_92_torchscript.pt').state_dict())

See: https://pytorch.org/docs/stable/generated/torch.jit.ScriptModule.html#torch.jit.ScriptModule.state_dict
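
For anyone who wants to try the idea end to end without the real checkpoint, here is a minimal sketch. It assumes a small hypothetical `TinyBackbone` in place of `AttentionNet_IRSE_92`, and an in-memory buffer in place of the `.pt` file, so the names and sizes are illustrative only. It shows that the weights of a module saved with `torch.jit.save` can be copied into a regular `nn.Module`, which can then be trained as usual.

```python
import io

import torch
import torch.nn as nn


class TinyBackbone(nn.Module):
    """Hypothetical stand-in for a real backbone such as AttentionNet_IRSE_92."""

    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
        self.bn = nn.BatchNorm2d(8)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(8, 4)

    def forward(self, x):
        x = torch.relu(self.bn(self.conv(x)))
        return self.fc(torch.flatten(self.pool(x), 1))


# Simulate the released checkpoint: script the model and save it with torch.jit.save.
source = TinyBackbone().eval()
buffer = io.BytesIO()  # stands in for 'AttentionNet_IRSE_92_torchscript.pt'
torch.jit.save(torch.jit.script(source), buffer)
buffer.seek(0)

# Load the TorchScript archive and copy its weights into a regular nn.Module.
scripted = torch.jit.load(buffer, map_location="cpu")
model = TinyBackbone()
model.load_state_dict(scripted.state_dict())

# Check that every parameter and buffer was copied exactly.
weights_match = all(
    torch.equal(value, model.state_dict()[name])
    for name, value in source.state_dict().items()
)

# The regular module can now be fine-tuned; one illustrative optimizer step.
model.train()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss = model(torch.randn(2, 3, 16, 16)).sum()
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

With the real checkpoint, the Python class has to match the architecture that was scripted; otherwise `load_state_dict` will report missing or unexpected keys.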

FelixZhang7 · 2020-08-21T02:20:00Z

@cavalleria Hello, according to the data augmentation results in MODEL_ZOO.md, the baseline performs best. Does that mean we do not need any data augmentation?

xsacha · 2020-08-21T02:21:53Z

When trained from scratch for the same amount of time, the baseline is better. This will be the case for any sort of classifier, AFAIK.
The same happens if you double the size of the dataset: there is more data to learn, so training takes longer.

Augmentation requires either fine-tuning a (pre-trained) baseline or much longer training to see optimal results.

bobo0810 · 2020-08-21T02:25:53Z

@FelixZhang7 Although MODEL_ZOO.md shows this, in my experiments adding data augmentation improves performance.
@xsacha To help the network converge better, data augmentation should be applied when continuing training from the pre-trained model.
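
A rough sketch of that workflow, continuing training from baseline weights while feeding augmented batches, might look like the following. The tiny model, the random data, the learning rate, and the `random_horizontal_flip` helper are all assumptions made for illustration; they are not taken from this repo's training script.

```python
import torch
import torch.nn as nn


def random_horizontal_flip(images, p=0.5):
    """Flip each image in an (N, C, H, W) batch left-right with probability p."""
    flip = (torch.rand(images.size(0)) < p).view(-1, 1, 1, 1)
    return torch.where(flip, torch.flip(images, dims=[3]), images)


# Hypothetical stand-in for a pre-trained backbone plus classification head.
model = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(8, 4),
)
# In a real run, the baseline weights would be loaded into `model` here.

# A small learning rate is a common choice for fine-tuning (an assumption here).
optimizer = torch.optim.SGD(model.parameters(), lr=0.001, momentum=0.9)
criterion = nn.CrossEntropyLoss()

# Random tensors stand in for a real batch of face images and identity labels.
images = torch.randn(8, 3, 16, 16)
labels = torch.randint(0, 4, (8,))

model.train()
for _ in range(3):
    batch = random_horizontal_flip(images)  # augmentation applied per step
    loss = criterion(model(batch), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```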

FelixZhang7 · 2020-08-26T02:22:25Z

@xsacha @cavalleria @bobo0810 Hello, I am looking for a network with better accuracy than resnet50-IR and, ideally, similar inference speed. Could you give me some advice?

xsacha · 2020-08-26T02:25:37Z

@FelixZhang7 AttNet-56-IR
It has similar inference speed to resnet50 on GPU and better accuracy.

bobo0810 · 2020-08-26T02:26:18Z

@FelixZhang7 I haven't compared the speed of the networks yet. Going by MODEL_ZOO.md, I am using attention_irse.

cavalleria · 2020-08-26T02:46:12Z

@bobo0810 efficient-b1, mobilenetv3 (with enlarged width), and so on.

FelixZhang7 · 2020-08-27T06:22:08Z

@xsacha I've tried AttNet-56-IR; the accuracy is almost the same as resnet50-IR...
