
cifar10 results #22

Closed
ahundt opened this issue Sep 6, 2017 · 13 comments

Comments

@ahundt

ahundt commented Sep 6, 2017

I just ran a 300 epoch run using tensorflow and an unmodified cifar10.py from 54ed2d6 on a Titan X (old version) and got the following results:

Epoch 300/300
499/500 [============================>.] - ETA: 0s - loss: 0.0635 - acc: 0.9891
Epoch 00299: val_acc did not improve
500/500 [==============================] - 181s - loss: 0.0635 - acc: 0.9891 - val_loss: 0.3646 - val_acc: 0.9224
Accuracy :  92.24
Error :  7.76

Here is the file:
DenseNet-40-12-CIFAR10.h5.zip

This definitely doesn't seem as good as the previous training runs cited in the readme, which report 4.51% error.

@titu1994
Owner

titu1994 commented Sep 6, 2017

Hmm, that is odd. I assumed the model would train correctly this time, since the code was almost identical to the one posted by the author. The only things I can think of that differ are the preprocessing and the augmentation.

The earlier result was after tons of restarts, and I did not use horizontal flips for it. Also, if you used the unmodified cifar10 script, does that mean you did not use densenet.preprocess_data(...)? I think the authors specifically mentioned in one of the GitHub issue comments that the mean/std normalization was necessary, along with the scaling.

The "+" mark at the end denotes for standard data augmentation (random crop after zero-padding, and horizontal flip)

They suggest random crops after zero padding, plus horizontal flips. Keras doesn't have built-in support for random crops, so instead I used the augmentation from earlier papers, such as rotation, scaling and horizontal flips.
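
(For reference, since Keras has no built-in random crop, a minimal NumPy sketch of the padded random crop and flip the paper describes might look like the following; the function names here are illustrative and not part of this repository.)

```python
import numpy as np

def random_crop_with_padding(image, pad=4, crop_size=32):
    """Zero-pad an HxWxC image by `pad` pixels on each side,
    then take a random crop_size x crop_size crop (standard CIFAR augmentation)."""
    padded = np.pad(image, ((pad, pad), (pad, pad), (0, 0)), mode="constant")
    top = np.random.randint(0, 2 * pad + 1)
    left = np.random.randint(0, 2 * pad + 1)
    return padded[top:top + crop_size, left:left + crop_size, :]

def random_horizontal_flip(image, p=0.5):
    """Flip the image left-right with probability p."""
    return image[:, ::-1, :] if np.random.rand() < p else image
```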

I suggest removing horizontal flipping. I have found that it simply destroys performance, no matter how long I train models with it. Without it, convergence is faster and, of course, the model overfits faster as well, but at least validation accuracy is higher (though losing flip invariance is a bit of a drawback).

I hate to ask this since it takes so much time, but could you run it with the current modified version? And while it may not be ideal, you could try initializing from these weights to speed things up and avoid needing another 300 epochs. It will be very erratic in the beginning, since the earlier normalization was static at 0-1 and now it's roughly -2.x to 2.x (this comes from (255 - 124) * 0.017). Still, it should not affect too many of the weights at the later end of the network, so it may be better off.
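
(For reference, a minimal sketch of the channel-wise mean/std normalization being discussed; the statistics here are computed from the CIFAR-10 training set rather than taken from the repository's densenet.preprocess_data(...), so the exact values are illustrative.)

```python
import numpy as np
from keras.datasets import cifar10

# Channel-wise mean/std normalization, computed on the training set only.
# This is a sketch of the preprocessing discussed above, not the repo's exact code.
(x_train, y_train), (x_test, y_test) = cifar10.load_data()
x_train = x_train.astype("float32")
x_test = x_test.astype("float32")

mean = x_train.mean(axis=(0, 1, 2), keepdims=True)
std = x_train.std(axis=(0, 1, 2), keepdims=True)

x_train = (x_train - mean) / (std + 1e-7)
x_test = (x_test - mean) / (std + 1e-7)
```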

@titu1994
Owner

titu1994 commented Sep 6, 2017

By "almost identical to the one posted by the author", I mean https://github.com/liuzhuang13/DenseNet/blob/master/models/densenet.lua#L67

@titu1994
Owner

titu1994 commented Sep 6, 2017

Oh wait, disregard the above. It seems you ran it with the updated script. I don't really understand then. Perhaps the additional rotation and scaling are hurting performance?

@ahundt
Author

ahundt commented Sep 7, 2017

Sorry, I meant the older version of the Titan X (there are two). I haven't really investigated but I too found it a bit puzzling. Perhaps additional training would bring it down to the expected level.

@titu1994
Owner

titu1994 commented Sep 7, 2017

Yeah maybe that will work.

@LelouchVC

I ran into the same problem as ahundt: I just ran 300 epochs using the TensorFlow 1.3 backend with Keras 2.0 and got essentially the same result. I also changed the optimizer to SGD with momentum=0.9 and nesterov=True, with an initial learning rate of 0.1, dropped to 0.01 at epoch 150 and to 0.001 at epoch 225 until the end of the 300 epochs, but the result did not improve and stayed at approximately 92%. My GPU is an Nvidia GTX 1080 Ti. I'm quite confused, because I have modified the hyperparameters again and again and cannot see any real improvement. Thanks for your time!
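
(For reference, the optimizer and step schedule described above could be set up in Keras 2 roughly like this; `model` and the data arrays are assumed to be defined elsewhere.)

```python
from keras.optimizers import SGD
from keras.callbacks import LearningRateScheduler

def step_decay(epoch):
    """LR schedule from the comment above: 0.1, then 0.01 at epoch 150, 0.001 at epoch 225."""
    if epoch < 150:
        return 0.1
    if epoch < 225:
        return 0.01
    return 0.001

optimizer = SGD(lr=0.1, momentum=0.9, nesterov=True)
lr_schedule = LearningRateScheduler(step_decay)

# Usage (model and data assumed defined elsewhere):
# model.compile(loss="categorical_crossentropy", optimizer=optimizer, metrics=["accuracy"])
# model.fit(x_train, y_train, epochs=300, callbacks=[lr_schedule],
#           validation_data=(x_test, y_test))
```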

@ahundt
Author

ahundt commented Sep 12, 2017

Perhaps related to #25?

@LelouchVC do you think you could try that version? My GPUs are currently occupied.

@Lan1991Xu

Hi, has this problem been solved? I also used this code, but did not get results as good as the paper reported. Any ideas?

@LelouchVC

I have no idea whether the cause is related to #25... I haven't had time so far; maybe I will try it and see whether it helps.
@ahundt

@LelouchVC

I think the result is related to the Keras version; I saw another implementation that uses the Keras 1.0 API and got a result similar to the paper's.
@Lan1991Xu

@Lan1991Xu

Hi, thanks for your reply. Would you mind giving a link to that other Keras 1.0 code? Or have you tried this code with Keras 1.0? I have had a similar experience: Keras 1.0 and Keras 2.0 give different results.

@sjf8866

sjf8866 commented Oct 14, 2019

I think the result is related to the Keras version; I saw another implementation that uses the Keras 1.0 API and got a result similar to the paper's.
@Lan1991Xu

Do you mind sending me the Keras 1.0 version? I am quite anxious right now because of my graduation thesis. My mailbox is [email protected]. Thank you.

