You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello authors, I have ran training on a single GPU and tested both the original batch size and an increased batch size (2X since given script runs on two GPUs). In all these cases, the training script does not yield a model with comparable accuracy as those reported in the paper.
I wonder how these detailed results compare to yours, and moreover, what other items are necessary to reproduce the DeiT-LT results? I am happy to provide more details as well. Thanks!
The text was updated successfully, but these errors were encountered:
Hello authors, I have ran training on a single GPU and tested both the original batch size and an increased batch size (2X since given script runs on two GPUs). In all these cases, the training script does not yield a model with comparable accuracy as those reported in the paper.
The results are the following:
I wonder how these detailed results compare to yours, and moreover, what other items are necessary to reproduce the DeiT-LT results? I am happy to provide more details as well. Thanks!
The text was updated successfully, but these errors were encountered: