-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fixed same weights bug when loading SSL models from Vissl #178
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #178 +/- ##
==========================================
+ Coverage 76.23% 76.30% +0.06%
==========================================
Files 40 40
Lines 2058 2055 -3
Branches 262 262
==========================================
- Hits 1569 1568 -1
+ Misses 404 402 -2
Partials 85 85
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Hi Jonas, I can imagine this was an incredibly annoying bug to find 🤣 Do I understand it correctly that, for example, if you loaded Okay, I've taken a closer look at all the filenames now and can more clearly see what the issue was and how the fix amends it. It would have been a bit clearer to specify more clearly the addition of I can see this was raised in the main repo about 5 years ago, yikes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
For
simclr-rn50
,swav-rn50
andpirl-rn50
, the model weights ended up being the first one that were loaded becausetorch.hub.load_state_dict_from_url
cached the state dicts based on the file_name of the url (last part) although the first part of the url was completely different. With thefile_name
argument, we specify now a unique name based on the model name.