You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The paper mentions that an ensemble model achieved the state of the art results however there is no mention of how the seperate models were trained
#33
Open
Chhokra opened this issue
Mar 26, 2020
· 2 comments
The readme only mentions of a training one single model if I'm not wrong. How to go about training 4 models as mentioned by the results table of your paper?
The text was updated successfully, but these errors were encountered:
Chhokra
changed the title
The paper mentions that an ensemble model achieved the state of the art results however there is no mention of how the seperate models were traiend
The paper mentions that an ensemble model achieved the state of the art results however there is no mention of how the seperate models were trained
Mar 26, 2020
I was also looking into this and came to the conclusion that they likely just used 4 different random initializations as was done in (Chollampatt, Ng, 2018), a paper they reference.
Thanks @kevbp5. We did use 4 different random initializations for the models without DA.
For the models with DA, we also used pre-trained checkpoints from different pre-training stages.
The readme only mentions of a training one single model if I'm not wrong. How to go about training 4 models as mentioned by the results table of your paper?
The text was updated successfully, but these errors were encountered: