Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to train resnet_18_adf #2

Open
SaumilShah66 opened this issue May 7, 2020 · 1 comment
Open

Unable to train resnet_18_adf #2

SaumilShah66 opened this issue May 7, 2020 · 1 comment

Comments

@SaumilShah66
Copy link

Hey,
I am trying to train resnet18_adf network from the beginning, but it doesn't train. After several epochs, my loss and accuracy don't change and the output shows "nan" values. I guess this is happening because my variances overshoot and they start showing "inf".

@mattiasegu
Copy link
Contributor

Hi @SaumilShah66

Thanks for the interest in our work!

Concerning the NaNs for loss when training ADF, it is a known problem that training with the heteroscedastic loss may be difficult because of numerical instability problems. We also mentioned in our paper that it wasn't possible to train the heteroscedastic neural network from Kendall et al. because of numerical instability enhanced by the SoftMax layer. To address this problem when trainining the ADF network with the heteroscedastic loss, we initialized the network weights from the best pretrained ckpt on Resnet-18 with and without dropout. You can try it yourself, no modification to the code are needed, you only need to load one of the two available ckpts trained without ADF.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants