Skip to content
This repository has been archived by the owner on Feb 11, 2019. It is now read-only.

why beta=0.1 zeta=0.01? #8

Open
iron9light opened this issue Aug 30, 2018 · 0 comments
Open

why beta=0.1 zeta=0.01? #8

iron9light opened this issue Aug 30, 2018 · 0 comments

Comments

@iron9light
Copy link

I use this recommend params for my own model, the mean of noise scale is keeping about 1e-9 which is too small.

My model this something like below:

layer1 = multi_layer_complex_mess(input)  # layer1 shape: [batch_size, 512]
layer1_bn = batch_norm(layer1)
layer1_act = relu(layer1_bn)
layer2 = dense_with_sdr(layer1_act, unit=1024)
logit = dense(layer2, unit=3)

My origin modal is:

layer1 = multi_layer_complex_mess(input)  # layer1 shape: [batch_size, 512]
layer1_bn = batch_norm(layer1)
layer1_noise = layer1_bn + gaussian_noise(mean=0, stddev=noise_scale)
layer1_act = relu(layer1_noise)
layer2 = dense(layer1_act, unit=1024)
logit = dense(layer2, unit=3)

For the origin model, I tried noise_scale=0.5 and 1.0.
1.0 is better.

If beta and, especially zeta, need to be tune, then there's no much advantage against fix scale gaussian noise.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant