verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
kris1 has quit [Quit: kris1]
sumedhghaisas has joined #mlpack
sumedhghaisas has quit [Read error: Connection reset by peer]
kris__ has quit [Quit: Connection closed for inactivity]
travis-ci has joined #mlpack
< travis-ci>
mlpack/mlpack#3244 (mlpack-2.2.x - 4f6f2e5 : Ryan Curtin): The build has errored.
< kris__>
The problem seems to be that it is very slow.
< kris__>
So I don't think I will be able to find the optimal arguments for the test in time.
< kris__>
You can try it with: -i train7.txt -o output.txt -e 20 -m 2000 -x 3 -N 100 -r 0.003 -v
< kris__>
Other than that, I was able to fix the ssRBM test last night, i.e. find parameters that give good accuracy while keeping the runtime low. I updated the PR accordingly.
< kris__>
I also added the test for GAN, as we discussed, using a pre-trained GAN for the Gaussian dataset.
< kris__>
It would be great if you could give the GAN PR a final review. I would like to fix the errors and then move on to writing the final blog post. Meanwhile, the test for the O'Reilly example is running on my system.
< kris1>
Well, it actually alternates to 0.39 as well; also, at one training iteration it was 2000.
< kris__>
Also, I did this with a training size of 200. I will try with training size = 2000.
< lozhnikov>
kris__: I can't reproduce it. I get zeros each time. I tried reducing the size of the dataset and got the same result. I guess there is an error in the layer structure.
maigar has quit [Ping timeout: 260 seconds]
< kris__>
Well, I actually tested both the discriminator and the generator separately as well. I will send the code; try running it and see if it works for you.
< lozhnikov>
kris__: I found the issue. The discriminator network shouldn't contain a SigmoidLayer, but without it the Evaluate() function returns NaN. I wrote about that yesterday.
< kris__>
The discriminator by itself trains fine...
< kris__>
Yes, but without the sigmoid, I get NaNs in the output.
< lozhnikov>
Probably changing the network structure is not a good idea. In my view, it is better to figure out why the Evaluate() function returns NaN.
< kris__>
sigmoid_cross_entropy_with_logits operates on unscaled values rather than probability values from 0 to 1. Take a look at the last line of our discriminator: there's no softmax or sigmoid layer at the end. GANs can fail if their discriminators "saturate," or become confident enough to return exactly 0 when they're given a generated image; that leaves the discriminator without a useful gradient to descend.
< kris__>
This is from the tutorial....
< kris__>
I mean the O'Reilly example.
< kris__>
I don't know if the cross entropy implemented in mlpack works the same way; I will take a look.
< kris__>
Okay, so the softmax is applied at the loss-function level in the O'Reilly example, whereas we apply it at the architecture level.
< lozhnikov>
sigmoid_cross_entropy_with_logits differs from SoftMax+CrossEntropy
< kris__>
Well, a softmax over two classes is the same as a sigmoid. I do agree about the overflow part, though.
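For reference, with two logits x0 and x1 the equivalence is a one-line rearrangement:

  e^x1 / (e^x0 + e^x1) = 1 / (1 + e^-(x1 - x0)) = sigmoid(x1 - x0)

so a two-class softmax with one logit fixed at 0 is exactly the sigmoid of the other logit.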
< lozhnikov>
I think that could be the reason for the small gradients.
< kris__>
I don't understand the eps part in the cross entropy implementation of mlpack.
< lozhnikov>
I think epsilon is added in order to avoid NaNs.
< kris__>
I can edit the cross-entropy layer and make it handle overflow, but I am not sure about the backprop.
< kris__>
Ahh no, the backprop is easy too.
< lozhnikov>
No, I don't think so, since the overflow happens in exp(-x), i.e. in the sigmoid layer.
< kris__>
The logistic function is implemented to avoid overflows like that.
< kris__>
lines 41-47 in logistic_function.hpp
< lozhnikov>
actually, no
< lozhnikov>
if (x < arma::Datum<eT>::log_max)
< lozhnikov>
{
< lozhnikov>
  if (x > -arma::Datum<eT>::log_max)
< lozhnikov>
    return 1.0 / (1.0 + std::exp(-x));
< lozhnikov>
  return 0.0;
< lozhnikov>
}
< lozhnikov>
return 1.0;
< lozhnikov>
it handles overflows differently
< lozhnikov>
the present implementation just saturates extreme values to 0 or 1
< kris__>
Hmm, should I just implement sigmoid_cross_entropy_with_logits?
< lozhnikov>
looks reasonable to me, it shouldn't take a lot of time
< kris__>
Okay, I will do that straight away. I will use the same branch, though; otherwise I have to switch, and rebuilding takes around 20-25 minutes. Is that okay?
< lozhnikov>
Okay. On the other hand, you can clone the repo into a separate directory in order to avoid rebuilding.
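For example (directory name arbitrary):

  git clone https://github.com/mlpack/mlpack.git mlpack-second
  cd mlpack-second && mkdir build && cd build && cmake ../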
< kris__>
Just one question: in the equation max(x, 0) - x * z + log(1 + exp(-abs(x))), the x's are scalars, right?
< kris__>
So in the case of arma::mat we have to do every operation on a per-element basis.
< lozhnikov>
sure
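A minimal sketch of that per-element forward pass with Armadillo (hypothetical free function; the actual mlpack layer API differs):

  #include <armadillo>

  // Numerically stable sigmoid cross entropy with logits:
  // loss(x, z) = max(x, 0) - x * z + log(1 + exp(-|x|)), element-wise.
  double SigmoidCrossEntropyWithLogits(const arma::mat& logits,
                                       const arma::mat& targets)
  {
    // exp(-|x|) <= 1, so the log term cannot overflow.
    const arma::mat losses =
        arma::max(logits, arma::zeros<arma::mat>(arma::size(logits)))
        - logits % targets
        + arma::log(1.0 + arma::exp(-arma::abs(logits)));

    // Sum over all elements; dividing by losses.n_cols instead would give
    // the per-sample mean, like tf.reduce_mean in the O'Reilly example.
    return arma::accu(losses);
  }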
sumedhghaisas has quit [Read error: Connection reset by peer]
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< kris__>
lozhnikov: should I use arma::accu or arma::mean in the forward pass of the loss function?
< lozhnikov>
kris__: yeah, I think you should apply arma::accu
< kris__>
Most of the examples I saw were actually using tf.reduce_mean, though.
manjuransari has joined #mlpack
manjuransari has quit [Quit: Page closed]
< kris__>
lozhnikov: I have implemented the layer
< kris__>
but some tests are failing.
< kris__>
Like, when the label is 0 and the input is 0.5, the output should be 0.29...
< kris__>
but the TF output is 0.97407699.
< kris__>
Okay, I figured it out.
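(Sanity check: with x = 0.5 and z = 0, the stable formula gives max(0.5, 0) - 0.5 * 0 + log(1 + exp(-0.5)) = 0.5 + 0.4741 = 0.9741, which matches the TF output above.)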
mikeling has quit [Quit: Connection closed for inactivity]
< lozhnikov>
kris__: Great! Could you share the code? I'll look through that tomorrow
< kris__>
Sure, I will create a new PR for it. I think that's better.
< lozhnikov>
could you cherry-pick commit "Fix depth for bilinear function" (07972dd26e362f442b3a2a5b746a098ccee220fd) to the ResizeLayer branch?
< kris__>
I don't understand; cherry-pick for what? Which PR are you talking about?
< lozhnikov>
you changed the ResizeLayer implementation inside the GAN branch
< kris__>
Okay, you want me to merge the commit "Fix depth for bilinear function" into the ResizeLayer branch?
< lozhnikov>
yeah
< kris__>
Okay, I will do that.
< lozhnikov>
ok, thanks
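For reference, the cherry-pick amounts to:

  git checkout ResizeLayer
  git cherry-pick 07972dd26e362f442b3a2a5b746a098ccee220fd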
< kris__>
I just wanted to ask: can we merge RBM and ssRBM after I add the parameters that you sent in the patch?
< lozhnikov>
I have to look through the whole PR again
< kris__>
Sure... it would be good if we could merge something before Tuesday.
< lozhnikov>
And I think we should ask Marcus. Maybe he wants to add something
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< kris__>
cstdint not found with mlpack 5.5
< kris__>
Using clang; any help?
< kris__>
I found that with -stdlib=libc++ it works.
< kris__>
I see that this is already done in the CMake file, but it does not work for me.
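One way to force the flag through CMake, assuming a fresh build directory inside the source tree:

  cmake -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_CXX_FLAGS="-stdlib=libc++" ../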