verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
< kris__> lozhnikov: Do you have the script that you used for generating the images from the RBM, following the deeplearning.net example? Also the .cpp file of the test. I seem to be missing it.
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
kris1 has quit [Quit: kris1]
< lozhnikov> kris__: Try the following file. https://www.irccloud.com/pastebin/yKmWeiQu/main.cpp
kris1 has joined #mlpack
< lozhnikov> kris__: I wrote that for the ssRBM but it isn't difficult to use the binary RBM instead.
< lozhnikov> Could you describe your build issue in more detail?
< kris__> The build issue had to do with cstdin not being found; I think the Python wrappers that were introduced require that header.
< kris__> I was able to build after removing the line from CMakeLists.txt that adds -stdlib=libc++ when the Apple macOS version is less than 10.9. I removed the if statement and it worked.
< lozhnikov> which branch are you compiling?
< kris__> The newest branch. Since then I have reverted the merge I did with the latest version; now I run 2.2.4 and everything works fine.
< lozhnikov> I haven't got macOS, so it is probably difficult to reproduce.
< kris__> Okay, no problem. Right now I have it working using the easy solution. I will look for a proper fix later.
< kris__> Both the resize layer and cross entropy with logits build successfully. Travis is failing on RandomForestTest.
< kris__> So you can go ahead and have a look at them.
< lozhnikov> I'll look through that today
< kris__> ok, I will give the O'Reilly example a final try today...
< kris__> The O'Reilly example is working now, check it out here: https://gist.github.com/kris-singh/4b355418edd9c69ede11c4af18086438
< kris__> Since CrossEntropyWithLogits is the same as sigmoid + CrossEntropy, I replaced it in the generator network.
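(A quick sketch of why those two losses coincide, assuming the usual binary cross entropy definitions; with logit x, target z, and sigmoid σ:)

```latex
% Sigmoid output layer followed by plain cross entropy:
L = -\bigl[ z \log \sigma(x) + (1 - z) \log(1 - \sigma(x)) \bigr],
\qquad \sigma(x) = \frac{1}{1 + e^{-x}}.
% Substituting \sigma and simplifying gives the combined
% "cross entropy with logits" objective in its numerically stable form:
L = \max(x, 0) - x z + \log\bigl(1 + e^{-\lvert x \rvert}\bigr).
```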
< kris__> ./gan_test2.o -i train7.txt -o output.txt -e 200 -m 200 -x 1 -N 100 -r 0.003 -v
< kris__> If you could run it with these parameters, that would be great: ./gan_test2.o -i train7.txt -o output.txt -e 20 -m 2000 -x 1 -N 100 -r 0.003 -v
< lozhnikov> okay, I'll run the test soon
< lozhnikov> kris__: I ran the test with the following arguments: -e 20 -m 2000 -x 300 -N 100 -r 0.0003 -v
< lozhnikov> I think these parameters match the O'Reilly example better than yours.
< kris__> Sure, but I think those would require a lot of time to converge; I was going for finding a good set of starting parameters and then optimising them.
< kris__> Though I think -e 20 is okay, -x 300 is pretty high.
< lozhnikov> I don't think we have enough time to obtain new parameters.
< lozhnikov> okay, I'll replace that with 100
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
kris1 has quit [Client Quit]
kris1 has joined #mlpack
< lozhnikov> [INFO ] gradientGenerator = 0.000000e+00
< lozhnikov> [INFO ] gradientDiscriminator = 2.155897e-02
< lozhnikov> [INFO ] gradientGenerator = 0.000000e+00
< lozhnikov> [INFO ] gradientDiscriminator = 2.877930e-02
< lozhnikov> kris__: the gradient of the generator is zero
< lozhnikov> I tried replacing GaussianInitialization gaussian(0, 1); with GaussianInitialization gaussian(0, 0.02);. Actually, the O'Reilly example uses 0.02 as the standard deviation.
< lozhnikov> I got the same result
< lozhnikov> Curiously, SigmoidLayer + CrossEntropyError works fine with it
< lozhnikov> so, the error that we observed yesterday happened due to an incorrect deviation
kris1 has quit [Quit: kris1]
< lozhnikov> however, CrossEntropyErrorLogits works with deviation 1.0
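(A minimal sketch of how the deviation change above plugs into the network, assuming the GSoC-era mlpack ann API; the discriminator here is illustrative, not the test's actual code:)

```cpp
#include <mlpack/methods/ann/ffn.hpp>
#include <mlpack/methods/ann/layer/layer.hpp>
#include <mlpack/methods/ann/init_rules/gaussian_init.hpp>

using namespace mlpack::ann;

// Draw the initial weights from N(0, 0.02) instead of the default N(0, 1).
GaussianInitialization gaussian(0, 0.02);

// The initialization rule is passed to the network; SigmoidLayer +
// CrossEntropyError is the combination that works with this deviation.
FFN<CrossEntropyError<>, GaussianInitialization> discriminator(
    CrossEntropyError<>(), gaussian);
```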
kris1 has joined #mlpack
kris1 has quit [Client Quit]
kris1 has joined #mlpack
< kris__> Just saw the comments...
< kris__> So does the sigmoid + cross entropy layer work?
< lozhnikov> sigmoid + cross entropy shows the same results as CrossEntropyErrorLogits with deviation 0.02
< lozhnikov> but the gradient of the generator is zero
< kris__> Okay, I will check what's wrong now.
< kris__> btw I have updated the summary of the work. Will you have a look at it when you have time?
< lozhnikov> actually, that isn't the summary
< lozhnikov> you should write the final blog post and describe all the changes you have made
< lozhnikov> and then you should submit the final evaluation with the link to the final blog post
< kris__> I do not know why it doesn't show up on the website though.
< lozhnikov> try changing "Summary Date" to "Date"
< lozhnikov> 1. "I am happy to say that in terms of visual reconstruction in both the examples of Mnist digit generation and Gaussian distribution generation we were able to get comparable results with keras and tensorflow."
< lozhnikov> I think we haven't got good results on the MNIST dataset yet.
< lozhnikov> 2. "Here are the results of our implementation on the digits dataset(smaller version Mnist)."
< lozhnikov> Actually, this is the mnist dataset
< kris__> Yes, the MNIST 7s; I will update it. Okay, I will try running it for more epochs and see if we can get better results.
< lozhnikov> 3. "One of the reasons accuracy of ssRBM is less than ssRBM".
< lozhnikov> I guess "is less than binary RBM" is correct
kris1 has quit [Quit: kris1]
< lozhnikov> 4. "I would also like to say here that Mikhail tried to convert mu-ssRBM code for testing our implementation but it took a lot of time finally."
< lozhnikov> I finished the implementation of the mu-ssRBM, but it isn't well tested yet; I focused on the GAN PR. That's why I haven't pushed the code yet. So, I think there is no sense in writing about that. However, it's up to you.
< lozhnikov> 5. "We tried ssRBM on the cifar data set code but due to the large volume of data set and scarcity of the computation resources, we decided that it was not really required."
< lozhnikov> If I remember right, last time you said that you got good accuracy on the CIFAR dataset.
< zoq> kris__: About the libc++ issue, what OS did you use?
< zoq> kris__: Also, the static site generator is really picky about the metadata header; if you remove the extra lines (between the metadata) the build should pass.
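(For illustration, a header of the shape the generator likely expects: consecutive metadata lines with no blank lines between them. The field names and values below are assumptions, not the actual post's.)

```
Title: GSoC 2017: Final Summary
Date: 2017-08-28
Author: Kris Singh
```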
< lozhnikov> kris__: 6. I think it is reasonable to upload the tests to github gists and mention them in the blog post.
< lozhnikov> Apart from the comments above, the blog post looks good to me.
< lozhnikov> kris__: I looked through the O'Reilly example again. The fourth ReLU layer is not needed in the generator network.
klitzy has joined #mlpack
klitzy has quit [Client Quit]
< kris__> I will have to look at the CIFAR classification accuracy.
< kris__> I have the code for it.
< kris__> But I have to look at the results once.
< kris__> zoq: I use Mac OS X 10.12, if I remember right.
< lozhnikov> kris__: without the fourth ReLU layer the gradient of the generator network isn't equal to zero
< lozhnikov> [INFO ] gradientGenerator = 3.100361e+00
< lozhnikov> [INFO ] gradientDiscriminator = 1.375701e-04
< lozhnikov> [INFO ] gradientGenerator = 3.084291e+00
< lozhnikov> [INFO ] gradientDiscriminator = 1.281130e-04
< lozhnikov> [INFO ] gradientGenerator = 3.063628e+00
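(A hypothetical sketch of the fix, continuing the snippet above: the generator ends in its output activation, with no fourth ReLU after it. Layer sizes are assumptions, not the test's actual values:)

```cpp
const size_t noiseDim = 100;  // matches -N 100 from the runs above

FFN<CrossEntropyError<>, GaussianInitialization> generator(
    CrossEntropyError<>(), gaussian);
generator.Add<Linear<>>(noiseDim, 128);
generator.Add<ReLULayer<>>();
generator.Add<Linear<>>(128, 392);
generator.Add<ReLULayer<>>();
generator.Add<Linear<>>(392, 784);  // 784 = 28 x 28 MNIST pixels
generator.Add<SigmoidLayer<>>();    // output activation; no trailing ReLU
```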
kris1 has joined #mlpack
< kris__> yup, 10.12.6, build version 16G29
< zoq> kris__: okay, let me update my machine and see if I can reproduce the issue
< kris__> zoq: can we use L1 or L2 weight regularisers in the FFN?
< zoq> kris__: Unfortunately it's not implemented yet.
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< lozhnikov> kris__: I ran the test with the following arguments -e 20 -m 2000 -x 100 -N 100 -r 0.03 -v and got the following results https://usercontent.irccloud-cdn.com/file/fuO7OT6K/mnist-conv.png
< lozhnikov> something went wrong
< kris__> Well, I don't know. I think the O'Reilly example goes through around 6000 epochs to get the desired result.
< kris__> How did your results converge so fast?
< kris__> I am running with far fewer epochs, but it has been going on since morning.
< lozhnikov> I think something is incorrect since the gradients differ too much
< lozhnikov> [INFO ] gradientGenerator = 1.562875e+00
< lozhnikov> [INFO ] gradientDiscriminator = 8.360856e-06
< lozhnikov> why is the objective so huge?
< lozhnikov> did you fix the standard deviation?
< kris__> No, I was training with CrossEntropy with logits.
< kris__> Since the training had been going on since morning, I didn't kill it.
< kris__> I will kill it now.
< lozhnikov> how much time should the test take?
< kris__> I am not sure. I actually did not run the O'Reilly example, since you said that you had run it already.
< lozhnikov> Mini-batch SGD prints output each epoch, so you can estimate the time.
< kris__> Okay, I will check and tell you, but I am pretty sure that one epoch takes > 25 min on my machine with -m 2000.
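(A back-of-the-envelope version of that estimate, assuming -e is the epoch count and every epoch costs about the same as the first:)

```cpp
// Rough runtime estimate from the per-epoch output of mini-batch SGD.
const double minutesPerEpoch = 25.0;  // observed above with -m 2000
const int epochs = 20;                // -e 20
const double totalHours = epochs * minutesPerEpoch / 60.0;  // about 8.3 hours
```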
< kris__> lozhnikov: I had a question. You said that the MNIST 7 reconstruction we got was not good; I think that could be because of the regularisation of weights.
< lozhnikov> kris__: and what is the question?
< kris__> No, I was just saying that we don't have weight regularisation in mlpack, so I was trying a higher number of epochs to get images that were smooth.
< kris__> But I am unable to do that.
< lozhnikov> maybe you are right. I didn't dig into the keras example
< lozhnikov> kris__: have you fixed the blog post, and have you submitted the final evaluation? I just want to make sure that you haven't run into trouble with that.
< kris__> Not yet. I will do that by tonight. I just wanted to wait for the results from the CNN GAN, but I think that will not be possible, so I will go ahead.
< kris__> Should I submit it now, or wait till Tuesday? Because once you submit, you can't change it.
< lozhnikov> I don't recommend waiting until Tuesday
< lozhnikov> so, I think it is better to complete the evaluation now
< kris__> Ok, I will do that in an hour or so then.
vpal has joined #mlpack
vpal has quit [Read error: Connection reset by peer]
vivekp has quit [Ping timeout: 248 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 260 seconds]
vivekp has joined #mlpack
< kris__> zoq: The images are not displayed on the mlpack.org page.
< kris__> But on the GitHub page they are shown. Do I need to give a full path for the img src? Right now I have given only a relative path.
vivekp has quit [Read error: Connection reset by peer]
< lozhnikov> try to fix the path, i.e. replace "../images/mnist_out.png" with "images/mnist_out.png"
< lozhnikov> take a look at the fourth blog post https://github.com/mlpack/blog/blob/master/content/blog/KrisWeekFour.md
vivekp has joined #mlpack
< kris__> Done...thanks.
< kris__> lozhnikov: Should I implement the batch norm layer for the CNN test? Also, do you have any ideas on how we could test it faster?
< kris__> One more thing: since batch normalization acts as a regularizer, it can be replaced by the Dropout layer. But when I was using the dropout layer, the network converged very quickly, I mean in 2-3 iterations. This is true even for individual FFNs.
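(A hypothetical sketch of that swap, in the same FFN style as above; the 0.5 ratio and layer sizes are assumptions:)

```cpp
FFN<CrossEntropyError<>, GaussianInitialization> discriminator(
    CrossEntropyError<>(), gaussian);
discriminator.Add<Linear<>>(784, 128);
discriminator.Add<ReLULayer<>>();
discriminator.Add<Dropout<>>(0.5);  // regularizer in place of batch norm
discriminator.Add<Linear<>>(128, 1);
discriminator.Add<SigmoidLayer<>>();
```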
< kris__> zoq: Why do you say the present implementation of batchNorm won't work with conv layers?
vivekp has quit [Ping timeout: 276 seconds]
< zoq> kris__: The current implementation does not cover the data dimensions correctly if > 2 (normalization step isn't correct); Sumedh is working on that end.
< zoq> kris__: Looks like you fixed the image issue?
< zoq> kris__: I'm not able to reproduce the libc++ issue; Mac OS X > 10.9 (Mavericks) should automatically link against libc++. Did you say that the problem occurred with mlpack 2.2.5?
< kris1> Yes the newest version.
< zoq> Did you build with -DBUILD_PYTHON_BINDINGS=OFF?
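(i.e. a configure step like the following, run from the build directory; the source path is an assumption:)

```
cmake -DBUILD_PYTHON_BINDINGS=OFF ..
```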
< kris1> no, actually I did not...
< zoq> Do you have time to test that?
< zoq> not sure that's the problem
< kris1> Well, that would fix it, I think.
< kris1> I will check tomorrow morning. I have an early day tomorrow.
< zoq> okay, thanks a lot
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 255 seconds]
vivekp has joined #mlpack
kris1 has quit [Quit: kris1]
kris__ has quit [Quit: Connection closed for inactivity]