verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
sumedhghaisas has joined #mlpack
kris1 has quit [Quit: kris1]
govg has quit [Ping timeout: 240 seconds]
govg has joined #mlpack
< ironstark> rcurtin: zoq: Now that the R implementations have been added, I would like to start working on the webpage and the charts. I see that some of it is already implemented and can be seen at http://mlpack.org/benchmarks.html. I just wanted to ask where to begin with preparing this webpage?
< lozhnikov> kris1: the shape of the output of the second convolutional layer doesn't correspond to the shape of the input of the third convolutional layer
< lozhnikov> try the following:
< lozhnikov> generator.Add<Convolution<>>(gInputDim / 2, gInputDim / 4, 3, 3, 2, 2, 15, 15, 28, 28);
< lozhnikov> i.e. replace 14 by 15
< lozhnikov> Actually, these arguments look strange since the padding size is greater than the filter size
< lozhnikov> are you sure that the stride size is correct?
< lozhnikov> I looked through the oreilly example. You forgot that they use tf.image.resize_images(g2, [56, 56]) each time
< lozhnikov> So, in that example the size of the output of each convolutional layer is equal to 28x28
< lozhnikov> and tf.image.resize_images() restores the original size (56*56)
kris1_ has joined #mlpack
sumedhghaisas has quit [Quit: Ex-Chat]
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Client Quit]
sumedhghaisas__ has joined #mlpack
bvr has joined #mlpack
bvr has quit [Client Quit]
bvr has joined #mlpack
lozhnikov has quit [Excess Flood]
lozhnikov has joined #mlpack
< zoq> ironstark: I like the idea from the proposal; using e.g. Chartist.js to create a more interactive data view. I guess it would be a good idea to start with a demo that shows all the functionality you would like to integrate into the view. Currently the data is stored in a MySQL database; we can provide a database dump that you could use. Let us know what you think.
kris1_ has quit [Quit: kris1_]
kris1 has joined #mlpack
sumedhghaisas__ has quit [Ping timeout: 240 seconds]
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< kris1> Lozhnikov: I was able to get the results for this test https://github.com/bstriner/keras-adversarial/blob/master/examples/example_gan.py using keras ....
< kris1> Here are some of the output images....
govg has quit [Ping timeout: 240 seconds]
< kris1> I have implemented the network in mlpack; I will test it once I get the GAN working on the digits dataset.
< lozhnikov> which dataset did you use?
< kris1> I used the full mnist dataset.
< kris1> I had PM'd you regarding the error I am facing with the patch that you sent. Did you get the messages?
< lozhnikov> hmm.. but your results don't look like digits. in that case there is no need to implement this test. it doesn't work at all
< lozhnikov> I replied. Did you receive my messages?
< kris1> No actually not.
< lozhnikov> diff --git a/src/mlpack/methods/ann/gan_impl.hpp b/src/mlpack/methods/ann/gan_impl.hpp
< lozhnikov> index 0b787443c..97f945979 100644
< lozhnikov> --- a/src/mlpack/methods/ann/gan_impl.hpp
< lozhnikov> +++ b/src/mlpack/methods/ann/gan_impl.hpp
< lozhnikov> @@ -136,7 +136,7 @@ double GAN<Model, InitializationRuleType, Noise>::Evaluate(
lozhnikov has quit [Excess Flood]
lozhnikov has joined #mlpack
< lozhnikov> oh, I hit the wrong button
< lozhnikov> diff --git a/src/mlpack/methods/ann/gan_impl.hpp b/src/mlpack/methods/ann/gan_impl.hpp
< lozhnikov> index 0b787443c..97f945979 100644
< lozhnikov> --- a/src/mlpack/methods/ann/gan_impl.hpp
< lozhnikov> +++ b/src/mlpack/methods/ann/gan_impl.hpp
< lozhnikov> @@ -136,7 +136,7 @@ double GAN<Model, InitializationRuleType, Noise>::Evaluate(
lozhnikov has quit [Excess Flood]
lozhnikov has joined #mlpack
< kris1> Ahh, I did reply to that; here are my replies ....
< kris1> That does not fix it for me... std::normal_distribution() requires that we give it a randGen object
< kris1> My program doesn't even compile, btw. I mean the random number engine object. Also, for the dataset I am using the digits_train.arm file. Are you using the same? With randGen the problem persists.
< lozhnikov> I hit the wrong button. The last 2 messages contain the fix
< lozhnikov> I've sent you the program. Look at gan.cpp
< kris1> Yup… I used that program only…. My question is: in your fix, you said to remove the randGen parameter from the noiseFunction, right?
conrad_ has joined #mlpack
< lozhnikov> yes, randGen is not needed
< kris1> Hmm okay i will try once again. Just give me 15 min.
< lozhnikov> I'll repeat since there were a lot of messages.
< lozhnikov> Your results (http://imgur.com/a/IeTn2) don't look like digits. in that case there is no need to implement this test. it doesn't work at all
< kris1> How would std::uniform_real_distribution be able to create random numbers with the random engine?
< lozhnikov> I don't use std::uniform_real_distribution
< kris1> Also, for the link I sent there are 3 images; I think you only looked at the starting image.
< kris1> The starting image is from epoch 0.
< lozhnikov> oh, I saw only the first one
< lozhnikov> I sent you the wrong file
< lozhnikov> in my opinion, the oreilly example shows better results than your test, since the oreilly example generates different digits
< kris1> Also, I was just testing the ssRBM implementation with 30 epochs and hidden size 80; accuracy is ~79% and the time is around 4 min for all three tests. So I don't think that time is an issue.
< lozhnikov> Travis tells that RbmNetworkTest wastes 1276.57 sec
< lozhnikov> kris1: what about the oreilly example?
< kris1> Test project /Users/kris/Desktop/GsoC2k17/mlpack/build
< kris1> Start 1: RbmNetworkTest
< kris1> 1/1 Test #1: RbmNetworkTest ................... Passed 105.38 sec
< kris1> 100% tests passed, 0 tests failed out of 1
< kris1> Total Test time (real) = 105.41 sec
< kris1> this is on my local machine with 25 epochs and hidden size 100.
< kris1> I am still trying to get their architecture. My present implementation doesn't work.
< kris1> I saw your comments on the convolution problem only in the evening; I will work on it after dinner, I guess.
< lozhnikov> hmm... I sent them in the morning, our timezones don't differ a lot
conrad_ has left #mlpack []
< kris1> I only saw them once I checked the logs……
< lozhnikov> okay, I'll repeat.
< lozhnikov> kris1: the shape of the output of the second convolutional layer doesn't correspond to the shape of the input of the third convolutional layer
< lozhnikov> try the following:
< lozhnikov> generator.Add<Convolution<>>(gInputDim / 2, gInputDim / 4, 3, 3, 2, 2, 15, 15, 28, 28);
< lozhnikov> i.e. replace 14 by 15
< lozhnikov> Actually, these arguments look strange since the padding size is greater than the filter size
< lozhnikov> are you sure that the stride size is correct?
< lozhnikov> I looked through the oreilly example. You forgot that they use tf.image.resize_images(g2, [56, 56]) each time
< lozhnikov> So, in that example the size of the output of each convolutional layer is equal to 28x28
< lozhnikov> and tf.image.resize_images() restores the original size (56*56)
< kris1> I do agree with the comments; just one thing: how do you resize the images from 28*28 to 56*56? Are you padding them with zeros?
< kris1> tf resize function implementation once.
< lozhnikov> we have to insert a layer that resizes images
< kris1> Hmmm, okay, I think I would implement such a layer in that case. But I am not sure how I would backpropagate through such a layer.
< lozhnikov> that shouldn't be difficult; the layer doesn't even have weights
< kris1> Okay, got your point. There is something called transposed convolution; I think that is better, the DCGAN paper uses that. I was trying to implement that.
< kris1> But I have to understand that fully yet.
< lozhnikov> we didn't test that yet. but we tested the oreilly example a lot. so, I think it is better to implement the oreilly example first
< kris1> Okay…. but I think it is better if we do fractional convolution, since both are doing interpolation. One is doing it in a backpropagable way.
< kris1> I will implement the bilinear interpolation layer by tonight and then create a different PR.
< lozhnikov> sounds good
< zoq> kris1: It might be helpful to take a look at the Glimpse layer in particular ReSampling.
< kris1> Okay, thanks, I will have a look at it. I have updated the ssRBM PR; ctest on my local machine takes 105.38s for all 3 tests, with ssRBM accuracy being reduced to 78.2%
govg has joined #mlpack
< zoq> kris1: Okay, let's wait for the travis timings.
kris1 has quit [Quit: kris1]
< rcurtin> ironstark: great, if you want to work on the webpage side, here are a couple ideas:
< rcurtin> - take a look at the PR I just opened for the sweep view and review it: https://github.com/mlpack/benchmarks/pull/106 -- comment if you find anything wrong or anything that could be improved :)
< rcurtin> - run some small benchmark jobs on Jenkins (I'd suggest using a small configuration) in order to thoroughly test some methods
< rcurtin> - like Marcus suggested maybe chartist.js might be worth looking into
< rcurtin> one of the things I have been trying to focus my time on is assembling a set of a few focused benchmarks of mlpack, as opposed to a lot of benchmarks that are harder to interpret
< rcurtin> so another idea might be to spend some time working on a configuration for one specific method, making sure that a good collection of datasets is used and that the benchmarks run successfully for each method
< rcurtin> ironstark: also if you can fix the issues with PR #101, I think we can merge it after that
kris1 has joined #mlpack
< kris1> lozhnikov: How do you use irccloud on your laptop (I am assuming this)? I use colloquy and it is basically pretty bad, so I was thinking of switching...
< lozhnikov> kris1: it doesn't depend on your PC type. I mean it doesn't matter which PC you are using (laptop or desktop). irccloud works via a browser
kris_ has joined #mlpack
< lozhnikov> kris1: actually, that's a paid service rather than an app. If you are looking for free alternatives and you've got a static IP at home, you could set up a bouncer instead. Actually, you don't even need a static IP; you could use dynamic DNS
< kris1> Ahhh thanks i will have a look….
kris__ has joined #mlpack
< kris1> zoq: travis failed again.
< kris1> I do not understand why it is failing with the debug option on…
< kris1> Also, the 1st travis ci machine passes the test in 1hr 6 min, but the second test fails at 1hr 4 min??
< lozhnikov> kris1: you forgot about the time required for compilation
< lozhnikov> The release build of RbmNetworkTest wastes 717.55 sec. The debug build should waste more time
< lozhnikov> for example RecurrentNetworkTest takes 145.21 sec in release mode and 384.00 sec in debug mode
< kris1> Ok…. still, I don't get how the second build stops at the 1hr 4 min mark; it should have a higher build time if debug mode is on.
< lozhnikov> the test reached the time limit
< zoq> Each travis build has a specific time limit, and travis will stop the build if it takes longer than x minutes, in our case the build will timeout after 70 minutes.
< kris1> Okay, I will try to reduce the epoch count to 20 and check…
kris__ has quit [Quit: Connection closed for inactivity]
kris_ has quit [Quit: Connection closed for inactivity]
< kris1> For bilinear interpolation, or any image resize, how would the user define the backward pass?
< kris1> i.e. the gradients coming in would be 56 * 56 and the outgoing would be 28 * 28.
sumedhghaisas__ has joined #mlpack
sumedhghaisas__ has quit [Ping timeout: 240 seconds]
< ironstark> zoq: rcurtin: Thanks for all the ideas. I'll look into it.
< ironstark> I tried moving the dlibml make file to methods/dlibml/src/build_scripts.sh
< ironstark> but how do I use LIBPATH and INCLUDEPATH there?
< ironstark> I am getting the following error:
< ironstark> ./build_scripts.sh: 1: ./build_scripts.sh: shell: not found
< ironstark> ./build_scripts.sh: 1: export: :: bad variable name
< ironstark> doing a pwd in build_scripts returns the path up to src
< ironstark> how do I get the path up to only home?
< zoq> kris1: Depending on what the user specified at layer construction, either the new image size or the resize factor, you should be able to reconstruct the necessary parameters for the Forward and Backward pass; in your case all you need to know is the resize value, which is 2?
< zoq> ironstark: You could pass the path information, something like: https://github.com/mlpack/jenkins-conf/blob/master/linter/lint.sh what do you think? Does this solve the home path problem?
< zoq> ironstark: About the shell not found error, maybe the header isn't right? Maybe you can push the script.
< kris1> zoq: are you saying that's how we upsample the image? We would downsample the gradients when going backward
< zoq> In this case, yes you can use the parameter from the Forward pass.
< kris1> Hmmm, yes, when you say parameter what do you mean exactly…. there are no learnable parameters for the resize layer as such...
< zoq> I mean the scaling factor or new image size (width/height).
< kris1> I have a half-baked version here, have a look ….. https://gist.github.com/kris-singh/f0f510a7182949a2c12abbd64937e820
< kris1> Does it look okay to you??
< ironstark> zoq: Sure I'll do it after doing the required changes in #104 and #105
< zoq> kris1: The Forward function looks good, but I would write a simple test case to make sure the math is correct and we both haven't missed something.
sumedhghaisas__ has joined #mlpack
< kris1> Hmmm so the backward function would in that case be just the forward function with output and input reversed, right?
< zoq> kris1: multiplied with the error; you can basically just use the ReSampling and DownwardReSampling function from the glimpse layer.
< kris1> But gy would be a one-dim vector, so I would have to resize it first and then make it go through the forward pass….
sumedhghaisas__ has quit [Ping timeout: 240 seconds]
< zoq> yes, or rewrite the function so that it can handle a 1 dim vector :)
< kris1> Ahhh okay….
< zoq> kris1: If you write a test for the interpolation layer (compare against e.g. a manually calculated input/output pair), I can help/write the backward pass for you.
< kris1> Not fully sure how to test this in a neural network setting though. I can test it out as an individual layer, but testing it out as part of a nn is tricky, I guess.
< zoq> testing it independently is just fine
< kris1> Ahhh, okay, then I think writing a test would be simple. I will write it out after I finish the implementation.
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< kris1> lozhnikov: I still can't recreate your results with the digits dataset. The train loss does not go down for me, even though I used the patch as you had suggested.
< kris1> My generated results are totally random.
< kris1> zoq: I was able to implement the ResizeLayer. Have a look here https://github.com/kris-singh/mlpack/tree/ResizeLayer
< kris1> But I am getting a weird undefined symbol error for the Reset function in the linear layer
< kris1> Also, should I do this as a separate PR, or should I just include it in the GAN PR?