verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
mikeling has joined #mlpack
< rcurtin> wow, I just found some really cool usage of mlpack right before I went to bed:
< rcurtin> the paper is paywalled, but I want to highlight a bit in there:
< rcurtin> "MLPack [15] was chosen to be used for the NN library. Although MLPack's artificial neural network (ANN) is not as mature as other software libraries, it provided many advantages.
< rcurtin> First, the build process was relatively simple and the required dependency list was short.
< rcurtin> Second, its API is well-documented both with function definitions in Doxygen and code usage examples.
< rcurtin> Third, other machine learning algorithms in MLPack have been used by our cognitive communications colleagues at NASA GRC, so using a common library would ease incorporation of our cognitive engine with their activities.
< rcurtin> Additionally, MLPack does not support the Levenberg-Marquardt backpropagation algorithm for training. The authors wrote their own implementation of the algorithm and verified its performance using MATLAB's "trainlm" function in its Neural Network Toolbox."
< rcurtin> This is really exciting; I think I will send the authors an email to find out if our code has been used in space :)
< rcurtin> (this is a personal life goal for me, to do something that ends up in space... I guess code counts, sort of !)
< rcurtin> (also, still lots of confusion about the capitalization of mlpack... not sure how to fix that...)
vivekp has quit [Ping timeout: 248 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
kris1 has quit [Quit: kris1]
kris___ has quit [Quit: Connection closed for inactivity]
kris1 has joined #mlpack
< zoq> rcurtin: Exciting, we should implement Levenberg-Marquardt and release it as a space edition :)
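For context, the Levenberg-Marquardt training mentioned above interpolates between Gauss-Newton and gradient descent via a damping term. Below is a minimal NumPy sketch of the core update (not mlpack's API; the function names and the toy exponential-fit problem are illustrative):

```python
import numpy as np

def levenberg_marquardt(residual, jacobian, x0, lam=1e-2, iters=50):
    """Minimize ||residual(x)||^2 with a damped Gauss-Newton step."""
    x = x0.astype(float)
    for _ in range(iters):
        r = residual(x)
        J = jacobian(x)
        # Solve (J^T J + lam * I) delta = -J^T r
        A = J.T @ J + lam * np.eye(x.size)
        delta = np.linalg.solve(A, -J.T @ r)
        x_new = x + delta
        if np.sum(residual(x_new) ** 2) < np.sum(r ** 2):
            x = x_new
            lam *= 0.5   # good step: move toward the Gauss-Newton direction
        else:
            lam *= 2.0   # bad step: fall back toward gradient descent
    return x

# Toy problem: fit y = a * exp(b * t); the true parameters are (a, b) = (2.0, -0.5).
t = np.linspace(0, 4, 20)
y = 2.0 * np.exp(-0.5 * t)
res = lambda p: p[0] * np.exp(p[1] * t) - y
jac = lambda p: np.stack([np.exp(p[1] * t), p[0] * t * np.exp(p[1] * t)], axis=1)
p = levenberg_marquardt(res, jac, np.array([1.0, 0.0]))
```

The damping schedule (halve on improvement, double on failure) is one common heuristic; MATLAB's trainlm, which the paper's authors validated against, adapts the damping similarly.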
< zoq> Also, I always wonder why people write MLPack; the main paper uses MLPACK, so I'm not sure I get the connection. I guess at this point we've settled on mlpack; at least that's what I use :)
vivekp has quit [Ping timeout: 260 seconds]
vivekp has joined #mlpack
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
sumedhghaisas has joined #mlpack
sumedhghaisas has quit [Remote host closed the connection]
sumedhghaisas has joined #mlpack
sumedhghaisas has quit [Remote host closed the connection]
kris___ has joined #mlpack
< kris___> This is the output on cifar dataset with 5000 images. Something seems to be wrong in the evaluation code.
< lozhnikov> kris___: try to increase the radius e.g. multiply it by 1.5 or 2
< lozhnikov> actually, the paper doesn't restrict the upper bound
< kris1> I did try that; I increased it by a factor of 3, but the same thing happens.
< kris1> I have done the preprocessing as per the paper.
< kris1> I think something might be wrong in the preprocessing part.
< kris1> Could you have a look at the gist?
< lozhnikov> maybe it is reasonable to increase slabPenalty and initial visiblePenalty
< kris1> Hmm, right now I am using the values provided in the paper; let me make the changes and see.
< lozhnikov> I looked through the preprocessing part. Actually it doesn't correspond to the paper. The paper states:
< lozhnikov> "We use the following protocol. We train mcRBM on 8x8
< lozhnikov> color image patches sampled at random locations, and then
< lozhnikov> we apply the algorithm to extract features convolutionally
< lozhnikov> over the whole 32x32 image by extracting features on a
< lozhnikov> 7x7 regularly spaced grid (stepping every 4 pixels)"
< kris___> Yes, but why do you say that the preprocessing is different?
< kris___> patches[:,:,channel, i * 7 + j, img] = img_data[i*4 : i*4 + 8, j*4 : j*4+8, channel]
< lozhnikov> 1. You don't sample patches at random locations
< kris___> I did it like we do with a CNN.
< lozhnikov> 2. You don't sample color patches; you use whitening instead.
< lozhnikov> okay, the authors refer to paper [13]: "We produced image features from an ssRBM model trained on patches using the same procedure as [13]"
< lozhnikov> and that paper states the same that I just wrote
< lozhnikov> i.e. you should sample a number of color patches at different locations and then train the ssRBM on them
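Sampling training patches at random locations, as lozhnikov describes, could look like this (a hypothetical helper, assuming images stored as an (n, 32, 32, 3) array):

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_random_patches(images, n_patches, patch=8):
    """images: (n, 32, 32, 3); draw n_patches color patches at random locations."""
    n, h, w, c = images.shape
    out = np.empty((n_patches, patch * patch * c))
    for k in range(n_patches):
        idx = rng.integers(n)                  # random image
        r = rng.integers(h - patch + 1)        # random top-left corner
        col = rng.integers(w - patch + 1)
        out[k] = images[idx, r:r + patch, col:col + patch, :].ravel()
    return out

train = sample_random_patches(np.zeros((10, 32, 32, 3)), 1000)
```

The number of patches to draw is indeed an extra knob; as noted below, the paper does not pin it down.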
< kris___> Hmm, so I shouldn't center and whiten the patches?
< kris___> Also, how many random samples should I take? Is that another hyperparameter?
< lozhnikov> it seems the paper doesn't describe that
< kris___> Centering and whitening of the patches are required AFAIK; otherwise we get the assertion error that visiblePenalty is less than zero.
< lozhnikov> Could you elaborate a bit? I don't see the connection
< kris___> Well, when I did not do the preprocessing I was getting this error. I don't have a theoretical explanation right now.
< kris___> I would have to work that out.
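The centering-and-whitening step being debated is commonly done with ZCA whitening on the flattened patches. A minimal sketch, assuming patches are rows of X (the `eps` regularizer is an illustrative choice, not a value from the paper):

```python
import numpy as np

def zca_whiten(X, eps=1e-2):
    """Center the rows of X and apply ZCA whitening:
    X_w = (X - mu) @ W with W = E diag(1/sqrt(s + eps)) E^T,
    where (s, E) are the eigenvalues/vectors of the sample covariance."""
    mu = X.mean(axis=0)
    Xc = X - mu
    cov = Xc.T @ Xc / X.shape[0]
    s, E = np.linalg.eigh(cov)
    W = E @ np.diag(1.0 / np.sqrt(s + eps)) @ E.T
    return Xc @ W

X = np.random.default_rng(1).normal(size=(500, 48))
Xw = zca_whiten(X)
```

After whitening, the per-dimension variance is roughly 1, which plausibly explains why skipping it can push learned penalty terms out of their valid range, though as said above that connection would need to be worked out properly.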
mikeling has quit [Quit: Connection closed for inactivity]
< kris___> lozhnikov: I re-read the training procedure. I don't understand it.
< kris___> You train the ssRBM on color image patches
< kris___> randomly sampled
< lozhnikov> yeah
< kris___> but i don't get this line "extract features convolutionally"
< kris___> Are these required for training the logistic regression?
< kris___> because we already have the features for the ssRBM.
< lozhnikov> I think that means you have to sample 49 hidden variables from the image and concatenate them
< kris___> Yes but those features are required for which algorithm?
< lozhnikov> that is you should sample 49 patches (the shape of the image is 32x32, the step size is equal to 4), then sample hidden variables from each patch and concatenate the results
< lozhnikov> then you train logistic regressor on these features
< lozhnikov> If I understand your question right these features are required for logistic regression
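The pipeline lozhnikov describes (49 grid patches per image, hidden activations from the trained model per patch, concatenated into one feature vector for the logistic regressor) can be sketched like this. The `rbm_hidden` stand-in below is hypothetical; the real ssRBM's conditional hidden distribution is more involved than a plain sigmoid layer:

```python
import numpy as np

def rbm_hidden(patch_vec, W, b):
    """Stand-in for the trained model's hidden activations given a flattened patch."""
    return 1.0 / (1.0 + np.exp(-(W @ patch_vec + b)))   # sigmoid units

def image_features(img, W, b, patch=8, stride=4):
    """Concatenate hidden activations from the 49 grid patches of one 32x32 image."""
    n = (img.shape[0] - patch) // stride + 1             # 7 positions per axis
    feats = []
    for i in range(n):
        for j in range(n):
            v = img[i * stride:i * stride + patch, j * stride:j * stride + patch, :].ravel()
            feats.append(rbm_hidden(v, W, b))
    return np.concatenate(feats)                          # length 49 * n_hidden

rng = np.random.default_rng(2)
W, b = rng.normal(size=(16, 192)) * 0.01, np.zeros(16)
f = image_features(rng.normal(size=(32, 32, 3)), W, b)
```

These concatenated vectors, one per image, are what the logistic regressor is then trained and evaluated on.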
< kris___> Okay, so this is for the testing part of the algorithm.
< lozhnikov> yeah
< kris___> The ssRBM is trained on random 8x8 patches. Is that correct?
< lozhnikov> yeah, I think that's correct
< kris___> Okay, I will try to complete this by morning.
< kris___> What should we do about the GAN?
< lozhnikov> I think we should complete the test
< kris___> The problem seems to be that I don't have computational resources.
< lozhnikov> But I have :)
< kris___> Okay, I think we should complete the Batch Normalization just to be sure.
< lozhnikov> actually, I think we've got another issue. The oreilly example states that the step size should be equal to 3e-4. But our implementation shouldn't work with this step size (you can check the loss function to confirm)
< lozhnikov> it is enough to look at the pretrain phase
< lozhnikov> so, I guess the implementation of the discriminator differs from the oreilly example
< lozhnikov> but the discriminator network hasn't got batch normalization layers at all
kris1 has quit [Quit: kris1]
kris___ has quit [Quit: Connection closed for inactivity]