verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
< kris___>
I did not understand your point regarding checking the resize module.
< kris___>
In the case of logistic regression and ssRBM we are training on a 1 * 49 * 8 * 8 * 3 vector, so how does it matter how we reshape it?
< kris___>
the random patches come to about 245 MB, since there are only 10 random patches per image for 5000 images.
< kris___>
For the convolutional patches we get around 1.2 GB for 5000 images.
< lozhnikov>
kris___: Looks like random_patch.py is correct. Regarding the reshape() function: I think the reshape order could affect indexing and patch sampling, therefore I guess it is reasonable to verify each step of the algorithm, i.e. visualize the images and patches.
< lozhnikov>
if I understand right each column represents a separate image, right?
< kris___>
well, the original data shape is 192 x 4000. So temp actually evaluates to a matrix of 192 x 10, and then we serialise this; this now represents a single image.
< kris___>
192*4000
< lozhnikov>
Why is the shape 192x4000? The shape of each image is 32*32, and images are colored, so the number of rows should be equal to 3*32*32.
< kris___>
(8*8*3) x (10 * 400): here 8*8 patches are used.
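A minimal shape check of the layout being described: 10 random 8x8x3 patches per image, each flattened into one column. numpy is assumed here; this is only a sketch, not the actual random_patch.py.

```python
import numpy as np

# Shape sketch for the matrix under discussion: 10 random 8x8x3 patches
# per image, each flattened into a 192-entry column. (numpy assumed;
# this is not the actual random_patch.py.)
num_images = 400
patches_per_image = 10
patch_len = 8 * 8 * 3            # 192 rows per flattened color patch

data = np.zeros((patch_len, patches_per_image * num_images))
print(data.shape)                # (192, 4000)
```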
< kris___>
We would have to train on patches, since we will be training the ssRBM on patches also.
< lozhnikov>
patches are used in order to train the RBM, but you don't use the RBM at all
< kris___>
I am saying that since patches are used to train the RBM, a fair comparison would be to train a classifier on these patches.
< kris___>
Are you saying that we should train the classifier not on the patches but on the images?
< kris___>
That would be an unfair comparison.
< lozhnikov>
Why do you think so?
< kris___>
Well, simply because one classifier gets to see the whole image, while the other classifier sees only parts of the image.
< lozhnikov>
no, you use the ssRBM in order to extract features convolutionally from the whole image
< kris___>
What is the input to the ssRBM?
< lozhnikov>
the input contains patches
< kris___>
What is the output of the ssRBM? Are you suggesting we get the convolutional kernels from the ssRBMs?
< lozhnikov>
kris___: check the paper. The paper states you should extract 49*N features and train the classifier on them.
< lozhnikov>
"We train mcRBM on 8x8 color image patches sampled at random locations, and then we apply the algorithm to extract features convolutionally over the whole 32x32 image by extracting features on a […] we use a multinomial logistic regression classifier to recognize the object category in the image."
< kris___>
I do not understand "we apply the algorithm to extract features convolutionally over the whole 32x32 image by extracting features on a"
< kris___>
What algorithm ...
< lozhnikov>
the algorithm that samples hidden variables
< lozhnikov>
so, they don't use patches for training the classifier
< kris___>
Okay, so you train an ssRBM on random patches of size 8*8*3. Then, after the ssRBM is trained, you sample the hidden variables of the ssRBM on test data to extract features. That is what I understood from the paper.
< kris___>
Using the extracted features you train a classifier.
< lozhnikov>
yeah
< kris___>
I do not get the part that says we extract features convolutionally.
< kris___>
Where is the convolution operator applied here?
< lozhnikov>
Sample 49 patches from each image using a regularly spaced 7*7 grid
< lozhnikov>
then sample hidden variables from each patch and concatenate these hidden variables
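The grid-sampling step described above can be sketched as follows. `sample_hidden` is a hypothetical stand-in for the trained ssRBM's hidden-variable sampler, and the stride of 4 is an assumption inferred from fitting 8x8 patches on a regularly spaced 7x7 grid over a 32x32 image.

```python
import numpy as np

def sample_hidden(patch, n_hidden=16):
    # hypothetical stand-in for the trained ssRBM's hidden-variable sampler
    return np.zeros(n_hidden)

def extract_features(image, patch_size=8, stride=4, n_hidden=16):
    feats = []
    # stride 4 gives a regularly spaced 7x7 grid of 8x8 patches on a 32x32 image
    for i in range(0, image.shape[0] - patch_size + 1, stride):
        for j in range(0, image.shape[1] - patch_size + 1, stride):
            patch = image[i:i + patch_size, j:j + patch_size, :]
            feats.append(sample_hidden(patch, n_hidden))
    return np.concatenate(feats)   # 49 hidden vectors concatenated

image = np.zeros((32, 32, 3))
print(extract_features(image).shape)   # (784,), i.e. 49 * 16
```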
< kris___>
Okay, I get it. Then why are the initial patches taken randomly? We could also sample them in a regularly spaced 7*7 grid manner.
< lozhnikov>
not sure. maybe in order to decrease the size of the dataset
< kris___>
Okay, I get the training algorithm now. Sample 10 random 8*8*3 patches from, say, 400 images ==> 400 * (8*8*3 * 10). Then you train the ssRBM on these patches. Then you take the input images again, sample 49 patches of size 8*8*3 per image on the 7*7 grid ==> 400 * (8*8*3 * 49), and using those get the hidden variables. Now you have an N*49 vector per image. Train a classifier on (N*49 x num_images, image_labels).
< kris___>
Is this correct.
< lozhnikov>
exactly
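The pipeline agreed on above can be sketched at a high level; all names and the patch-assembly code here are hypothetical placeholders, not mlpack APIs, and the ssRBM training and classifier steps are left as comments.

```python
import numpy as np

# High-level sketch of the agreed pipeline (hypothetical placeholders).
num_images, n_hidden = 400, 16
rng = np.random.default_rng(0)

images = rng.random((num_images, 32, 32, 3))
labels = rng.integers(0, 10, num_images)

# 1) 10 random 8x8x3 patches per image -> a (192, 4000) training matrix
train_patches = np.stack([
    images[i, r:r + 8, c:c + 8, :].ravel()
    for i in range(num_images)
    for r, c in zip(rng.integers(0, 25, 10), rng.integers(0, 25, 10))
], axis=1)
print(train_patches.shape)   # (192, 4000)

# 2) train the ssRBM on train_patches (placeholder step)

# 3) for each image, sample 49 grid patches and concatenate their hidden
#    variables -> one (49 * n_hidden) feature vector per image
features = np.zeros((num_images, 49 * n_hidden))

# 4) train a multinomial logistic regression on (features, labels)
print(features.shape)        # (400, 784)
```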
< kris___>
Okay thanks. I was very confused earlier.
< lozhnikov>
however, I suggest increasing the number of input images (400)
< kris___>
Okay, but with more than 1000 images the system would become very slow.
< kris___>
It might take till tomorrow to complete the test. I have some other work I need to do.
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
kris1 has quit [Read error: Connection reset by peer]
kris1 has joined #mlpack
kris___ has quit [Quit: Connection closed for inactivity]