verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
kris1 has quit [Quit: kris1]
< lozhnikov>
kris__: you didn't take care of the depth. Each convolution layer produces an output of size 28*28*depth. So you should resize each slice of the image.
< lozhnikov>
I see two possible ways:
< lozhnikov>
1. Add the third dimension to BiLinearFunction in such a way that the input has the shape (x, y, z) and the output has the shape (nx, ny, nz).
< lozhnikov>
2. Add the number of slices to BiLinearFunction in such a way that the input has the shape (x, y, depth) and the output has the shape (nx, ny, depth), i.e. the depth stays constant.
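To make option 2 concrete, here is a rough standalone sketch (the function name ResizeWithDepth and the simple bilinear blend are illustrative only, not the actual BiLinearFunction code) of resizing a flattened inRowSize x inColSize x depth column slice by slice, keeping the depth constant:

    #include <algorithm>
    #include <armadillo>

    // Sketch only: resize a flattened inRowSize x inColSize x depth input to
    // outRowSize x outColSize x depth, interpolating each slice independently.
    arma::vec ResizeWithDepth(const arma::vec& input,
                              const size_t inRowSize, const size_t inColSize,
                              const size_t outRowSize, const size_t outColSize,
                              const size_t depth)
    {
      arma::cube out(outRowSize, outColSize, depth);
      const double scaleR = (double) inRowSize / outRowSize;
      const double scaleC = (double) inColSize / outColSize;

      for (size_t k = 0; k < depth; ++k)          // the depth stays constant
      {
        // Offset of slice k in the flat input column.
        const size_t sliceOffset = k * inRowSize * inColSize;
        for (size_t j = 0; j < outColSize; ++j)
        {
          for (size_t i = 0; i < outRowSize; ++i)
          {
            // Nearest lower source pixel, clamped so the 2x2 patch stays inside.
            const size_t r = std::min((size_t)(i * scaleR), inRowSize - 2);
            const size_t c = std::min((size_t)(j * scaleC), inColSize - 2);
            const double dr = i * scaleR - r;
            const double dc = j * scaleC - c;

            // Column-major flat index of (row, col) inside slice k.
            auto at = [&](size_t row, size_t col)
            { return input[sliceOffset + col * inRowSize + row]; };

            // Standard bilinear blend of the four neighbouring pixels.
            out(i, j, k) = at(r, c) * (1 - dr) * (1 - dc) +
                           at(r + 1, c) * dr * (1 - dc) +
                           at(r, c + 1) * (1 - dr) * dc +
                           at(r + 1, c + 1) * dr * dc;
          }
        }
      }

      return arma::vec(out.memptr(), out.n_elem);
    }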
kris__ has quit [Quit: Connection closed for inactivity]
sumedhghaisas has joined #mlpack
kris__ has joined #mlpack
kris1 has joined #mlpack
< kris__>
Hey lozhnikov,
< kris__>
I don't think that's entirely correct. The linear layer doesn't support a depth parameter if you look at it, and it can still be used successfully with the convolution layer.
< kris__>
I will just check why the conv --> linear works...
< kris__>
and then see.
< lozhnikov>
there's no problem using the linear layer with the convolution one. But the resize layer accepts an input of size 28*28, whereas the convolution layer provides an output of size 28 * 28 * depth
< kris__>
Hmmm, in that case a quick solution would be to just overload the constructor of the bilinear function with (inRowSize, inColSize, outRowSize, outColSize, depth).
< lozhnikov>
I agree
< kris__>
I'm just thinking we would have to change the indexing to input(i, j, d) = d * depth + j * colSize + i.
< kris__>
That would require using 3 loops. That makes this pretty expensive.
< lozhnikov>
no, that's incorrect. I think (input(i, j, k) = k * colSize * rowSize + j * colSize + i) is correct
< kris__>
yes sure that still requires using 3 loops though.
< lozhnikov>
sure, the input is 3-dimensional. therefore it requires 3 loops
< kris__>
the input can also have 32 channels (depth = 32), so I think this would be pretty slow. But I think that's the quick solution. Let me implement that...
< lozhnikov>
I don't see any problems here. The O'Reilly example does the same and works very well
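For reference, a tiny standalone check of the layout being assumed above, i.e. how Armadillo stores a rows x cols x depth cube slice by slice in column-major order (plain Armadillo, not mlpack code):

    #include <armadillo>
    #include <cassert>

    int main()
    {
      const size_t rows = 4, cols = 3, depth = 2;
      arma::cube c(rows, cols, depth);
      c.randu();

      // Element (i, j, k) lives at flat offset k * rows * cols + j * rows + i.
      for (size_t k = 0; k < depth; ++k)
        for (size_t j = 0; j < cols; ++j)
          for (size_t i = 0; i < rows; ++i)
            assert(c(i, j, k) == c.memptr()[k * rows * cols + j * rows + i]);

      return 0;
    }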
< kris__>
Okay, I updated the code directly in the GAN PR.
< kris__>
You could have a look.
< lozhnikov>
looks like the changes are correct
sumedhghaisas has quit [Ping timeout: 240 seconds]
partobs-mdp has joined #mlpack
< partobs-mdp>
zoq: Can't figure out the issue with zero_init.hpp not found
< partobs-mdp>
On my computer everything compiles correctly even after make clean
< partobs-mdp>
In CMakeLists it is included, so no idea why Travis doesn't see it
< partobs-mdp>
Could you take a look? (My plane is leaving in 4 hours, so I would appreciate it if someone could respond ASAP)
< zoq>
partobs-mdp: Looks like zero_init was renamed to const_init, so if you swap zero_init with const_init it should work.
< zoq>
partobs-mdp: You might also have to add a default constructor to the ConstInitialization class, that uses 0 as initVal.
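A minimal sketch of the suggested default constructor, assuming ConstInitialization keeps the constant in a member named initVal, as its existing constructor suggests:

    // Sketch of the suggested addition (member name initVal assumed from the
    // existing ConstInitialization constructor).
    class ConstInitialization
    {
     public:
      // Default-construct with 0 so the class can stand in for the old
      // ZeroInitialization.
      ConstInitialization() : initVal(0) { }

      ConstInitialization(const double initVal) : initVal(initVal) { }

     private:
      double initVal;
    };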
kris1 has quit [Quit: kris1]
< kris__>
lozhnikov: the code still behaves pretty weirdly
< kris__>
Different errors on different runs, and sometimes it runs successfully
< partobs-mdp>
(instead of NetworkInitialization<> networkInit();)
< zoq>
Can you push the code?
< partobs-mdp>
Pushed
< partobs-mdp>
Looked at the code more carefully - I found that someone has removed the offset parameter
< partobs-mdp>
Even though it would cause an error otherwise, that is still not the true reason for the error message - it still crashed even after I restored the old implementation of the method
< partobs-mdp>
zoq: Any ideas how to fix that?
< partobs-mdp>
Fixed the error by removing parentheses: NetworkInitialization<ConstInitialization> networkInit;
< partobs-mdp>
Why would putting empty parentheses matter? (provided I have a constructor that takes no parameters)
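For the record, the empty parentheses matter because of standard C++ parsing rules (the "most vexing parse"), independent of mlpack: T x(); declares a function named x returning T, not a default-constructed object. A small illustration:

    // Standard C++ behaviour, unrelated to mlpack: empty parentheses turn the
    // intended object definition into a function declaration.
    struct Init { void Run() { } };

    Init a;     // object, default-constructed
    Init b();   // "most vexing parse": declares a function b() returning Init
    Init c{};   // C++11 brace form: object, default-constructed

    int main()
    {
      a.Run();     // fine
      // b.Run();  // error: b is a function, not an object
      c.Run();     // fine
      return 0;
    }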
< lozhnikov>
kris__: Moreover, I pointed out a few issues on GitHub. There are still some errors: the Evaluate() function returns NaN. Try to figure out which layer produces NaNs and why
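One way to narrow down where the NaNs first appear, sketched with placeholder names (LayerType and its Forward() signature stand in for whatever the GAN code actually uses); Armadillo's has_nan() does the checking:

    #include <armadillo>
    #include <iostream>
    #include <vector>

    // Illustrative NaN hunt: run the forward pass layer by layer and report
    // the first layer whose output contains a NaN.
    template<typename LayerType>
    void FindNaNLayer(std::vector<LayerType*>& layers, const arma::mat& input)
    {
      arma::mat current = input;
      for (size_t l = 0; l < layers.size(); ++l)
      {
        arma::mat next;
        layers[l]->Forward(std::move(current), std::move(next));
        if (next.has_nan())
        {
          std::cout << "Layer " << l << " produced NaNs." << std::endl;
          return;
        }
        current = std::move(next);
      }
      std::cout << "No NaNs in the forward pass." << std::endl;
    }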
sumedhghaisas has joined #mlpack
kris__ has quit [Quit: Connection closed for inactivity]
kris1 has joined #mlpack
sumedhghaisas has quit [Read error: Connection reset by peer]
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
johnlennon has joined #mlpack
johnlennon has quit [Client Quit]
kris__ has joined #mlpack
sumedhghaisas has joined #mlpack
< sumedhghaisas>
zoq: Hey Marcus... there?
< zoq>
sumedhghais: yes about to step out.
< sumedhghaisas>
zoq: Will catch up with you later then. Wanted to talk to you about that Windows problem
< sumedhghaisas>
Have fixed all the other comments
< sumedhghaisas>
Also fixing the batch norm
< zoq>
sumedhghais: Commented on the PR, just a few seconds ago.
< sumedhghaisas>
ahh... okay. I think that should work for now. I am not sure about the performance comparison
< sumedhghaisas>
And the batch norm implementation is not passing the gradients tests
< zoq>
hm, okay for the linear layer, I guess
< sumedhghaisas>
zoq: sorry, didn't get that. Okay for the linear layer?
< zoq>
You tested the batchnorm with the linear layer and not with the conv layer?
< sumedhghaisas>
ahh I know the problem... the output is always zero, because there is only a single entry in the batch
< sumedhghaisas>
how do I use multiple entries in a batch?
< sumedhghaisas>
yeah, with a linear layer. The gradient tests should pass, right? Or am I missing something?
< zoq>
I'm about to refactor the FFN class so that it works with real batches; for now you have to manually pass a batch to the layer with n_cols > 1.
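The single-entry effect is easy to see outside the layer: batch norm subtracts the per-feature mean taken across the columns of the batch, and with one column the mean is the sample itself, so the normalised output is identically zero. A plain Armadillo sketch, independent of the BatchNorm implementation in the PR:

    #include <armadillo>
    #include <iostream>

    int main()
    {
      arma::mat single(5, 1, arma::fill::randn);   // one sample in the batch
      arma::mat batch(5, 32, arma::fill::randn);   // 32 samples in the batch

      // Normalise each feature (row) across the batch (columns).
      auto normalise = [](const arma::mat& x)
      {
        arma::mat mean = arma::repmat(arma::mean(x, 1), 1, x.n_cols);
        arma::mat var  = arma::repmat(arma::var(x, 1, 1), 1, x.n_cols);
        return arma::mat((x - mean) / arma::sqrt(var + 1e-8));
      };

      std::cout << normalise(single) << std::endl;  // all zeros
      std::cout << normalise(batch)  << std::endl;  // non-trivial values
      return 0;
    }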
< sumedhghaisas>
ooohhh... okay. What is the refactoring? maybe I can help a little? if we finish it ... then I can properly test batch norm...
< zoq>
Currently the FFN class splits the input using .col, so basically all we have to do is to use cols or submat. Rajiv already put a lot of work into supporting batches: https://github.com/mlpack/mlpack/pull/1073
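In Armadillo terms, the difference is just which part of the predictor matrix reaches the layer; a generic illustration (not the actual FFN code, and the batch size of 32 is arbitrary):

    #include <algorithm>
    #include <armadillo>

    int main()
    {
      arma::mat predictors(10, 256, arma::fill::randn);
      const size_t batchSize = 32;

      for (size_t begin = 0; begin < predictors.n_cols; begin += batchSize)
      {
        // .col(begin) would hand the layer one sample;
        // .cols(begin, end) hands it the whole mini-batch (n_cols > 1).
        const size_t end =
            std::min(begin + batchSize, (size_t) predictors.n_cols) - 1;
        arma::mat batch = predictors.cols(begin, end);
        // layer.Forward(std::move(batch), ...);
      }
      return 0;
    }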
< sumedhghaisas>
Okay I will try to take a look at his code
sumedhghaisas has quit [Ping timeout: 246 seconds]