verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
keonkim has joined #mlpack
kris1 has joined #mlpack
kris1 has quit [Quit: kris1]
govg has joined #mlpack
partobs-mdp has joined #mlpack
< partobs-mdp> zoq: rcurtin: Right now migrating back to LayerTypes. To this end, I'm trying to merge Sumedh's code into my PR. However, I get this compilation error (from gru_impl.hpp):
< partobs-mdp> outputHidden2GateModule = new LinearNoBias<>(outSize, outSize);
< partobs-mdp> (weird stuff, only last line is there)
< partobs-mdp> Well, it just doesn't paste T_T
< partobs-mdp> I'll paste the gist in a minute
< partobs-mdp> The strange thing is that it crashes on LinearNoBias (complaining that it doesn't have all the template arguments) but doesn't crash on Linear
< partobs-mdp> Obviously, it just doesn't see LinearNoBias in using LayerTypes = boost::variant<...>, but why?
< zoq> partobs-mdp: What does layer_types.hpp look like?
< zoq> partobs-mdp: looks good, I'll have to take a closer look into the issue
kris1 has joined #mlpack
< zoq> partobs-mdp: I can't test it right now, but what happens if you put 'template<typename InputDataType, typename OutputDataType> class GRU;' after the LSTM in layer_types.hpp?
< partobs-mdp> zoq: Didn't work :(
< zoq> partobs-mdp: Okay, I guess what you could do is remove the GRU-related code; I'll see if I can take a closer look at the issue in the next few hours.
< zoq> partobs-mdp: Including "gru.hpp" after "lstm.hpp" in "layer.hpp" should solve the problem.
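(For reference, a rough sketch of what that fix amounts to; the exact layer list, template arguments, and include order follow whatever layer_types.hpp and layer.hpp already contain:)

    // layer_types.hpp: forward-declare the new layer before the variant that
    // mentions it, next to the existing LSTM declaration.
    template<typename InputDataType, typename OutputDataType>
    class GRU;

    using LayerTypes = boost::variant<
        /* ... existing layer pointers ... */
        LSTM<arma::mat, arma::mat>*,
        GRU<arma::mat, arma::mat>*
        /* ... */
    >;

    // layer.hpp: include the full definition after lstm.hpp so the type is
    // complete wherever the variant is instantiated.
    #include "lstm.hpp"
    #include "gru.hpp"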
< kris1> lozhnikov: I think the O'Reilly example that you mentioned uses the batch norm layer.
< kris1> I don’t think we have that in mlpack right now.
< kris1> Should I skip that layer altogether?
< zoq> kris1: You could test: https://github.com/mlpack/mlpack/pull/955
< zoq> Currently it does not work with the convolution layer.
< kris1> Hmmm, well, I needed it just for that...
< zoq> ah, okay, in this case you probably have to skip the layer for now
< partobs-mdp> zoq: That resolved the issue, but there is still a long way to go - I've got a huge compiler error message. The latest version is pushed.
< zoq> partobs-mdp: Sumedh used another approach to set the Rho: https://github.com/mlpack/mlpack/pull/1094
< zoq> Looks like you missed some files: 'visitor/reset_cell_visitor.hpp' file not found
< kris1> zoq: Do we have some equivalent of the reshape layer available.
< partobs-mdp> zoq: Added reset_cell_visitor and reset_cell_visitor_impl to CMakeLists, still getting an error message - it's huge, but rather repetitive (it mostly complains about some boost::variant issue)
partobs-mdp has quit [Remote host closed the connection]
< zoq> kris1: What does the reshape layer do?
< kris1> Well, the example I am looking at is something like this: there is a linear layer whose output is a column vector, which is reshaped into a 3d matrix with channels = 1 to feed into a CNN. The parameters of the linear layer are also being learned.
< zoq> kris1: So you like to use Linear -> Conv?
< kris1> Yes exactly….
< kris1> Just look at the generator part
< zoq> kris1: You don't need a Reshape layer, the conv layer handles the reshape for you: take a look at the cnn test
< zoq> I see, so in mlpack you don't need a reshape layer
< kris1> I think I get it, thanks...
< zoq> if you need help with the model definition let me know
< kris1> yup sure…
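(A minimal sketch of a Linear -> Convolution head with the FFN API discussed above; the sizes here - a single-channel 7x7 intermediate shape, 5x5 kernels, noiseDim as a placeholder - are purely illustrative and not taken from kris1's model:)

    #include <mlpack/core.hpp>
    #include <mlpack/methods/ann/ffn.hpp>
    #include <mlpack/methods/ann/layer/layer.hpp>

    using namespace mlpack::ann;

    int main()
    {
      const size_t noiseDim = 100;  // placeholder noise size, purely illustrative

      FFN<NegativeLogLikelihood<>, RandomInitialization> generator;

      // Project the noise vector to 7 * 7 = 49 elements ...
      generator.Add<Linear<> >(noiseDim, 7 * 7);
      generator.Add<ReLULayer<> >();

      // ... and let the convolution layer interpret that output as one 7x7 map:
      // Convolution<>(inSize, outSize, kW, kH, dW, dH, padW, padH, inW, inH).
      generator.Add<Convolution<> >(1, 16, 5, 5, 1, 1, 2, 2, 7, 7);
    }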
sheogorath27 has left #mlpack []
shikhar has joined #mlpack
< zoq> partobs-mdp: 'mlpack/methods/visitor/forward_with_memory_visitor.hpp' file not found - we could just remove the header for now; it's only used by the NTM model
< zoq> or maybe not ...
< zoq> partobs-mdp: Looks like you missed adding FFN<NegativeLogLikelihood<>, RandomInitialization>* in layer_types.hpp
< zoq> partobs-mdp: You should use: boost::apply_visitor(ForwardVisitor(std::move(h), std::move(searchOutput)), search); instead of boost::apply_visitor(ForwardVisitor(std::move(h), std::move(searchOutput), search));
< zoq> partobs-mdp: Also it looks like the TreeMemory uses the FFN class instead of LayerTypes.
< zoq> partobs-mdp: And you might need to switch to LayerTypes instead of LayerTypes&.
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
< rcurtin> just a heads-up: there will be some downtime for masterblaster probably late this week or next week; I've managed to convince some people to install a Titan X GPU
< rcurtin> it seems like no long-running benchmark jobs or anything are running, so I think this should be no problem
< zoq> no way ... awesome :)
< rcurtin> too early to celebrate yet, but it seems likely at this point :)
< rcurtin> it should be possible to add a second Titan X a few weeks later, but we need to order some extra hardware and new power supplies for that
< rcurtin> it looks like our power usage for masterblaster is pretty serious already: http://ratml.org/misc/mb_power.jpg
< zoq> oh, wonder what the peaks are
< zoq> also, I forgot ... I'll keep my excitement low at least for the moment
< rcurtin> I guess the peaks are probably big jobs starting
< rcurtin> Erwan_: sorry for the slow response, I have been traveling
< rcurtin> I don't do anything special for deserialization, typically that is just used in the mlpack main programs with 'data::Load()' and 'data::Save()'
< rcurtin> Erwan_: if you want to open a bug report, at this point it sounds like what is going on in your case is a little complex, so maybe that is the easier way to solve it instead of over IRC
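(For anyone following along, model save/load in mlpack is just data::Save()/data::Load() applied to a serializable model object; a minimal sketch, using LogisticRegression and made-up file/object names purely as an example:)

    #include <mlpack/core.hpp>
    #include <mlpack/methods/logistic_regression/logistic_regression.hpp>

    using namespace mlpack;

    int main()
    {
      arma::mat data(5, 100, arma::fill::randu);
      arma::Row<size_t> responses(100, arma::fill::zeros);

      // Train any serializable mlpack model ...
      regression::LogisticRegression<> lr(data, responses);

      // ... then Save()/Load() handle the (de)serialization.
      data::Save("lr.xml", "lr_model", lr);

      regression::LogisticRegression<> reloaded;
      data::Load("lr.xml", "lr_model", reloaded);
    }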
shikhar has quit [Quit: WeeChat 1.7]
vivekp has quit [Ping timeout: 248 seconds]
vivekp has joined #mlpack
govg has quit [Ping timeout: 240 seconds]
mikeling has joined #mlpack
vivekp has quit [Ping timeout: 248 seconds]
vivekp has joined #mlpack
< kris1> Hi zoq, are you there?
< kris1> I have implemented that example; I am just having difficulty with the generator part.
< kris1> I am confused about what the padding size should be for the generator network. Since the strategy is 'same' padding, the input and output dimensions should be the same.
< kris1> My calculation for the padding gives a very large value, so I am confused - could you take a look? I am using the formulas given here: http://cs231n.github.io/convolutional-networks/#conv
< kris1> The padding size is coming out to be around 29, which seems wrong to me...
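(For reference, the cs231n formula referenced above gives outWidth = (W - F + 2P)/S + 1; for 'same' output with stride 1 this solves to P = (F - 1)/2, which is always small relative to the kernel:)

    // Solve W = (W - F + 2P)/1 + 1 for P  =>  P = (F - 1) / 2.
    const int F = 5;            // kernel width, purely illustrative
    const int P = (F - 1) / 2;  // => 2; a pad of ~29 would imply a ~59-wide kernel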
< kris1> lozhnikov: I also tried a classification test for the GAN, but the results were not good... I used gaussian(0, 1) as the real data and uniform(-5, +5) as the noise, and trained the GAN using that.
< kris1> Then I generated further data from the same distributions and tried to predict their labels using the Discriminator. I was getting around 33% accuracy; I don't know why.
< kris1> But i did not explore the idea further.
vivekp has quit [Ping timeout: 246 seconds]
vivekp has joined #mlpack
mikeling has joined #mlpack
vivekp has quit [Ping timeout: 260 seconds]
kris1 has joined #mlpack
vivekp has joined #mlpack
< kris1> Figured out the convolution part....
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< rcurtin> zoq: as I was going through the static analyzer output, I came across this one:
< rcurtin> the issue is that i isn't used in the inner loop, but I don't know enough about the test to know what the right solution is
< rcurtin> I think that you wrote the code, do you have an idea of what the right thing to do is?
< zoq> rcurtin: hm, I don't see any problem with the unused i; however, we will rewrite that part anyway once we implement the field interface.
< zoq> kris1: The pad size should be < the kernel size, so unless your kernel size is > 29, I agree it's strange.
< zoq> kris1: Sounds like you figured it out?
< rcurtin> zoq: ok, then I guess it is just a training loop that will train a certain number of times
< zoq> rcurtin: right
< kris1> Well, kinda - now there is no segmentation fault, but the program still isn't working. I used the calculations given here: https://stackoverflow.com/questions/37674306/what-is-the-difference-between-same-and-valid-padding-in-tf-nn-max-pool-of-t
< kris1> The only confusion is that TensorFlow uses 4 values for padding.
< zoq> different values?
< kris1> and if you go through the calculation you would see that pad_top = 1, pad_bottom = 1 - 1 ...
< zoq> Depending on the kernel it might make sense to pad differently, but it's uncommon, and the current conv layer does not support that.
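(Roughly what the linked Stack Overflow answer computes: TensorFlow's SAME padding splits a total pad between the two sides of each dimension, which is where the four values, and the possible off-by-one between top and bottom, come from. A sketch with made-up function and parameter names:)

    #include <algorithm>
    #include <cstddef>

    // TensorFlow-style SAME padding for one dimension: returns the output size
    // and fills in the two pad values, which differ by one when the total is odd.
    std::size_t SamePad(const std::size_t in, const std::size_t filter,
                        const std::size_t stride,
                        std::size_t& padLow, std::size_t& padHigh)
    {
      const std::size_t out = (in + stride - 1) / stride;  // ceil(in / stride)
      const long total = std::max<long>(
          (long) ((out - 1) * stride + filter) - (long) in, 0);
      padLow = total / 2;        // e.g. pad_top or pad_left
      padHigh = total - padLow;  // e.g. pad_bottom or pad_right
      return out;
    }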
< zoq> kris1: Looks good to me; if you say it's not working, can you elaborate on that?
< kris1> Well, I am digging into it, but for some reason the program keeps looping on the generator.Forward call...
< zoq> hm, I can't see anything right away, probably have to step through the code.
< zoq> btw. I'm curious, what is the status of the RBM code?
< kris1> Well, from my side it's done.
< zoq> nice
< zoq> excited to test it out
< kris1> Both the RBM and ssRBM. I think lozhnikov said that if he finds something else he would comment there.
< kris1> It seems I would not be able to complete the stacked GAN in the specified time, though.
< zoq> I see, I don't think that's a problem.
< rcurtin> I figured the parser was tripping over something; I'm not sure if that fixes it correctly
< kris1> The program causes a segfault sometimes but works perfectly other times.
< kris1> I'd better check with valgrind.
< zoq> kris1: good idea
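(Typically that would be an invocation along the lines of 'valgrind --leak-check=full --track-origins=yes ./test-binary', with the binary name here being a placeholder.)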
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
< kris1> Hmmm, I tried to look at the valgrind message; it does an invalid read of size 4, but other than that the error message is pretty obscure...
< kris1> Could you have a look at it? I have updated the gist, btw.
< zoq> Sure, I'll set it on my todo list for tomorrow.