verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
marcosirc has quit [Quit: WeeChat 1.4]
< zoq> nilay: Here is the modified backward pass I used to build the code: https://gist.github.com/zoq/c1e373837f78948eb9104ab5dd844e9b
< zoq> nilay: The problem here is that slices() returns a subview, but the layer expects some matrix type (arma::Mat<eT> or arma::Cube<eT>).
< zoq> nilay: You have to call the backward function of the base layer with some valid input; in this case it's: base1.Backward(base1.OutputParameter(), ..., ...)
< zoq> nilay: I think you can use the OutputParameter in every case, but using a dummy parameter, as you do right now, is also a neat idea.
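A minimal sketch of the pattern zoq describes, using a toy layer (ToyBaseLayer is hypothetical, not mlpack's BaseLayer): the backward call expects a concrete arma::Mat/arma::Cube, so a subview returned by slices() has to be materialized first, and the stored OutputParameter can serve as the valid input:

    // Toy illustration; ToyBaseLayer only mirrors the pattern discussed above.
    #include <armadillo>

    struct ToyBaseLayer
    {
      arma::cube outputParameter;  // activation stored during the forward pass

      // Backward expects concrete cube types, not subviews.
      void Backward(const arma::cube& input, const arma::cube& error,
                    arma::cube& delta)
      {
        // Logistic-style derivative computed from the stored activation.
        delta = error % (input % (1.0 - input));
      }
    };

    int main()
    {
      ToyBaseLayer base1;
      base1.outputParameter = arma::randu<arma::cube>(4, 4, 3);

      arma::cube bigError = arma::randu<arma::cube>(4, 4, 6);

      // slices() yields a subview; materialize it into a cube before the call.
      arma::cube error = bigError.slices(0, 2);

      arma::cube delta;
      // Pass the stored activation as the input instead of a dummy value.
      base1.Backward(base1.outputParameter, error, delta);

      delta.print("delta:");
      return 0;
    }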
nilay has joined #mlpack
Ritwik has quit [Quit: Page closed]
< nilay> zoq: thanks, I don't think I could have figured out that error.
Mathnerd314 has quit [Ping timeout: 240 seconds]
mentekid has joined #mlpack
mentekid has quit [Ping timeout: 276 seconds]
nilay has quit [Ping timeout: 250 seconds]
mentekid has joined #mlpack
marcosirc has joined #mlpack
Mathnerd314 has joined #mlpack
mentekid has quit [Ping timeout: 240 seconds]
nilay has joined #mlpack
Mathnerd314 has quit [Ping timeout: 246 seconds]
nilay has quit [Ping timeout: 250 seconds]
nilay has joined #mlpack
< nilay> zoq: hi, for base_layer, would the backprop error always be a matrix type, whatever the input?
< nilay> matrix meaning 2d matrix
< zoq> if the input is a cube the error should also be a cube type
< nilay> which it is
< nilay> but what's happening is, it goes to this function (line 76, base_layer.hpp) and segfaults there
< nilay> when the error is a cube type
< nilay> I am providing a dummy input since the input is not required
< zoq> can you update the code, I'm not sure I'm looking at the same line
< zoq> or are we talking about line 76 in base_layer.hpp?
< nilay> yes
< nilay> so should the backward pass go here for CNNs?
< zoq> the input for the base layer is required: ActivationFunction::deriv(input, derivative);
< nilay> ok
< nilay> thanks, that's the error then.
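A rough standalone illustration of the cause of the segfault, under the assumption that the base layer's backward pass evaluates the activation derivative on whatever input it is handed (Deriv and Backward below are hypothetical stand-ins using a logistic derivative, not mlpack's base_layer.hpp):

    // Rough stand-in for a base-layer backward pass (hypothetical names).
    #include <armadillo>
    #include <iostream>

    // Plays the role of ActivationFunction::deriv() with a logistic derivative.
    void Deriv(const arma::cube& input, arma::cube& derivative)
    {
      derivative = input % (1.0 - input);
    }

    void Backward(const arma::cube& input, const arma::cube& error,
                  arma::cube& delta)
    {
      arma::cube derivative;
      Deriv(input, derivative);        // evaluated on the input that was passed in
      delta = error % derivative;      // requires input and error sizes to match
    }

    int main()
    {
      arma::cube activation = arma::randu<arma::cube>(8, 8, 4);
      arma::cube error = arma::randu<arma::cube>(8, 8, 4);
      arma::cube delta;

      Backward(activation, error, delta);   // fine: a valid input with matching size

      // arma::cube dummy;                  // an empty dummy input would leave
      // Backward(dummy, error, delta);     // derivative empty and break the product

      std::cout << "delta: " << delta.n_rows << "x" << delta.n_cols
                << "x" << delta.n_slices << std::endl;
      return 0;
    }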
< nilay> also, I was thinking of implementing concat_layer. I have one doubt though: I should take as input the number of layers to concatenate and the collection of layers to be concatenated, so what should I use to take the "collection of layers" as input?
< nilay> could i use a tuple just like in the network?
< zoq> hm, yeah I guess that's probably the best solution here, good idea
< nilay> ok
< zoq> Maybe another solution is to take two conv nets as input ... maybe your idea is cleaner
< nilay> how would that work?
< nilay> there could be more than 2 layers being concatenated at a time, like in the inception layer
< zoq> yeah, in that case you have to use two concat layers, e.g. concat(concat(A, B), C); as I said, your idea is nice
< nilay> ok, let's see if I can implement it nicely :)
< zoq> let me know if you need help
< nilay> yeah sure
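A minimal sketch of the tuple-based concat idea discussed above. ConcatSketch, ScaleLayer, and their members are hypothetical; the sketch only shows how a std::tuple of layers can be stored (as in the network class) and their outputs joined along the slice dimension:

    // Hypothetical sketch; not mlpack's concat layer.
    #include <armadillo>
    #include <cstddef>
    #include <iostream>
    #include <tuple>
    #include <type_traits>
    #include <utility>

    template<typename... Layers>
    class ConcatSketch
    {
     public:
      ConcatSketch(Layers&&... layers) : network(std::forward<Layers>(layers)...) { }

      // Run every stored layer on the same input and join the outputs along
      // the third (slice) dimension.
      arma::cube Forward(const arma::cube& input)
      {
        arma::cube output;
        ForwardImpl<0>(input, output);
        return output;
      }

     private:
      template<std::size_t I>
      typename std::enable_if<(I < sizeof...(Layers)), void>::type
      ForwardImpl(const arma::cube& input, arma::cube& output)
      {
        arma::cube layerOutput = std::get<I>(network).Forward(input);
        if (output.n_elem == 0)
        {
          output = layerOutput;
        }
        else
        {
          const arma::cube joined = arma::join_slices(output, layerOutput);
          output = joined;
        }
        ForwardImpl<I + 1>(input, output);
      }

      template<std::size_t I>
      typename std::enable_if<(I == sizeof...(Layers)), void>::type
      ForwardImpl(const arma::cube& /* input */, arma::cube& /* output */) { }

      std::tuple<Layers...> network;
    };

    // Toy layer that just scales its input; stands in for the convolution and
    // pooling building blocks of an inception module.
    struct ScaleLayer
    {
      double factor;
      arma::cube Forward(const arma::cube& input) { return factor * input; }
    };

    int main()
    {
      ConcatSketch<ScaleLayer, ScaleLayer, ScaleLayer> concat(
          ScaleLayer{1.0}, ScaleLayer{2.0}, ScaleLayer{3.0});

      arma::cube input = arma::randu<arma::cube>(5, 5, 2);
      arma::cube output = concat.Forward(input);

      // Three layers with 2 slices each -> 6 slices.
      std::cout << "output slices: " << output.n_slices << std::endl;
      return 0;
    }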
Mathnerd314 has joined #mlpack
nilay has quit [Ping timeout: 250 seconds]
nilay has joined #mlpack
< nilay> zoq: when performing the gradient update, if I have a pooling layer before a convLayer, would I have to do anything different?
nilay has quit [Ping timeout: 250 seconds]
nilay has joined #mlpack
< zoq> nilay: You have to set the InputParameter to the OutputParameter() of the layer before the pooling layer.
< nilay> but the problem shows up for the layer after the pooling layer
< zoq> Maybe because the error of the pooling layer isn't correct?
< nilay> what do you mean by correct?
< zoq> Since the backward pass of the layer after the pooling layer depends on the error of the pooling layer, it could be that the calculated error of the pooling layer is not correct. Which depends on the error passed to the pooling layer.
< zoq> not sure, do you get some error message?
< nilay> I get a segfault when trying to update the gradient of the layer after the pooling layer
< nilay> the backward pass works correctly; by correctly I mean it proceeds and the gradient is called after that
< nilay> also I wanted to ask, why do we use an rvalue reference and std::forward instead of an lvalue reference in the constructor for CNN?
< zoq> To call the CNN constructor with temporary values. Can you push the current state of your code?
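A small self-contained example of the point about rvalue references and std::forward: a forwarding-reference constructor can bind to a temporary layer tuple and move it in, while a non-const lvalue reference could not bind to the temporary at all. NetworkSketch is a hypothetical stand-in, not mlpack's CNN class:

    // Illustration of perfect forwarding in a network-style constructor.
    #include <iostream>
    #include <tuple>
    #include <utility>

    struct ConvLayer { int kernelSize; };
    struct PoolLayer { int poolSize;   };

    template<typename LayerTypes>
    class NetworkSketch
    {
     public:
      // A forwarding reference: binds to both temporaries and named tuples.
      // A plain (non-const) lvalue reference would reject the temporary tuple
      // built in main(), and a const& or by-value parameter would force a copy.
      template<typename LayerTupleType>
      explicit NetworkSketch(LayerTupleType&& layers)
          : network(std::forward<LayerTupleType>(layers)) { }

     private:
      LayerTypes network;
    };

    int main()
    {
      // The tuple is a temporary; std::forward lets it be moved into the network.
      NetworkSketch<std::tuple<ConvLayer, PoolLayer>> net(
          std::make_tuple(ConvLayer{5}, PoolLayer{2}));

      (void) net;
      std::cout << "network constructed from a temporary layer tuple" << std::endl;
      return 0;
    }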
< nilay> when we call convPool.Gradient
< zoq> can you check that convPool.InputParameter() and biasPool.Delta() isn't empty
< nilay> ok
< nilay> convPool.InputParameter() does not have the same spatial dimensions as biasPool.Delta()
< nilay> after pooling, I pad convPool to make it 28 x 28
< nilay> should I add padding in the pooling layer also?
< nilay> then this issue would not come up? right now I am doing pooling, then padding in the following convLayer
< nilay> would using an lvalue reference in the CNN constructor be a bad idea?
< zoq> hm, I think what we should do is write a small test which first calls the forward pass with some input data that is much smaller than the input you are using right now. Check if the output looks right. Afterwards, we use some error and call the backward function, and make sure the output looks right.
< zoq> Once we've checked the forward and backward functions, we test the gradient function. It's hard to track down an error in the gradient function if we can't be sure the other two functions are correct. Do you think that's reasonable?
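A sketch of the testing workflow zoq proposes, kept self-contained with a toy 2x2 mean-pooling layer (ToyMeanPooling is hypothetical): verify the forward pass on a tiny, hand-checkable input, then verify the backward pass with a known error, before debugging the gradient step:

    // Toy forward/backward check; not an mlpack test case.
    #include <armadillo>

    // Hypothetical 2x2 mean-pooling layer used only for this sketch.
    struct ToyMeanPooling
    {
      arma::mat Forward(const arma::mat& input)
      {
        arma::mat output(input.n_rows / 2, input.n_cols / 2);
        for (arma::uword i = 0; i < output.n_rows; ++i)
          for (arma::uword j = 0; j < output.n_cols; ++j)
            output(i, j) = arma::accu(input.submat(2 * i, 2 * j,
                                                   2 * i + 1, 2 * j + 1)) / 4.0;
        return output;
      }

      arma::mat Backward(const arma::mat& error)
      {
        // Distribute each error value evenly over the 2x2 region it came from.
        arma::mat delta(error.n_rows * 2, error.n_cols * 2);
        for (arma::uword i = 0; i < error.n_rows; ++i)
          for (arma::uword j = 0; j < error.n_cols; ++j)
            delta.submat(2 * i, 2 * j, 2 * i + 1, 2 * j + 1).fill(error(i, j) / 4.0);
        return delta;
      }
    };

    int main()
    {
      ToyMeanPooling pool;

      // Step 1: forward pass on an input small enough to verify by hand.
      arma::mat input = arma::ones<arma::mat>(4, 4);
      arma::mat output = pool.Forward(input);
      output.print("forward output (expect all ones):");

      // Step 2: backward pass with a simple, known error signal.
      arma::mat error = arma::ones<arma::mat>(2, 2);
      arma::mat delta = pool.Backward(error);
      delta.print("backward delta (expect all 0.25):");

      // Only once both of these look right is it worth debugging Gradient().
      return 0;
    }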
< nilay> I think it is due to the incompatible dimensions only. adding padding in the pooling layer should solve the problem; the question is, do we want to add a padding argument to the pooling layer as well
< nilay> right now convPool is 26 x 26 x 192
< nilay> convPool.input that is. and biasPool.Delta is 28 x 28 x 32
< zoq> I don't think so, we could do any padding in the inception layer, right?
< nilay> write a separate function to pad?
< nilay> if I can make convPool.InputParameter 28 x 28 x ... then it could work
< zoq> yes, if you pad the output of the pooling layer, you have to remove the padding when you call the backward and gradient functions.
< nilay> ok
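A minimal sketch of the padding idea, assuming hypothetical Pad/Unpad helpers rather than any existing mlpack function: zero-pad each slice of the pooling output before the next convolution, and strip the same border again on the way back through the backward and gradient computations:

    // Hypothetical pad/unpad helpers for the inception-layer wiring above.
    #include <armadillo>
    #include <iostream>

    // Zero-pad each slice of a cube by `border` pixels on every side.
    arma::cube Pad(const arma::cube& input, const arma::uword border)
    {
      arma::cube output(input.n_rows + 2 * border, input.n_cols + 2 * border,
                        input.n_slices, arma::fill::zeros);
      output.tube(border, border, border + input.n_rows - 1,
                  border + input.n_cols - 1) = input;
      return output;
    }

    // Remove the same border again (used on the way back through backward/gradient).
    arma::cube Unpad(const arma::cube& input, const arma::uword border)
    {
      return input.tube(border, border, input.n_rows - border - 1,
                        input.n_cols - border - 1);
    }

    int main()
    {
      // 26 x 26 x 192 pooling output padded to 28 x 28 x 192, as in the conversation.
      arma::cube poolOutput = arma::randu<arma::cube>(26, 26, 192);
      arma::cube padded = Pad(poolOutput, 1);
      arma::cube restored = Unpad(padded, 1);

      std::cout << "padded:   " << padded.n_rows << "x" << padded.n_cols
                << "x" << padded.n_slices << std::endl;
      std::cout << "restored: " << restored.n_rows << "x" << restored.n_cols
                << "x" << restored.n_slices << std::endl;
      return 0;
    }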
nilay has quit [Ping timeout: 250 seconds]
nilay has joined #mlpack
mentekid has joined #mlpack
nilay has quit [Ping timeout: 250 seconds]
nilay has joined #mlpack
mentekid has quit [Ping timeout: 276 seconds]
nilay has quit [Ping timeout: 250 seconds]