ChanServ changed the topic of #mlpack to: Due to ongoing spam on freenode, we've muted unregistered users. See http://www.mlpack.org/ircspam.txt for more information, or you can also join #mlpack-temp and chat there.
ShikharJ_ has joined #mlpack
< ShikharJ_> rcurtin: Are you there?
ShikharJ_ has quit [Quit: Page closed]
< akhandait> zoq: Yeah, I am trying to see why the error goes to -nan
ImQ009 has joined #mlpack
< rcurtin> ShikharJ_: yes, I am now
< akhandait> zoq: The test should pass with the current build.
< akhandait> zoq: I had some doubts about the transposed conv issue
ShikharJ_ has joined #mlpack
< ShikharJ_> rcurtin: I was wondering, how often does a new release happen for mlpack?
< ShikharJ_> rcurtin: And what constitutes a major release (what makes one go from let's say a 2.3 release to a 3.0)?
< ShikharJ_> zoq: I have updated the work product report, I'll make a blog post using the same material as well. Also, I'll take up the remaining work now.
ShikharJ_ has quit [Ping timeout: 252 seconds]
< rcurtin> ShikharJ_: it's all arbitrary :)
< rcurtin> I try to release, e.g., once a month, but it doesn't always happen because a month is pretty short and the releases aren't automated
< rcurtin> I figured a 3.1.0 release at the end of GSoC with the new project code merged would be good
vivekp has quit [Ping timeout: 240 seconds]
< zoq> ShikharJ_: Sounds good :)
< zoq> akhandait: Here to help.
< rcurtin> also, I should say about releases, I'm not picky at all; if anyone else wants to spearhead a release, I have no problem with that :)
< akhandait> zoq: We need to take the output width and height of the transposed conv layer as a parameter
< zoq> akhandait: the output width of the forward pass?
< akhandait> yes
< akhandait> If you see Relationship 14 of that paper, o' = s(i' − 1) + a + k − 2p
< akhandait> sorry if the primes didn't copy over correctly
< zoq> let me open the paper
< akhandait> Yeah, it will be better
< akhandait> Relationship 14 is the final, most general formula for transposed conv layers
< akhandait> according to that, to calculate o', we need a
< akhandait> a = (i + 2p − k) mod s
< zoq> I see
< akhandait> where i is the input size of the associated conv layer, which means it is the output size of the trans conv
< akhandait> Knowing only the input size, s, p and k, multiple output sizes are possible
< akhandait> What we can do is check if that relationship holds true and throw an error otherwise
< zoq> agreed, good idea
< akhandait> zoq: That's taken care of then, another thing I wanted to clarify
< zoq> static_assert or something like that
< akhandait> Oh, I used a Fatal
< akhandait> is that okay?
< zoq> Sure that works as well.
< zoq> this will do an exit afterwards, which I think is fine
< zoq> we can't continue anyway
< akhandait> Yeah
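A minimal sketch of the check discussed above, assuming hypothetical names (CheckTransposedConvOutputSize and its parameters are illustrative, not mlpack's actual TransposedConvolution members); it recovers a from Relationship 14 and streams to Log::Fatal when the user-supplied output size cannot be produced:

    // Relationship 14 (Dumoulin & Visin): o' = s * (i' - 1) + a + k - 2p,
    // with a = (i + 2p - k) mod s, so a valid a must lie in [0, s).
    #include <mlpack/core.hpp>

    // Hypothetical free function; in the layer these would be member variables.
    void CheckTransposedConvOutputSize(const size_t inputWidth,   // i'
                                       const size_t outputWidth,  // o' (user-supplied)
                                       const size_t kernelWidth,  // k
                                       const size_t stride,       // s
                                       const size_t padding)      // p
    {
      // Solve Relationship 14 for a.
      const long long a = (long long) outputWidth
          + 2 * (long long) padding
          - (long long) kernelWidth
          - (long long) stride * ((long long) inputWidth - 1);

      if (a < 0 || a >= (long long) stride)
      {
        // Log::Fatal prints the message and then aborts by throwing.
        mlpack::Log::Fatal << "Transposed convolution: output width " << outputWidth
            << " cannot be produced from input width " << inputWidth
            << " with k = " << kernelWidth << ", s = " << stride
            << ", p = " << padding << "!" << std::endl;
      }
    }

For example, CheckTransposedConvOutputSize(32, 64, 5, 2, 2) passes (a = 1), while an output width of 65 would trigger the Fatal.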
< akhandait> About inserting the zeros in between, do you know any other way we can do that, instead of creating a bigger matrix of zeros and setting alternate elements to the input values?
< akhandait> We will need to use loops
< akhandait> but I can't think of any other way
< zoq> Ideally, we would just skip the values in the conv operation, but right now I can't see any solution other than using loops.
< zoq> if we know the indices we could use submat
< zoq> X.submat( vector_of_row_indices, vector_of_column_indices )
< zoq> X( vector_of_row_indices, vector_of_column_indices )
< zoq> but I don't think this is actually faster
< akhandait> I am a little confused, can we insert zeros between individual elements using submat, like Figure 4.6 of that paper
< akhandait> Will submat set the elements alternately if we give the correct indices of rows/columns
< zoq> ah sorry for the confusion, this is just another way to insert the values in a bigger matrix, to "avoid" the for loops
< akhandait> so we can directly insert the elements alternately in the bigger matrix
< zoq> exactly
< akhandait> I think it will save at least some time
< akhandait> I will time it and see
< zoq> okay, sounds good
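A rough sketch of the expansion under discussion, showing both variants (ExpandWithZeros is an illustrative name, not the layer's actual code; it assumes stride >= 1 and a non-empty input):

    #include <armadillo>

    // Place the input values into a larger zero matrix, leaving (stride - 1)
    // zeros between neighbouring input elements ("fractional stride").
    arma::mat ExpandWithZeros(const arma::mat& input, const size_t stride)
    {
      arma::mat expanded(input.n_rows + (input.n_rows - 1) * (stride - 1),
                         input.n_cols + (input.n_cols - 1) * (stride - 1),
                         arma::fill::zeros);

      // Variant 1: plain loops.
      for (size_t j = 0; j < input.n_cols; ++j)
        for (size_t i = 0; i < input.n_rows; ++i)
          expanded(i * stride, j * stride) = input(i, j);

      // Variant 2: non-contiguous view, X.submat(row_indices, col_indices),
      // as suggested above (commented out so only one variant runs):
      // const arma::uvec rows = arma::regspace<arma::uvec>(0, stride, expanded.n_rows - 1);
      // const arma::uvec cols = arma::regspace<arma::uvec>(0, stride, expanded.n_cols - 1);
      // expanded.submat(rows, cols) = input;

      return expanded;
    }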
ShikharJ_ has joined #mlpack
ImQ009_ has joined #mlpack
ImQ009_ has quit [Read error: Connection reset by peer]
< ShikharJ_> akhandait: Sorry if I'm interrupting, but if I follow the conversation about directly inserting the zero elements correctly, then your results would be significantly different from what we obtain using other frameworks.
ImQ009_ has joined #mlpack
ImQ009 has quit [Ping timeout: 240 seconds]
< akhandait> ShikharJ: Sorry, I am not sure how other frameworks implement this. Can you explain a bit? This seems to be correct according to the paper
< ShikharJ_> akhandait: Also i is not the output of the trans conv layer. O is.
ImQ009_ has quit [Read error: Connection reset by peer]
ImQ009 has joined #mlpack
< akhandait> ShikharJ: Yes, i is not, but it is the output of the associated conv layer, which is basically the same thing: if the conv goes from 2 -> 4 (i = 2), the trans conv will go from 4 -> 2 (o = 2)
< akhandait> ShikharJ: I had a discussion with Marcus last time on #mlpack-temp. Sorry you had to miss it. I will send a txt file to you so that you can go through it.
< akhandait> Can I have your email?
< ShikharJ_> akhandait: Please use a pastebin?
< akhandait> oh! sure, i forgot about it
< ShikharJ_> akhandait: About flipping the kernel, yeah it doesn't matter mathematically, but it was done in the Forward pass because a Trans Conv is simply a Conv operation done in reverse.
< ShikharJ_> akhandait: Could you mention why you think the full convolution on the forward pass is incorrect?
< akhandait> ShikharJ: I am not sure it's incorrect, but it's extremely inefficient, more so for larger matrices
< akhandait> As I think about it again, I don't think it's incorrect. It will do the job
< ShikharJ_> akhandait: Okay, let's leave that for now. What about the stride being one? Why should it depend on the input stride, when we are taking a full convolution?
< akhandait> ShikharJ: You are correct, we won't need the stride for a full convolution. It's just that we shouldn't always perform a full convolution. That's the reason we will need the stride of the associated convolution operation (to insert zeros in between the input units).
< akhandait> As mentioned in the paper, when the stride of a convolution layer is > 1, the stride of the associated transposed conv layer is < 1. That's the reason we insert the zeros.
< akhandait> Now that I think about it again, I think performing a full convolution in a transposed conv layer is not a correct backward operation for a conv layer which used > 1 stride
< akhandait> for example
< akhandait> Let's say that a conv layer goes from 64x64 to 32x32 using k = 33, p = 0, s = 1; then it's correct for its associated transposed conv layer to use k = 33, p = 0 to go from 32x32 to 64x64
< akhandait> but,
< akhandait> if a conv layer goes from 64x64 to 32x32 using k = 5, p = 2, s = 2, then its correct associated transposed operation should be k = 5, p = 2, s = 1 (but with zeros inserted between input units)
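A quick arithmetic check of those two examples (a throwaway sketch, not mlpack code), using the usual conv output size o = floor((i + 2p - k) / s) + 1 and Relationship 14:

    #include <iostream>

    int main()
    {
      // Case 1: conv with i = 64, k = 33, p = 0, s = 1.
      std::cout << (64 + 0 - 33) / 1 + 1 << '\n';                         // conv output: 32
      std::cout << 1 * (32 - 1) + (64 + 0 - 33) % 1 + 33 - 2 * 0 << '\n'; // transposed output: 64

      // Case 2: conv with i = 64, k = 5, p = 2, s = 2.
      std::cout << (64 + 2 * 2 - 5) / 2 + 1 << '\n';                         // conv output: 32
      std::cout << 2 * (32 - 1) + (64 + 2 * 2 - 5) % 2 + 5 - 2 * 2 << '\n';  // transposed output: 64

      return 0;
    }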
< ShikharJ_> akhandait: "Performing a full convolution is not a correct backward operation for a conv layer which used > 1 stride". I don't think this is correct, a full convolution with stride one is the correct backwards operation on a conv layer.
ImQ009 has quit [Quit: Leaving]
< akhandait> ShikharJ: Yeah, it will do the job mathematically, but for that particular conv operation (> 1 stride), using the same kernel size (with fractional stride) is the better-associated operation. At least this is what I have understood from the paper.
< akhandait> zoq: I timed it, surprisingly, using loops is more than two times faster than using submat in this case
< akhandait> this is when I use [] instead of ()
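For context, a rough sketch of how such a comparison could be timed (the 256x256 input and stride of 2 are illustrative; in Armadillo, [] is unchecked linear indexing, while () also checks bounds unless ARMA_NO_DEBUG is defined):

    #include <armadillo>
    #include <iostream>

    int main()
    {
      const size_t stride = 2;
      arma::mat input(256, 256, arma::fill::randu);
      // Expanded size: 256 + 255 * (stride - 1) = 511 in each dimension.
      arma::mat expanded(511, 511, arma::fill::zeros);

      arma::wall_clock timer;

      // Loop version, using [] (column-major linear index, no bounds check).
      timer.tic();
      for (size_t j = 0; j < input.n_cols; ++j)
        for (size_t i = 0; i < input.n_rows; ++i)
          expanded[(j * stride) * expanded.n_rows + i * stride] =
              input[j * input.n_rows + i];
      std::cout << "loops:  " << timer.toc() << " s\n";

      // submat() version with non-contiguous index vectors.
      expanded.zeros();
      timer.tic();
      const arma::uvec idx = arma::regspace<arma::uvec>(0, stride, 510);
      expanded.submat(idx, idx) = input;
      std::cout << "submat: " << timer.toc() << " s\n";

      return 0;
    }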
< ShikharJ_> akhandait: Furthermore, if you just look at it intuitively, with a kernel size that small while upsizing an image, you're going to lose a lot of accuracy.
< ShikharJ_> akhandait: Transconv is nothing but the general conv operation reversed.
< akhandait> ShikharJ: Hmm, I am not sure about that. I think using a huge kernel size for a transposed_conv when a smaller one was used in the conv layer is not the correct solution for that (assuming it's losing accuracy)
< akhandait> ShikharJ: exactly
< akhandait> We hardly ever use a huge kernel size in a conv operation if the input is big (we use s > 1). In the same way, if we use a full convolution to reverse a conv operation which used s > 1, we are not exactly reversing it.
< akhandait> We are just getting it to the same size, but not using the 'reverse' of the conv layer
< akhandait> So, as I said, if a conv layer goes from 64x64 to 32x32 using k = 33, p = 0, s = 1, then it's correct for its associated transposed conv layer to use k = 33, p = 0 to go from 32x32 to 64x64
< akhandait> that's the correct reverse operation
< akhandait> but if a conv layer goes from 64x64 to 32x32 using k = 5, p = 2, s = 2, then its correct associated transposed operation should be k = 5, p = 2, s = 1 (but with zeros inserted between input units)
< akhandait> that's the correct reverse operation for that case
< ShikharJ_> Okay let's break this case by case.
< akhandait> I think the paper has done that for us :)
< akhandait> but I will be happy to go through it again
< akhandait> maybe we both will get to learn some things
< ShikharJ_> akhandait: "We hardly ever use a huge kernel size in conv operation if the input is big. The same way, if we use a full convolution to reverse a conv operation which used s > 1, we are not exactly reversing it." This is true, but we take this approximation while backpropagating in conv networks in general.
< akhandait> also, intuitively, it seems very appropriate to use the same kernel size for a transposed conv operation as we used in the associated conv operation
< akhandait> ShikharJ: Yes we do, but that doesn't justify using a full convolution for a transposed conv when we used s > 1 for its corresponding conv operation
< akhandait> about my previous point, if you see Figure 4.6 of that paper, I don't think we lose accuracy this way
< ShikharJ_> akhandait: It does, because the input is smaller (or rather denser, in an information-theoretic sense) than the output in transposed conv.
< akhandait> Yeah, each output unit (of the transposed conv), which should correspond to an input unit (of the conv operation), is affected only by those input units (of the trans conv) whose corresponding output units (again of the conv op) were affected by the corresponding input units of the conv op
< akhandait> Also, I think it's always better to trust the paper we are following than our intuition :)
< akhandait> Sorry if my last point is confusing to read, I hope you get what I am trying to say
< ShikharJ_> akhandait: Yeah, that confused me big time :P zoq: What would you say about this?
< akhandait> This is Theano's tutorial on this (based on that paper); they have used the same thing
< ShikharJ_> akhandait: I'm curious, do the tests pass with the new method?
< ShikharJ_> The ones in the ann layer?
< akhandait> I haven't implemented all this yet, I will let you know tomorrow.
< ShikharJ_> akhandait: Ah, okay. I'm all for this method if we're at least able to get the tests right.
ShikharJ_ has quit [Quit: Page closed]
< akhandait> ShikharJ_: Sure, let's see the results tomorrow