verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
chenzhe1 has joined #mlpack
chenzhe has quit [Ping timeout: 246 seconds]
chenzhe1 is now known as chenzhe
mikeling has joined #mlpack
chenzhe has quit [Ping timeout: 240 seconds]
aashay has joined #mlpack
aashay has quit [Quit: Connection closed for inactivity]
vpal has joined #mlpack
vivekp has quit [Ping timeout: 260 seconds]
vpal is now known as vivekp
shikhar has joined #mlpack
shikhar has quit [Read error: Connection reset by peer]
shikhar has joined #mlpack
shikhar has quit [Quit: WeeChat 1.4]
sumedhghaisas has joined #mlpack
mentekid has quit [Quit: Leaving.]
mentekid has joined #mlpack
sumedhghaisas has quit [Ping timeout: 240 seconds]
mikeling has quit [Quit: Connection closed for inactivity]
sumedhghaisas has joined #mlpack
< sumedhghaisas> zoq: Hey Marcus, had a couple of questions about the architecture.
< sumedhghaisas> I observed that the 'parameter' input to the Evaluate function is not used in FFN
< sumedhghaisas> So we are assuming that only gradient-descent-based optimizers will be used?
< zoq> sumedhghais: That's true, ignoring the input was an easy way to reuse the existing optimizer classes without writing a wrapper. But if you have something in mind that you think is worth a change, feel free.
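For context, a minimal sketch of the pattern being discussed, with assumed, simplified signatures (not the exact mlpack API): an FFN-like class can satisfy the interface the optimizers expect while ignoring the passed-in parameters and working on its own internal parameter matrix. The class and the stand-in loss here are hypothetical.

    #include <armadillo>

    class FFNLike
    {
     public:
      // The optimizer passes its iterate in, but the loss is evaluated on the
      // internally stored parameter matrix, so the argument can be ignored.
      double Evaluate(const arma::mat& /* parameters */) const
      {
        return arma::accu(arma::square(parameter));  // stand-in loss (sum of squares)
      }

      // Likewise, the gradient is computed from the internal parameters and
      // written into the matrix the optimizer provides.
      void Gradient(const arma::mat& /* parameters */, arma::mat& gradient) const
      {
        gradient = 2.0 * parameter;  // gradient of the stand-in loss
      }

      arma::mat& Parameters() { return parameter; }

     private:
      arma::mat parameter;
    };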
< sumedhghaisas> zoq: yeah I agree. But I am a bit confused about how it works. So Evaluate returns the loss based on the current network parameters
< sumedhghaisas> but the Gradient function builds the gradient as a single matrix
< sumedhghaisas> for the update
< sumedhghaisas> so where are the parameters updated? Usually they are updated in the optimizer, right?
mentekid has quit [Quit: Leaving.]
< sumedhghaisas> ahh okay... the 'iterate' matrix is passed as a reference to the 'parameters' object of the FFN
< sumedhghaisas> so it gets updated in the vanilla update...
< zoq> yeah, absolutely right
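A sketch of why that works, assuming a simplified SGD loop (not the actual mlpack SGD implementation): because the iterate is taken by reference and is the network's own parameter matrix, the vanilla update writes straight into the network.

    #include <armadillo>
    #include <cstddef>

    // Simplified SGD loop with a vanilla update; `iterate` is taken by reference.
    template<typename FunctionType>
    void SimpleSGD(FunctionType& function, arma::mat& iterate,
                   const double stepSize, const std::size_t maxIterations)
    {
      arma::mat gradient(arma::size(iterate));
      for (std::size_t i = 0; i < maxIterations; ++i)
      {
        function.Gradient(iterate, gradient);
        iterate -= stepSize * gradient;  // vanilla update, applied in place
      }
    }

    // Usage: SimpleSGD(network, network.Parameters(), 0.01, 1000);
    // Because `iterate` aliases the network's parameter matrix, every update is
    // immediately visible inside the network; no copy-back step is needed.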
< sumedhghaisas> but then maybe we can somehow parameterize the update policy to accept the actual update operation and bypass the entire gradient matrix creation?
< sumedhghaisas> what do you think?
< sumedhghaisas> that update operation will implement a forward pass through all the layers and update their individual parameters?
mentekid has joined #mlpack
< zoq> I mean you could do that, I guess the benefit is you would save memory, since you only have to hold the current gradient of layer x.
< sumedhghaisas> yeah... that's what I was thinking. And we can compute and update at the same time... without actually saving the gradient
< sumedhghaisas> So the update function would do the work of both the gradient computation and the update
< sumedhghaisas> but then we will need to change the Gradient function of all the layers... uffff
< zoq> I like the idea; not sure there is an easy way to achieve this. The idea was to avoid implementing a special optimizer for the ann code.
< zoq> modifying the Gradient function should be straightforward
< zoq> but it takes some time, yes
< sumedhghaisas> yeah... we would save a lot of memory accesses... and also the creation of the gradient matrix... which involves a lot of matrix reshaping
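A hypothetical sketch of the idea being discussed, with made-up names (Layer, LocalGradient, UpdateStep): compute each layer's gradient and apply the update immediately, so only one layer's gradient exists at a time and no full gradient matrix is assembled.

    #include <armadillo>
    #include <vector>

    // Hypothetical layer type; LocalGradient stands in for the real backward
    // computation of this layer's parameter gradient.
    struct Layer
    {
      arma::mat weights;

      arma::mat LocalGradient(const arma::mat& error) const
      {
        // Placeholder with the right shape; a real layer would backpropagate here.
        return arma::mat(arma::size(weights), arma::fill::ones) * arma::accu(error);
      }
    };

    // Compute and consume each layer's gradient in one step: nothing is stored
    // beyond the current layer, so no full gradient matrix is ever created.
    void UpdateStep(std::vector<Layer>& layers, const arma::mat& error,
                    const double stepSize)
    {
      for (Layer& layer : layers)
        layer.weights -= stepSize * layer.LocalGradient(error);
    }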
< sumedhghaisas> okay, I will create a GitHub issue for this and try to work it out
< sumedhghaisas> also... should I use the BatchNorm pull request and modify it... because except for some small changes and adding support for convolutional layers, the code looks good to me
< zoq> opening a new issue is a good idea
< zoq> yeah, the BatchNorm PR looks good to me too
< sumedhghaisas> okay... I will try to work on some architectural changes in place of the batch norm implementation... since most of my work there is already done by him :P
< zoq> if you like, sure :)
< sumedhghaisas> zoq: On a separate note, do you think building the network statically rather than dynamically would give a speedup?
< sumedhghaisas> just curiosity, I don't yet have an architecture to do that :)
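For illustration only (not mlpack code): a "static" network fixes the layer types at compile time, e.g. in a std::tuple, so the compiler can inline the whole forward pass, whereas a "dynamic" network dispatches through a runtime container of layers. All names below (StaticNet, Linear, ReLU) are made up for the sketch.

    #include <tuple>
    #include <cstddef>

    // Two toy layers with a compile-time known Forward() step.
    struct Linear { double Forward(const double x) const { return 2.0 * x; } };
    struct ReLU   { double Forward(const double x) const { return x > 0 ? x : 0; } };

    // "Static" network: layer types are template parameters, so the full forward
    // pass can be inlined by the compiler (no runtime dispatch between layers).
    template<typename... Layers>
    class StaticNet
    {
     public:
      double Forward(const double x) const { return ForwardImpl<0>(x); }

     private:
      template<std::size_t I>
      double ForwardImpl(const double x) const
      {
        if constexpr (I == sizeof...(Layers))
          return x;
        else
          return ForwardImpl<I + 1>(std::get<I>(layers).Forward(x));
      }

      std::tuple<Layers...> layers;
    };

    // Usage: StaticNet<Linear, ReLU> net;  const double y = net.Forward(-3.0);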