ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
< jenkins-mlpack2> Project docker mlpack nightly build build #128: STILL UNSTABLE in 7 hr 54 min: http://ci.mlpack.org/job/docker%20mlpack%20nightly%20build/128/
davida has quit [Ping timeout: 245 seconds]
davida has joined #mlpack
< davida> Hi. I am trying to apply GradientClipping on a plain Stochastic Gradient Descent. I thought I should use VanillaUpdate but GradientClipping does not seem to like that. Here is the code snippet and the error.
< davida> VanillaUpdate vanillaUpdate();
< davida> GradientClipping<VanillaUpdate> clippedVanillaUpdate(-5, 5, vanillaUpdate);
< davida> Never mind. Think I am using it wrongly.
< davida> Nope. Cannot get it to work. Seems like GradientClipping doesn't like VanillaUpdate
< davida> Any idea what I am doing incorrectly?
< davida> The error is:
< davida> class GradientClipping
< davida> Hmm. Can't seem to paste the error in here.
< davida> gradient_clipping.hpp:40:3: note: no known conversion for argument 3 from ‘mlpack::optimization::VanillaUpdate()’ to ‘mlpack::optimization::VanillaUpdate&’
< davida> gradient_clipping.hpp:29:7: note: candidate: ‘constexpr mlpack::optimization::GradientClipping<mlpack::optimization::VanillaUpdate>::GradientClipping(const mlpack::optimization::GradientClipping<mlpack::optimization::VanillaUpdate>&)’
< davida> Hah. Sorry guys. Should learn to read the errors better. I had an anomalous () on the declaration of my VanillaUpdate.
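(For reference, the corrected snippet might look like the sketch below. The GradientClipping(min, max, updatePolicy&) signature follows from the compiler note above; the include paths and the SGD constructor argument list are assumptions based on the mlpack 3.x mlpack::optimization API and may need adjusting for other releases.)

    #include <mlpack/core/optimizers/sgd/sgd.hpp>
    #include <mlpack/core/optimizers/sgd/update_policies/vanilla_update.hpp>
    #include <mlpack/core/optimizers/sgd/update_policies/gradient_clipping.hpp>

    using namespace mlpack::optimization;

    // No trailing "()": 'VanillaUpdate vanillaUpdate();' declares a function
    // (the most vexing parse), which is why argument 3 could not bind to a
    // VanillaUpdate& in the error above.
    VanillaUpdate vanillaUpdate;

    // Clip each gradient element to [-5, 5] before the plain vanilla update.
    GradientClipping<VanillaUpdate> clippedVanillaUpdate(-5, 5, vanillaUpdate);

    // Hand the clipped policy to SGD as its update policy (argument order
    // assumed: step size, batch size, max iterations, tolerance, shuffle,
    // update policy).
    SGD<GradientClipping<VanillaUpdate>> optimizer(
        0.01, 32, 100000, 1e-5, true, clippedVanillaUpdate);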
vivekp has quit [Read error: Connection reset by peer]
vivekp has joined #mlpack
< davida> zoq: Is there a reason that we need to put a layer before adding a recurrent layer in the RNN model? I am referring to the example in recurrent_network_test.cpp
< davida> ... where the first layer is IdentityLayer
< davida> zoq: Also, was support for different sample lengths in an RNN incorporated into the latest release, and if so, how do I use it?
davida has quit [Ping timeout: 252 seconds]
davida has joined #mlpack
< zoq> davida: By default we discard the backward step for the first layer, since it's not going to be used. But that doesn't work if the first layer holds other layers, so we just add a dummy layer. There is an idea to only skip the layer if it doesn't implement the Model() function.
< davida> OK - so adding the IdentityLayer makes no modification to the input but allows the backward step to be discarded.
< zoq> right
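(The pattern being discussed, roughly as it appears in recurrent_network_test.cpp: the IdentityLayer passes the input through unchanged but gives the RNN a first layer whose backward step can safely be discarded. Layer sizes, rho, and include paths below are placeholders/assumptions for the mlpack 3.x ann API, not the exact test code.)

    #include <mlpack/methods/ann/rnn.hpp>
    #include <mlpack/methods/ann/layer/layer.hpp>

    using namespace mlpack::ann;

    const size_t rho = 10;           // backpropagation-through-time length (placeholder)
    RNN<> model(rho);

    // Dummy first layer: no modification of the input, but its backward step
    // is the one the RNN discards.
    model.Add<IdentityLayer<>>();

    // The actual recurrent part of the network, e.g. an LSTM cell.
    model.Add<LSTM<>>(1, 16, rho);   // inSize, outSize, rho (placeholder sizes)
    model.Add<Linear<>>(16, 1);
    model.Add<SigmoidLayer<>>();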
< davida> zoq: Any update on the different-sized input sequences for the RNN? If you recall, changing Rho at each training step caused some errors.
< zoq> davida: Ahh, right, thanks for the reminder, have to put it on the list
< davida> ... but to complete the RNN exercise I need a way to stop the training early, otherwise it learns that 0s (I pad with zeros) are the most likely next element.
< davida> Which is not the case
< zoq> I see, hopefully I can get this done this week
< davida> Thx. Will be a great help for a real world problem I have to manage as well.
< zoq> absolutely
< davida> BTW - if you implement it for RNN will that mean it will also work for the LSTM model?
< zoq> yeah
< davida> great