verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
kris1 has quit [Quit: kris1]
kris1 has joined #mlpack
< rcurtin> zoq: ironstark: I'll go ahead and make sure that all the dependencies for r-base are installed on all 5 benchmarking systems
wiking has quit [Quit: ZNC 1.6.3 - http://znc.in]
wiking has joined #mlpack
govg has quit [Ping timeout: 260 seconds]
partobs-mdp has joined #mlpack
kris1 has quit [Quit: kris1]
govg has joined #mlpack
< partobs-mdp> zoq: rcurtin: Could you take a look at HAM PR? I've got some problems T_T
< rcurtin> partobs-mdp: sure, what would you like me to look at?
< rcurtin> I am paying attention to a talk right now, but I think you can have my full attention in ~30m
< rcurtin> ah sorry I see you made a github comment about it, I'll start there
< zoq> partobs-mdp: Unfortunately, my time is limited right now ... can you make sure that the Linear parameters are correct inside the Linear class, e.g. by printing the weights in the Forward step?
< partobs-mdp> zoq: Checked - there indeed was garbage in the Linear layer :) Fixed; now everything works - implementing the WRITE and SEARCH models.
< rcurtin> partobs-mdp: you beat me to it, I found the same. the linear layer has N^2 + N elements, I guess you were only setting the first 2N
< rcurtin> there is also a logical error after the JOIN operation: 'error: Mat::cols(): indices out of bounds or incorrectly used'
< rcurtin> but you are probably aware of that
< rcurtin> it happens later in the test, to be specific, not during the JOIN operation
< partobs-mdp> rcurtin: Yes, I know - this code acts as a stub - working on it :)
< rcurtin> sure, no problem, just making sure you knew about it
< rcurtin> if compiled with -DDEBUG=OFF and -DPROFILE=OFF, those errors won't be displayed; Armadillo will just happily do something invalid and then suddenly you get a big backtrace and a segfault :)
mikeling has joined #mlpack
partobs-mdp has quit [Remote host closed the connection]
wiking has quit [Quit: ZNC 1.6.3 - http://znc.in]
wiking has joined #mlpack
sheogorath27 has joined #mlpack
kris1 has joined #mlpack
< kris1> Hi zoq, with the networkInit.hpp file, when we initialise on a per-layer basis, the Parameters() function of the FFN module returns a matrix; is this correct behaviour?
< kris1> Or should I set the generator.Parameters() matrix to point to the parameters matrix after initialisation on a per-layer basis?
partobs-mdp has joined #mlpack
< partobs-mdp> zoq: rcurtin: I'm trying to implement the forward pass of HAM, and I kind of know what to do, but I get a huge error message. (Right now my goal is to port TreeMemory to LayerTypes - we won't need any other "callables" for it.) Could someone take a look at the issue?
< partobs-mdp> (The latest code is in the HAM PR on Github)
< rcurtin> partobs-mdp: do you need to include mlpack/methods/ann/layer_types.hpp?
< rcurtin> sorry that's mlpack/methods/ann/layer/layer_types.hpp
< partobs-mdp> rcurtin: Yes, for using LayerTypes
< partobs-mdp> rcurtin: Got slightly confused - what file are we talking about?
< rcurtin> in tree_memory.hpp, you have to include layer_types.hpp I believe
< rcurtin> then you can't call Predict() directly on a LayerTypes, you have to use boost::visitor
< partobs-mdp> rcurtin: Can we constrain even further on FFNs there?
< rcurtin> I suppose it could be possible to use an FFN and not a LayerTypes... I'm not sure if that would add any extra trickiness for computing the gradients/etc.
< partobs-mdp> rcurtin: Trickiness? Won't we immediately get all needed methods for doing that?
< rcurtin> if we are learning the join/search/etc. functions, it might be tricky to properly propagate the gradients through it so that it learns
< rcurtin> maybe it will work fine, I am not sure about that bit
< rcurtin> but if you are not learning a join/search/etc. function at all but instead hardcoding it, then I think using FFN is fine
< partobs-mdp> rcurtin: Well, we're doing DHAM - shouldn't it be just a chain-rule (backprop) learning?
< rcurtin> right, and so the functions that FFN is giving us may not make it easy to have the FFN do only one step of learning
< rcurtin> I am not 100% certain when I say that, so you should go ahead and try it, but I suspect there may be implementational difficulties later
< rcurtin> partobs-mdp: I have to go to bed now, keep up the good work :)
< partobs-mdp> rcurtin: Good night ^_^
< rcurtin> I assume the most frustrating part is that a single little error yields like 100k lines of error messages
< rcurtin> it's like unfair punishment
< rcurtin> at least that's how I always feel when I can't get stuff to compile
< partobs-mdp> rcurtin: I had 26k once, but it looks like that's not the absolute frustration record :)
< rcurtin> even funnier when the types that gcc is putting out in its error messages are so large they can't even fit on my terminal...
< rcurtin> I dunno, maybe 100k is an exaggeration... can't remember for sure
< rcurtin> I dunno if lines is the best metric either, since some of the types are so long that counting them as '1 line' is a bit misleading
< partobs-mdp> Well, it's down to 917 lines :)
< rcurtin> what I have on your branch is only 438 lines but 4.7M characters
< rcurtin> average > 1k characters per line
< rcurtin> impressive :)
< rcurtin> anyway, I am off to bed, I will check in again in the morning
witness has joined #mlpack
< kris1> lozhnikov: updated the GAN PR. I have refactored the code and added another, easier test.
< kris1> I have done some preliminary checking that the matrices are being correctly initialised and updated.
< kris1> The problem of small gradients still persists, though: the gradients are on the order of 1e-5, and multiplying them by the learning rate of 1e-3 gives updates on the order of 1e-8, which can be considered vanishing gradients.
< kris1> Should I try clipping the gradient values? If yes, what would be a reasonable clip value?
< lozhnikov> kris1: In that case you can just set learningRate to 1.0
< kris1> Hmmm let me check.
< kris1> I will try to plot the images at different iterations and check whether the GAN is learning something or not.
mikeling has quit [Quit: Connection closed for inactivity]
kris1 has quit [Quit: kris1]
< zoq> kris1: Sorry for the slow response; I'm not sure I get your point - do you mean to call the Initialize method with a parameter matrix (not model.Parameters()) and afterwards do model.Parameters() = parameters?
kris1 has joined #mlpack
partobs-mdp has quit [Remote host closed the connection]
< kris1> I set the learning rate to around 0.5 and the gradients still vanish. The network is only a 2-layer neural net, so I don't understand why this is happening.
< kris1> Can you have a look at the code?
< kris1> lozhnikov:
< lozhnikov> kris1: I think there is no quick answer. I should spend some time debugging the code. I'll do that in the morning.
< kris1> ok
kartik_ has joined #mlpack
< kartik_> zoq: Hi, my optimization function looks like this https://ghostbin.com/paste/o335b
< kartik_> at line 29 is where I want to copy the parameters to the network so that I can evaluate it and see the output for those parameters
< kartik_> I know I'm wrong, but what exactly should be done?
< zoq> kartik_: You can do function.Parameters() = parameters;, you could also use answer = parameters; - have you seen my last comment on the PR?
kartik__ has joined #mlpack
< kartik__> yes zoq, but I was still confused
< kartik__> oh okay.. so since answer is given by the FFN itself
< kartik__> modifying that will work as well
< kartik__> also I'm thinking of making two optimizers .. one as an optimizer
< zoq> kartik__: Yes you can do both, since we call Optimize with function and function.Parameters
< zoq> what's the purpose of the other optimizer?
< kartik__> I was wondering if it's possible to give the other optimizer the task and the input, hidden, and output layer sizes
kartik_ has quit [Ping timeout: 260 seconds]
< kartik__> then it can make its own FFN model converge, test fitness using the task, and then give the output
< zoq> But in CNE we evaluate multiple 'networks' and merge the best ones, so we don't train a single network until it converges; maybe I missed something?
< kartik__> CNE doesn't make species
< kartik__> but NEAT does
< kartik__> so I'll be having multiple network parameter matrices
< kartik__> and will test on the same model
< kartik__> by setting different parameters
< kartik__> zoq: is it possible to evaluate the FFN model on just one point, without giving the train set and test set,
< kartik__> but just the parameters and the input?
< zoq> I was not talking about species, I was talking about a population; as you pointed out, you will have a bunch of networks/parameters. But you said you'd like to train a single network until it converges, which sounded strange, since if it converges we don't have to train another one.
< zoq> You can use the Predict function instead of Evaluate.
< kartik__> oh okay.. thanks.. also zoq, I'm confused about what to implement in the test case
< zoq> I think we are talking about the same thing, but I don't get the second optimizer part.
< zoq> We can just use the same test cases as you wrote for CMAES.
< kartik__> the next optimizer takes the task and the model's layer sizes, then makes the population and uses the Predict function to find the best parameters, using the fitness supplied by the task as a template
< kartik__> zoq: is this correct?
< kartik__> then we will just have to supply a class of the task type ..
< zoq> If we use the current optimizer interface, the task is embedded inside the model itself, so Evaluate returns the fitness of the model using the current parameters.
< zoq> Based on the fitness returned by Evaluate we can compare the different models.
< kartik__> zoq: sorry.. but I did not understand that
< kartik__> what does 'the task is embedded inside the model itself' mean?
< kartik__> oh okay, I got your point..
< zoq> I was saying that we don't have to provide a task, since the task is to optimize the model. For example, when training a simple network on a dataset, the optimizer doesn't need to know what the task is or how the dataset looks, since a call to Evaluate returns the fitness.
< zoq> So, for me the code you provided looks just fine, what is left is to implement the reproduce function.
< kartik__> zoq: but there it was calculated using the train and test data points given to the model in .train(), and here the task should be called to get the fitness.. how will Optimize differentiate between these?
< zoq> We don't have to differentiate between both, since fitness is nothing more than some kind of performance indicator we use to select the best model.
< zoq> Let's say the task is to get 100% accuracy on a dataset; how do we get the fitness of a single model? We would go through each sample and evaluate the model, right?
< kartik__> right
< kartik__> zoq: exactly, this is done.. by calling Evaluate, the train and test data given before can be found for each dataset and the fitness can be computed
< zoq> So, I agree the range of tasks we can solve right now is limited to optimizing a model to be good at classification, so we can't train it in some environment like Mario, but the step to get that done is minimal. I think we should concentrate on the classification task for now.
< kartik__> okay
< kartik__> zoq: thanks a lot
< zoq> Here to help, let me know if you need any help with the reproduce function.
witness has quit [Quit: Connection closed for inactivity]
< zoq> kris1: Not sure you have seen my message.
< zoq> kris1: Sorry for the slow response; I'm not sure I get your point - do you mean to call the Initialize method with a parameter matrix (not model.Parameters()) and afterwards do model.Parameters() = parameters?
< kris1> I updated the PR, have a look. Well, that is what I am doing right now.
< zoq> kris1: Ah, okay, I'll take a look once I get a chance.