verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
< wiking> rcurtin, y0
< rcurtin> wiking: hello there
< wiking> hihi
< wiking> what's your count this year?
< rcurtin> of applications?
< rcurtin> 69
< wiking> niiice
< wiking> :)
< rcurtin> a lot to look through :(
< rcurtin> last year it was 119...
< rcurtin> how many did you get?
< wiking> 38
< rcurtin> a more manageable count :)
< wiking> i just went through all of them
< wiking> 16 ignore
< wiking> :)
< rcurtin> I haven't had a chance yet to really look through ours
< rcurtin> there were a bunch of emails of the type 'please look through my proposal and give comments' but I thought the deadline was 12 hours after it actually was... so I did not get to those in time... :(
< wiking> :>
< wiking> yeah i mean i'm not so sure i understand the rationale
< wiking> with these last-minute applications
< wiking> some students really took their time and effort to put together a draft like 1 month before the deadline
< rcurtin> I guess the stipend is sufficiently high that people think "why not give it a shot"
< wiking> that manifested in a good application actually
< wiking> ah but you know
< rcurtin> same here, a lot of people have been in touch for many weeks before the deadline and all of them are quite well prepared and have made nice patches, etc.
< vss> rcurtin : At least it's over for now :3
< Trion> when model has been training for hours and error happens http://gph.is/1hFGH5a
< sagarbhathwar> Check this out - https://github.com/junyanz/CycleGAN
< rcurtin> zoq: any more comments about SMORMS3 (#899) or should I go ahead and merge it? I think it looks fine now
< cult-> what methods are available to measure how big the Hoeffding tree is? i want to somehow verify its robustness before i apply Classify().
< cult-> i can see the methods, like NumChildren() MinSamples(), MaxSamples() but i don't know which one to use
< cult-> what is MajorityClass()?
< cult-> or how should I approach this problem? can I request how many observations the tree has already been trained on? or how many times?
< rcurtin> so you could use NumChildren() recursively but that might be a bit irritating
< rcurtin> here is a simple code snippet, I think it will work:
< rcurtin> std::stack<HoeffdingTree<>*> stack;
< rcurtin> stack.push(&tree); // 'tree' is the root of the tree
< rcurtin> size_t nodes = 0;
< rcurtin> while (!stack.empty())
< rcurtin> {
< rcurtin>   HoeffdingTree<>* node = stack.top();
< rcurtin>   stack.pop();
< rcurtin>   nodes += node->NumChildren(); // 'node' is a pointer, so use ->
< rcurtin>   for (size_t i = 0; i < node->NumChildren(); ++i)
< rcurtin>     stack.push(&node->Child(i));
< rcurtin> }
< rcurtin> MajorityClass() is the class most often seen by that particular node during training
< rcurtin> so if you asked for the predicted class for some sample, the tree would percolate that sample to the appropriate leaf node and then the predicted class would be MajorityClass() in that leaf node
< rcurtin> I should add a NumDescendants() function to HoeffdingTree<>, let me open an issue for that since I am doing other things right now
< cult-> yeah, that would be nice
< cult-> but is NumChildren() for the top node of the tree? or all the children across the tree?
< rcurtin> the NumChildren() gives the count of the direct children of a node in the tree, not a count of all of the nodes in the tree
< rcurtin> hence the need to recurse through the tree like the stack solution I gave above
< rcurtin> I opened https://github.com/mlpack/mlpack/issues/977 , I hope to have a chance to solve it soon-ish
< cult-> thanks
< zoq> rcurtin: I think SMORMS3, RMSprop and Adam/AdaMax are ready.
< cult-> MajorityProbability() is something like 0.5 or 50.0 ?
< cult-> but its only for one leaf node
< cult-> ?
< rcurtin> zoq: ok, I'll merge it
< rcurtin> cult-: it's in [0, 1] not [0, 100]
< rcurtin> and it is the probability of the majority class, but only for that node
< rcurtin> note that every node in a Hoeffding tree is of type HoeffdingTree<>
< rcurtin> so every node, even the root and high-level nodes, implements the same interface
< rcurtin> so if you call MajorityClass() on the root of the tree, what it will give you is the majority class of the entire dataset, and MajorityProbability() will give you the percentage of points (in [0, 1]) belonging to the majority class
< cult-> ok now its clear, thanks!
< cult-> until the feature request is done i should do something like if (MajorityProbability() > 0.2) {}
< cult-> question is what would be considered an appropriate threshold value
< cult-> but thats very data specific i guess
< cult-> hm, rather root.MajorityProbability() < 0.9
< rcurtin> cult-: I don't understand, what do you mean to do with that code?
< cult-> rcurtin: to make sure that the training set is separable and not dominated by one class only
< travis-ci> mlpack/mlpack#2248 (master - 0242c23 : Ryan Curtin): The build is still failing.
< sumedhghaisas> zoq: Hey Marcus, have you already thought about the regularization framework?
< zoq> sumedhghais: What you described in section 2 "Batch Normalization"?
< sumedhghaisas> zoq: ohh umm... no I mean like if I want to add l2 regularization on weights of certain layer...
< zoq> ah, okay ... no I haven't thought about that ...
< sumedhghaisas> I was thinking maybe ... somehow with iterators we can create functions to return the weights.
< sumedhghaisas> and add L2 regularization layer
< sumedhghaisas> but the more I think about it... the more complicated it gets
< sumedhghaisas> somehow the gradient function of the layer has to be changed
< rcurtin> sumedhghaisas: I hate it when that happens, it seems like that is every day for me :)
< zoq> hm, I have to think about it ... right now I can't think of an easy solution
< zoq> if you can think of a proof of concept, eager to take a look
< sumedhghaisas> rcurtin: haha... I do have a solution but I really hope that there is a better one... or else I will lose faith in life :P
< sumedhghaisas> zoq: proof of concept? sorry I didn't get you
< zoq> Maybe you can think of something that could work, but it's not "perfect" as it should be?
< sumedhghaisas> zoq: ahh okay... I do have that. but try not to judge :P
< sumedhghaisas> okay so first I thought that 'model.Add<Linear<> >(trainData.n_rows, hiddenLayerSize)' should return the parameters of that layer...
< sumedhghaisas> So that the user has access to add regularization over it
< sumedhghaisas> first problem solved...
< sumedhghaisas> now the parameter... rather than being a matrix
< sumedhghaisas> would be an object ... which stores a list of objects, each associated with an extra term for that weight in the error function
< sumedhghaisas> So for example...
< sumedhghaisas> weight matrix will be an object which stores a matrix and an empty list
< sumedhghaisas> now when that weight is returned... I apply ApplyL2Regularization(weights)
< sumedhghaisas> it adds an object to that weight object associated with the error gradient of that weight in the loss function
< sumedhghaisas> now in the gradient function ... all we have to do ... is call all the gradient functions in the list that parameter holds
< sumedhghaisas> But indeed the better implementation is the generalization of this
< sumedhghaisas> so all the parameters of the model should be separate from the layers ... they should be stored in a master object
< sumedhghaisas> now when every layer is instantiated... the layer will add the parameter objects in the master object and add its gradient wrt itself...
< zoq> I mean the parameters are basically separated from each layer, and there is this master parameters matrix.
< sumedhghaisas> yes... but accessing an individual parameter and applying regularization to it will be difficult that way...
< zoq> I agree
< sumedhghaisas> what I propose is... create a master storage of all weights. Each layer will create its parameters there and store their references
< sumedhghaisas> it will also return the reference of those weights to the user...
< sumedhghaisas> layer will simply add the gradient of that parameter with respect to itself...
< sumedhghaisas> if each layer does this... it's the most generic architecture
< sumedhghaisas> what do you think?
< zoq> hm, model.Parameters() is the weight storage, and you could access the parameters also via model.Model()[0].Parameters() which is a reference to the weight storage for a specific layer.
< zoq> same for the gradients
< sumedhghaisas> I agree... I also thought of that. But the problem remains: how to add extra gradient terms while computing the gradient with respect to any parameter?
< sumedhghaisas> this is due to the fact that the gradient is handled solely by the layer that creates it
< sumedhghaisas> but in a general structure of any neural network its not the case
< sumedhghaisas> the gradient is overall with respect to the loss function ... and can be due to multiple layers...
< sumedhghaisas> This also solves the problem of sharing weights ... and any other model that may share parameters between them
< zoq> There is a problem with sharing weights?
< sumedhghaisas> ahh wait.. I see what you mean now
< zoq> So, you propose something like an execution graph, right?
< sumedhghaisas> yes... that's what I thought. But I realized that's TensorFlow :P
< zoq> :)
< sumedhghaisas> But I still do think that execution graph will be much better
< zoq> This is definitely an interesting problem, and I have to think about it.
< sumedhghaisas> but if we keep returning the references and adding them to different layers
< sumedhghaisas> it is an execution graph... just not explicit
< sumedhghaisas> also Marcus... you are also interested in Reinforcement Learning, right? If you have some time, I am building an actor-only learning algorithm using MCTS sampling. Would love to discuss it with you
< zoq> Sounds interesting, I have to finish some other things first, so if you like we can talk about the idea in the next days, tomorrow?
< sumedhghaisas> Sure. Tomorrow sounds perfect. Are you usually online around this time?
< zoq> yes, we are almost in the same timezone right? +-1h
< sumedhghaisas> ahh yes... Berlin right? Sure.