ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
< rcurtin>
it includes me (and others) warming up karts, then qualifying, then finally starting the race (I was the second-from-last qualifier, and then ran a lap that put me in 3rd place for the start)
< rcurtin>
the race actually starts at 8:15 or so, the rest might be pretty boring :)
xiaohong has joined #mlpack
< rcurtin>
robertohueso: I read through the paper, and it seems to me like this is most easily thought of from the bottom up instead of the top down
< rcurtin>
so, e.g., a KD-tree gets built; then, we look at the leaf nodes and run PCA to recover d eigenvectors, and those constitute our basis for that leaf node
< rcurtin>
in the figure, you're right, d = 1, but I think it would be possible to choose a greater d
< rcurtin>
then, we can look at the parent nodes of those leaf nodes and use the algorithm given in [8] to merge the two sets of eigenvectors
< rcurtin>
and do this the rest of the way up the tree
< rcurtin>
it seems to me like you would also need to hold the projections of the points (not the descendants, just the points I think) in each node's statistic
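(A rough illustration of that bottom-up construction, as a hedged NumPy sketch rather than mlpack's actual C++ tree code; the function name, the choice of d, and the leaf's point matrix are placeholders, and the parent-level merge from [8] is only noted in a comment.)
```python
# Illustrative NumPy sketch (not mlpack's C++ tree code): build a
# d-dimensional PCA basis from the points owned by a single leaf node and
# store the projections of those points onto that basis.
import numpy as np

def leaf_basis_and_projections(points, d):
    """points: (n_points, dim) array of the points held by one leaf;
    d: number of eigenvectors to keep for the local basis."""
    mean = points.mean(axis=0)
    centered = points - mean
    cov = np.cov(centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)     # ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:d]      # indices of the d largest
    basis = eigvecs[:, order]                  # (dim, d) local basis
    projections = centered @ basis             # (n_points, d) coordinates
    return mean, basis, projections

# A parent node would then merge its children's bases (e.g., with the
# method from reference [8]) and hold the projections of its own points;
# that merge step is not shown here.
```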
< robertohueso>
That makes sense to me :) I was kinda confused by that, so yeah a node statistic should be enough. And yeah, I also think we need to hold the projections
< robertohueso>
Thanks for the clarification!
< rcurtin>
of course :)
< rcurtin>
robertohueso: I ran out of time tonight for the RectangleTree copy constructor... I can do it in the next few days, so you can focus on the subspace tree :)
k3nz0_ has quit [Ping timeout: 245 seconds]
xiaohong has quit [Remote host closed the connection]
sreenik[m] has quit [Ping timeout: 250 seconds]
Sergobot has quit [Ping timeout: 265 seconds]
chandramouli_r has quit [Ping timeout: 265 seconds]
aleixrocks[m] has quit [Ping timeout: 265 seconds]
xiaohong has joined #mlpack
KimSangYeon-DGU has joined #mlpack
chandramouli_r has joined #mlpack
Sergobot has joined #mlpack
aleixrocks[m] has joined #mlpack
sreenik[m] has joined #mlpack
xiaohong has quit [Remote host closed the connection]
xiaohong has joined #mlpack
k3nz0_ has joined #mlpack
xiaohong has quit [Remote host closed the connection]
xiaohong has joined #mlpack
sumedhghaisas has joined #mlpack
< KimSangYeon-DGU>
sumedhghaisas: Hi Sumedh, I'm ready and I'm sorry to hear that you are sick :(
< sumedhghaisas>
KimSangYeon-DGU: Hey Kim.
< sumedhghaisas>
feeling better already
< sumedhghaisas>
how have you been?
< KimSangYeon-DGU>
I set theta as a trainable variable
< KimSangYeon-DGU>
tested it so far
< sumedhghaisas>
ahh yes, I had a couple of questions about that as well.
< sumedhghaisas>
how's it looking?
< KimSangYeon-DGU>
Umm, I think it's sensitive to lambda
< KimSangYeon-DGU>
I'll write a document for that
< KimSangYeon-DGU>
I just set theta as a trainable scalar variable.
< sumedhghaisas>
yes I was just going to say that
< KimSangYeon-DGU>
Yeah
< sumedhghaisas>
because the equation restricting that depends on it
< sumedhghaisas>
so just to clarify
< KimSangYeon-DGU>
yeah
< sumedhghaisas>
before, you were using the equation under Equation 13 to compute the cosine, correct?
< KimSangYeon-DGU>
Yeah
< sumedhghaisas>
cool. And you removed it and used theta as a trainable parameter.
< sumedhghaisas>
But you are still using the 2 cluster case right?
< KimSangYeon-DGU>
Yeah, I'm using 2 clusters
< KimSangYeon-DGU>
Sumedh, I have a question
< sumedhghaisas>
Sure go ahead
< KimSangYeon-DGU>
I'm not sure I understand "before, you were using the equation under Equation 13 to compute the cosine, correct?"
< sumedhghaisas>
ahh, I meant: how were you computing the cosine before?
robertoh1eso has joined #mlpack
< KimSangYeon-DGU>
Ahh, right I used the equation in the constraint
< KimSangYeon-DGU>
under (13)
< KimSangYeon-DGU>
I understand :)
< sumedhghaisas>
yeah, that's what I meant, no worries :)
< KimSangYeon-DGU>
As you said, I removed it and changed it to a trainable variable
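(For concreteness, a minimal sketch of treating theta as a trainable scalar, assuming a TensorFlow-style setup; the variable name and initial value here are made up, not the actual experiment code.)
```python
import numpy as np
import tensorflow as tf

# Hypothetical sketch: theta as a trainable scalar, instead of computing
# cos(theta) from the constraint equation under Equation 13. The initial
# value is an arbitrary choice.
theta = tf.Variable(np.pi / 2.0, dtype=tf.float64, trainable=True)
cos_theta = tf.cos(theta)  # used wherever the cosine appears in the objective
```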
< sumedhghaisas>
okay, so let's state the status so we can get some idea of how to move on. Stop me if you think I'm saying anything wrong
< sumedhghaisas>
We are currently doing experiments with the 2-cluster case
< sumedhghaisas>
pertaining to Equation 13 in the paper
< sumedhghaisas>
we optimize NLL + lambda * constraint
< sumedhghaisas>
we observed that the constrained optimization is not amazingly good, so we need to tinker with lambda or shift to another optimization altogether
< sumedhghaisas>
tinkering with lambda seems to work better
robertohueso has quit [Ping timeout: 246 seconds]
< sumedhghaisas>
and now we have changed theta to a trainable parameter, and you are going to document the results of that in a new document
< sumedhghaisas>
so far so good?
< KimSangYeon-DGU>
Exactly :)
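(A hedged sketch of that setup, minimizing NLL + lambda * constraint with one gradient step in TensorFlow 2 style; `negative_log_likelihood`, `constraint_term`, and the parameter list are placeholders for the real QGMM expressions, and the optimizer could be e.g. `tf.keras.optimizers.Adam`.)
```python
import tensorflow as tf

def train_step(params, data, lam, optimizer,
               negative_log_likelihood, constraint_term):
    """One step of minimizing NLL + lambda * constraint.

    params: list of tf.Variable (e.g., means, covariances, alphas, theta);
    negative_log_likelihood and constraint_term are placeholders for the
    actual QGMM expressions."""
    with tf.GradientTape() as tape:
        nll = negative_log_likelihood(params, data)
        penalty = constraint_term(params)   # should be driven toward zero
        loss = nll + lam * penalty
    grads = tape.gradient(loss, params)
    optimizer.apply_gradients(zip(grads, params))
    return loss
```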
< sumedhghaisas>
great. Okay, we also observed that alpha goes to zero in some cases, but it seems to me from your lambda change that that problem has been solved?
< KimSangYeon-DGU>
Right
< sumedhghaisas>
awesome
< sumedhghaisas>
let's also put that in the document :)
< KimSangYeon-DGU>
Got it
< sumedhghaisas>
very important observation indeed
< KimSangYeon-DGU>
I agree
< KimSangYeon-DGU>
I'll write it up in detail
< sumedhghaisas>
so the root of the problem is the unconstrained optimization, it seems
< KimSangYeon-DGU>
Yes
< sumedhghaisas>
so that is the conclusion of the 2 directions we followed
< sumedhghaisas>
okay last thing
< sumedhghaisas>
so for some cases lambda = 1 actually does the correct thing right?
< KimSangYeon-DGU>
Yeah,
< sumedhghaisas>
Could you try those with lambda = 53 and lambda = 153 or something?
< KimSangYeon-DGU>
I just tested it using lambda = 1
< sumedhghaisas>
I just want to make sure a high value of lambda doesn't break those cases
< KimSangYeon-DGU>
Ahh, I'll test it
< sumedhghaisas>
basically, we tried the failed cases with lower and higher lambda
< sumedhghaisas>
we should also check that a higher value of lambda doesn't break the normal cases
< KimSangYeon-DGU>
Yeah, I tried, but the result was not as good as with lambda = 53
< sumedhghaisas>
sorry, I didn't get that. You mean lambda = 53 breaks the normal case?
< KimSangYeon-DGU>
I used lambda = 53 in the cases where lambda = 1 had bad results.
< KimSangYeon-DGU>
Lambda = 53 has better results than lambda = 1 in some cases
< KimSangYeon-DGU>
In the 'Validity of the objective function' research, I found some cases that gave bad results, so I tried training with different lambda values; that is the 'Lambda impact' document
< sumedhghaisas>
correct
< sumedhghaisas>
I was referring to the cases where lambda = 1 has good results
< KimSangYeon-DGU>
By the way, I wrote the two documents with the theta that the paper presents, not the trainable variable. Thus, I think the results would be different if I test with the trainable theta
< KimSangYeon-DGU>
Yeah :)
< KimSangYeon-DGU>
Right
< sumedhghaisas>
Surely. We should create new documents for the trainable parameter changes. Take your time with the results. :)
< KimSangYeon-DGU>
Yeah :)
< KimSangYeon-DGU>
Hmm, about lambda: how can I find the best value for it?
< sumedhghaisas>
hmm yeah. always a problem with hyperparameters
< KimSangYeon-DGU>
Right
< sumedhghaisas>
there is no ideal way for it
< sumedhghaisas>
we can argue for now that there exists a good lambda for which it can be achieved
< sumedhghaisas>
we can improve it later
< KimSangYeon-DGU>
Yeah
< sumedhghaisas>
that's why I asked you to test the good cases with a high lambda
< sumedhghaisas>
that way we can say that, for now, a high lambda works best
< KimSangYeon-DGU>
That makes sense. I'll write documents about that.
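(One simple way to run that check is a plain sweep over lambda values; this is only a hypothetical driver, with `run_experiment` standing in for the actual training and evaluation code.)
```python
def lambda_sweep(run_experiment, lambdas=(1, 53, 153)):
    """Rerun the cases where lambda = 1 already works with higher lambda
    values, to check that a high lambda does not break them.

    run_experiment(lam) is a placeholder for training and evaluating one
    case with the given lambda; it should return the recorded metrics."""
    results = {}
    for lam in lambdas:
        results[lam] = run_experiment(lam)
        print("lambda = {}: {}".format(lam, results[lam]))
    return results
```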
< sumedhghaisas>
nice. Okay, another thing: I think we are at the point where we can also look at the multiple-cluster case
< sumedhghaisas>
I think we now have enough experience for it
< KimSangYeon-DGU>
So far, I have observed that too high a lambda interferes with the training process
< KimSangYeon-DGU>
Yeah
< KimSangYeon-DGU>
I'll write it up with the exact graphs and values
< KimSangYeon-DGU>
for clarification
< KimSangYeon-DGU>
I agree
< sumedhghaisas>
Great. Looking forward to the results. :)
< KimSangYeon-DGU>
Will test the multiple clusters as well
< sumedhghaisas>
I am still looking into how I can make the constrained optimization better; I have some ideas
< sumedhghaisas>
but they might need some work still
< sumedhghaisas>
we have a lot of options
< sumedhghaisas>
:)
< KimSangYeon-DGU>
Wow :)
< KimSangYeon-DGU>
I want to make more progress, so feel free to share them :)
< sumedhghaisas>
although now I think it's better to improve the constrained optimization before we move to multiple clusters :)
< sumedhghaisas>
hmmm
< sumedhghaisas>
okay, give a trial run for 3 or 4 clusters and see what's happening
< KimSangYeon-DGU>
Ah, okay
< sumedhghaisas>
do you have any questions regarding the multiple-cluster scenario?
< KimSangYeon-DGU>
About the data set,
< KimSangYeon-DGU>
Would it be a good idea to generate it using a GMM?
< KimSangYeon-DGU>
I mean classical GMM
< sumedhghaisas>
ohh, that too. We still haven't looked at whether QGMM is more powerful than GMM
< sumedhghaisas>
another research option :)
< KimSangYeon-DGU>
Wow
< KimSangYeon-DGU>
Good
< sumedhghaisas>
yeah, for now you can generate data using a GMM and train QGMM on it
< KimSangYeon-DGU>
Yeah
< sumedhghaisas>
or just create 5 Gaussians and sample from them
< sumedhghaisas>
that's much easier
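(Generating the toy data that way is straightforward; a small NumPy sketch, where the number of clusters, the means, and the covariances are arbitrary choices for illustration.)
```python
import numpy as np

np.random.seed(0)

# Create 5 two-dimensional Gaussians and sample from each to build a toy dataset.
means = [[0.0, 0.0], [5.0, 0.0], [0.0, 5.0], [5.0, 5.0], [2.5, 2.5]]
cov = np.eye(2)                      # shared identity covariance, for simplicity
samples_per_cluster = 200

data = np.vstack([np.random.multivariate_normal(m, cov, samples_per_cluster)
                  for m in means])
np.random.shuffle(data)              # mix the clusters together
```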
< KimSangYeon-DGU>
Okay :)
< sumedhghaisas>
the point is to see how feasible it is
< sumedhghaisas>
just a trial run
< KimSangYeon-DGU>
Got it, I'll keep that in mind
< sumedhghaisas>
let's also think about how we can set up experiments to prove QGMM is better than GMM
< sumedhghaisas>
maybe put some data in the middle of the 2 clusters and see if the phase angle changes accordingly
< sumedhghaisas>
the phase angle should change with the amount of data between the 2 clusters
< KimSangYeon-DGU>
Great; with a 3D plot, we can see them more clearly
< sumedhghaisas>
yup
< sumedhghaisas>
let's think on this until we generate the remaining results.
< sumedhghaisas>
I am sure we can come up with a good plan for this research direction as well
< KimSangYeon-DGU>
Yeah
< sumedhghaisas>
Great. :) Do you have any more updates? For the next meeting, feel free to set it up anytime later this week.
< KimSangYeon-DGU>
Okay :)
< KimSangYeon-DGU>
When I'm done with the trainable theta and high lambda, I'll ping you
< sumedhghaisas>
Coolio.
< sumedhghaisas>
Best of luck with the trainable variables
< KimSangYeon-DGU>
And then I'll work on the multiple-cluster and theta angle research.
< KimSangYeon-DGU>
Thanks!
< KimSangYeon-DGU>
Ah, first, I should check whether the angle is correct while optimizing :)
k3nz0_ has quit [Quit: Leaving]
ImQ009 has joined #mlpack
KimSangYeon-DGU has quit [Remote host closed the connection]
KimSangYeon-DGU has joined #mlpack
vivekp has joined #mlpack
KimSangYeon-DGU has quit [Ping timeout: 260 seconds]
< lozhnikov>
rcurtin, zoq: Hello, I added some research on different implementations of Dictionary Encoding.