ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
abernauer has joined #mlpack
< abernauer> rcurtin: Yeah, I will take your advice and go back to the hand-written C approach. Rcpp's attributes feature and the compiler were having issues converting a reference to an Armadillo matrix to type SEXP (a pointer to a SEXPREC C struct, essentially a binary tree), which makes sense.
< abernauer> Dealing with memory and garbage collection might come up again, but I will worry about that later.
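For reference, a minimal sketch of the hand-written .Call-style binding being discussed, assuming RcppArmadillo is available for the SEXP conversions; the function name and body are illustrative, not mlpack's actual binding code:

    #include <RcppArmadillo.h>

    // Hand-written entry point for R's .Call() interface: everything
    // crosses the R/C++ boundary as SEXP, and the conversion to
    // arma::mat is done explicitly rather than relying on Rcpp
    // attributes to convert a reference parameter automatically.
    extern "C" SEXP mlpack_method(SEXP dataSEXP)
    {
      // Rcpp::as<> copies the R matrix into an Armadillo matrix.
      arma::mat data = Rcpp::as<arma::mat>(dataSEXP);

      // ... call into mlpack with `data` here ...

      // Convert the result back into an R object.
      return Rcpp::wrap(data);
    }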
xiaohong has joined #mlpack
KimSangYeon-DGU has quit [Remote host closed the connection]
< rcurtin> abernauer: sounds good
< abernauer> rcurtin: I'm interested in working on that issue with converting methods to call by value, but I'm first going to see how this week goes before committing to that. It's a decent opportunity to improve my C++11 knowledge, though.
< rcurtin> sounds good, happy to review a PR when it's ready
abernauer has quit [Remote host closed the connection]
Neo22 has joined #mlpack
Neo22 has left #mlpack []
jeffin143 has quit [Read error: Connection reset by peer]
< jenkins-mlpack2> Project docker mlpack nightly build build #409: STILL UNSTABLE in 3 hr 27 min: http://ci.mlpack.org/job/docker%20mlpack%20nightly%20build/409/
vivekp has joined #mlpack
< lozhnikov> jeffin143: Could you add the 10th blog post?
sumedhghaisas has joined #mlpack
xiaohong has quit [Ping timeout: 246 seconds]
KimSangYeon-DGU has joined #mlpack
< KimSangYeon-DGU> sumedhghaisas: Hi Ghaisas, I sent a message about the research documents and videos on Hangouts; could you please check it?
vivekp has quit [Ping timeout: 245 seconds]
< sumedhghaisas> KimSangYeon-DGU: Hey Kim
< KimSangYeon-DGU> Hey!
< sumedhghaisas> I am just going through the results of changing the distance
< sumedhghaisas> So changing the distance does affect the training right?
< KimSangYeon-DGU> Right
< sumedhghaisas> even if the Phi is 180?
< KimSangYeon-DGU> Yeah
< sumedhghaisas> so in Test case 2 ... the final phi after training is around 90?
< KimSangYeon-DGU> However, when phi is 180, the initial positions should be close to each center of the observations
< KimSangYeon-DGU> Wait a moment.
< KimSangYeon-DGU> When phi is 0, 0, the final phi is 0; when phi is 45, -45, the final phi is 30.31, -30.31; and when phi is 90, -90, the final phi is 97, -97
< KimSangYeon-DGU> There are three initial values of phi per test case
< KimSangYeon-DGU> I wrote the values in Appendix C-3 Phi.
< sumedhghaisas> wait, I am confused; why are there 2 values for the 2 clusters? I thought we modeled them as a single value
< KimSangYeon-DGU> Ahh, in the equation (8) in the original paper, it's represented as a subtraction.
< KimSangYeon-DGU> So, I wrote the code like that...
< sumedhghaisas> I see...
< KimSangYeon-DGU> Ahh, in the previous discussion, you mentioned the subtraction.
< KimSangYeon-DGU> So, I wrote the code like that.
< KimSangYeon-DGU> Hmm.. am I wrong??...
< sumedhghaisas> ahh no that is fine no problem
< sumedhghaisas> so I am looking at Appendix C-3 Phi
< KimSangYeon-DGU> Yeah.
< sumedhghaisas> Could you train case d more so that the Phi settles?
< sumedhghaisas> I think the phi is still moving in the graph
< KimSangYeon-DGU> Yeah
< KimSangYeon-DGU> I'll do that
< sumedhghaisas> hmm... these are interesting results but hard to understand
< sumedhghaisas> so they settled only when the distance was too big
< KimSangYeon-DGU> Yeah, when initial phi is 180.
< sumedhghaisas> but then the means were too close to the clusters
< KimSangYeon-DGU> Yeah
< KimSangYeon-DGU> When phi is 0 or 90, it can find the clusters correctly.
< KimSangYeon-DGU> But...
< KimSangYeon-DGU> Phi 180 couldn't
< KimSangYeon-DGU> From what I observed, the means tend to be close to each other when phi is 180.
< KimSangYeon-DGU> I guess the phi would represent the cohesion of the clusters
< sumedhghaisas> In the next research, with the crazy dataset as we call it
< sumedhghaisas> the results are interesting
< sumedhghaisas> when the phi is 0 the clusters are further apart
< KimSangYeon-DGU> I'm really surprised at your idea.
< KimSangYeon-DGU> Wait a moment
< sumedhghaisas> although the objective function is going negative
< sumedhghaisas> we need to fix that
< sumedhghaisas> can you analyze what value is making it negative?
< sumedhghaisas> the constraint seems to be positive
< sumedhghaisas> so it's the log-likelihood that is negative
< sumedhghaisas> that is strange
< KimSangYeon-DGU> Ahh, yes
< KimSangYeon-DGU> I suspect the constraint
< KimSangYeon-DGU> because it is so jagged
< KimSangYeon-DGU> In Appendix A-1
< KimSangYeon-DGU> I'll look into it
< KimSangYeon-DGU> Ahh, I know it is because of the unconstrained optimization
< KimSangYeon-DGU> * I see
< KimSangYeon-DGU> In Appendix C, I increased the lambda, and then the NLL isn't negative.
< KimSangYeon-DGU> The only difference between A and C is the lambda
< sumedhghaisas> I see, we didn't constrain the probabilities themselves
< sumedhghaisas> if they are above 1 the objective function will be negative
< KimSangYeon-DGU> I agree
< sumedhghaisas> can you try clipping the gradients so that the values of probabilities do not go above 1?
< sumedhghaisas> basically check after each update if the probabilities are more than 1
< sumedhghaisas> if they are clip them back to 1
< sumedhghaisas> it's a little harder to do
< sumedhghaisas> but it's worth a try
< KimSangYeon-DGU> Yeah, I'll try
< sumedhghaisas> this is one way
< KimSangYeon-DGU> Yeah
< sumedhghaisas> another is to add more constraint
< sumedhghaisas> basically add each probability as a Lagrangian constraint
< KimSangYeon-DGU> Ahh, right
< sumedhghaisas> but that won't make a difference, as each of them is already in the Lagrangian
< sumedhghaisas> hmmm
< sumedhghaisas> okay, try clipping first; let's try to figure out more on this
< KimSangYeon-DGU> Yes
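A minimal sketch of the clipping step being suggested, assuming the cluster probabilities live in an arma::vec that has just received a gradient update; the function name is illustrative:

    #include <armadillo>

    // After each gradient update, clip any probability that escaped
    // the valid range back into [0, 1], as suggested above.
    void ClipProbabilities(arma::vec& probabilities)
    {
      probabilities = arma::clamp(probabilities, 0.0, 1.0);
    }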
< KimSangYeon-DGU> Ahh, Ghaisas, actually, I tried to restrict our probability to 1; however, I read this link https://www.researchgate.net/post/Probabilities_values_in_a_Gaussian_Mixture_Model_are_very_very_big_Is_it_possible_Or_they_should_be_in_0_1
< KimSangYeon-DGU> Does it apply to our problem?
< sumedhghaisas> yes, that's right
< KimSangYeon-DGU> They said the probability can't be 1.
< sumedhghaisas> technically the probability can be bigger than 1
< sumedhghaisas> Although the log is creating the problem
< KimSangYeon-DGU> Ahh...
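To make that point concrete, a small worked illustration (assuming a univariate Gaussian component): a density can exceed 1 whenever the variance is small enough,

    \mathcal{N}(x \mid \mu, \sigma^2) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{(x - \mu)^2}{2\sigma^2}\right),
    \qquad
    \mathcal{N}(\mu \mid \mu, 0.2^2) = \frac{1}{0.2\sqrt{2\pi}} \approx 1.99.

Since \log 1.99 \approx 0.69 > 0, such a point contributes a negative term to the negative log-likelihood, and with enough of them the whole objective can go negative; densities, unlike event probabilities, are not bounded by 1.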
< sumedhghaisas> Could you actually investigate and see what exactly is generating the negative value?
< KimSangYeon-DGU> Yeah
< KimSangYeon-DGU> I'll try
< sumedhghaisas> rather than guessing, let's actually see what it is
< sumedhghaisas> thanks :)
< KimSangYeon-DGU> Oh, thanks
< sumedhghaisas> I need to go to another meeting right now. But I will go over the documents again and see if I spot more.
< KimSangYeon-DGU> I'll write the findings about the probability
< KimSangYeon-DGU> Ahh yes
jeffin143 has joined #mlpack
jeffin143 has left #mlpack []
favre49 has joined #mlpack
sumedhghaisas has quit [Quit: Ping timeout (120 seconds)]
< favre49> zoq KimSangYeon-DGU: I'm still having difficulty comprehending the code and translating it to C++; it makes no sense to me. Can either of you help me?
< KimSangYeon-DGU> favre49: Surely
< KimSangYeon-DGU> Actually, I'm familiar with the algorithm, but I can help you as much as I can
< KimSangYeon-DGU> Oops
< KimSangYeon-DGU> *I'm not familiar
< KimSangYeon-DGU> favre49: Is there any pull request about it?
favre4954 has joined #mlpack
< favre4954> KimSangYeon-DGU No issues, it's just the code snippet I sent you earlier
< KimSangYeon-DGU> favre4954: Yeah, is there any pull request about it?
< KimSangYeon-DGU> for mlpack?
< KimSangYeon-DGU> *in mlpack
< favre4954> The paper doesn't explain how it finds the extreme points, so this is the only resource I have
< favre4954> No, I haven't updated the current state of the code on that PR in a while
< KimSangYeon-DGU> Ahh... Okay, I'll go through it again
favre49 has quit [Ping timeout: 260 seconds]
< KimSangYeon-DGU> F is a 3D matrix, right?
< favre4954> I think so
< favre4954> https://pastebin.com/QPWeJuZj is how far I've gotten
< favre4954> In my case I have made F a matrix, where each column is the objective vector of a population member
< KimSangYeon-DGU> Ahh
< KimSangYeon-DGU> Can you let me know the type of the extreme points?
< KimSangYeon-DGU> and `ideal_point`
< KimSangYeon-DGU> I guess it is a 3D matrix
< favre4954> I'll try to find out and get back to you
< KimSangYeon-DGU> Yeah
< KimSangYeon-DGU> I'll look into it as well
< KimSangYeon-DGU> favre4954: If you leave a message, I'll read it after I'm back home.
< KimSangYeon-DGU> I also read some references about NSGA-III
KimSangYeon-DGU has quit [Remote host closed the connection]
< favre4954> KimSangYeon-DGU I'll try to look into it more tonight too. This is what's stalling my progress, for the most part
favre4954 has quit [Remote host closed the connection]
ImQ009 has joined #mlpack
xiaohong has joined #mlpack
KimSangYeon-DGU has joined #mlpack
xiaohong has quit [Ping timeout: 264 seconds]
jeffin143 has joined #mlpack
vivekp has joined #mlpack
jeffin143 has quit [Remote host closed the connection]
vivekp has quit [Ping timeout: 268 seconds]
< zoq> favre4954: Sorry for the slow response; which part? Would it be helpful if I implement the asf function?
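For reference, the achievement scalarization function (asf) being discussed, as it is standardly defined in the NSGA-III literature (with translated objectives f_i' and one weight vector w per objective axis):

    \operatorname{ASF}(x, w) = \max_{i = 1, \dots, M} \frac{f_i'(x)}{w_i}

The extreme point along each objective axis is the population member minimizing the ASF for that axis's weight vector.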
KimSangYeon-DGU has quit [Ping timeout: 260 seconds]
favre49 has joined #mlpack
< favre49> zoq: I don't think that will be necessary, hopefully
< favre49> So F is a 2D matrix, which is row-major instead of column-major
< favre49> I'm still stuck on the same line as before, though - how does multiplication of a 2D matrix by a 3D matrix work in this case?
< favre49> And I still don't understand what the axis argument is doing.
< zoq> favre49: Let me open the code.
favre49 has quit [Remote host closed the connection]
< zoq> favre49: asf isn't a 3D matrix, it's 2D.
favre49 has joined #mlpack
< zoq> favre49: Or do you mean: asf[:, None, :]
< favre49> Yes that's what I meant
< favre49> Isn't that reshaping it into a cube? Or have I misunderstood?
< zoq> favre49: Right, so it looks like the multiplication is elementwise
< favre49> As in? I don't get it, I'm sorry
< favre49> so in the case A * B[:, None, :], it multiplies A(1,1) with B(1,1), A(1,2) with B(1,2), and so on?
< zoq> favre49: I think so, let me put together a simple example
< favre49> Alright, thanks!
ImQ009 has quit [Quit: Leaving]
< zoq> favre49: So it's A * B.row()
< zoq> favre49: I think that is a strange way to write that operation.
< favre49> Yes, it's not very legible. I'm not too sure why it's done like this
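A sketch of the elementwise reading zoq describes, assuming F stores one objective vector per column (favre49's convention above) and `weights` stores one weight vector per column, with matching dimensions; the function and variable names are illustrative, not the actual PR code:

    #include <armadillo>

    // Mirrors numpy's F * asf[:, None, :] followed by a max over the
    // broadcast axis: multiply every objective vector elementwise by
    // each weight vector, then take the max over the objectives.
    arma::mat BroadcastMultiplyMax(const arma::mat& F,
                                   const arma::mat& weights)
    {
      arma::mat asfValues(weights.n_cols, F.n_cols);
      for (size_t i = 0; i < weights.n_cols; ++i)
      {
        // Elementwise product of each column of F with the i-th
        // weight vector ("A * B.row()" in the discussion above).
        const arma::mat scaled = F.each_col() % weights.col(i);
        // Column-wise max: one scalar per population member.
        asfValues.row(i) = arma::max(scaled, 0);
      }
      return asfValues;
    }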
< favre49> Thanks for the help, I think I can do it now. I'll try implementing it in a couple of hours; I need a nap
< zoq> Sure, have a good sleep.
< favre49> Thanks :)
favre49 has quit [Remote host closed the connection]
< zoq> As I said, happy to implement the method.
< zoq> In case you run into any issues.
jeffin143 has joined #mlpack
< jeffin143> lozhnikov: Probably we should use a template parameter instead, since the comparison is performed inside the loop.
< jeffin143> I didn't understand; where should I put the template parameter?
< lozhnikov> jeffin143: Never mind. I am not quite sure. Probably the compiler is able to optimize the comparison out.
< jeffin143> Also, about the enum class
< jeffin143> Should I go with an enum class, or will bool work?
< lozhnikov> jeffin143: Yes, enum class is definitely better than a number of bools.
< jeffin143> Ok, then I will try to rewrite it.
< lozhnikov> Since smooth_idf doesn't depend on the other variables, it can be a bool.
jeffin143 has quit [Read error: Connection reset by peer]
jeffin143 has joined #mlpack
< lozhnikov> But sublinear_tf, term_frequency, and binary should go into an enum class.
< lozhnikov> Besides, we use camel case for all variable names.
< jeffin143> Ok
< jeffin143> I will make those changes
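A minimal sketch of the option layout lozhnikov describes; the type and member names here are illustrative, not the final mlpack API:

    // Mutually exclusive term-frequency weighting schemes collapse
    // into a single enum class instead of several bools.
    enum class TfTypes
    {
      RAW_COUNT,    // plain term frequency
      BINARY,       // 0/1 occurrence
      SUBLINEAR_TF  // 1 + log(tf)
    };

    class TfIdfEncodingPolicy
    {
     public:
      TfIdfEncodingPolicy(const TfTypes tfType = TfTypes::RAW_COUNT,
                          const bool smoothIdf = true) :
          tfType(tfType),
          smoothIdf(smoothIdf)
      { }

     private:
      // Camel-case member names, per the style note above.
      TfTypes tfType;
      bool smoothIdf;  // Independent of the others, so a plain bool.
    };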
jeffin has joined #mlpack
jeffin143 has quit [Ping timeout: 245 seconds]
< rcurtin> hey everyone, I think that it's time to make another release
< rcurtin> (actually that time was probably a while ago but it didn't help that I was traveling over the summer)
< rcurtin> so we will have to figure out which PRs are "almost ready" and worth waiting for, and which we can incorporate in a future release
< rcurtin> if there is interest we can have another mlpack video meeting about it; I'll send an email to the list in the upcoming days
< rcurtin> :)
< zoq> Count me in for the video meeting.
favre49 has joined #mlpack
favre49 has quit [Remote host closed the connection]