verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
jbc_ has quit [Quit: jbc_]
< naywhayare>
stephentu: what's his lack of belief stem from? the lack of scalability?
< naywhayare>
technically you could see choosing a kernel correctly as manifold learning, if it manages to map the non-linearly-separable points into a kind-of-linearly-separable set of points in the kernel space
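[editor's note: the point above can be made concrete with a toy example. XOR is the classic non-linearly-separable set, but a kernel perceptron with a Gaussian kernel separates it, because the kernel implicitly maps the points into a space where they are linearly separable. This is an illustrative sketch, not mlpack code:]

```python
import numpy as np

# XOR: the four points are not linearly separable in the input space.
X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
y = np.array([-1., 1., 1., -1.])

def rbf(A, B, gamma=1.0):
    # Gaussian (RBF) kernel matrix between the rows of A and B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

K = rbf(X, X)

# Kernel perceptron: a linear separator in the implicit feature space.
alpha = np.zeros(len(X))
for _ in range(10):
    for i in range(len(X)):
        if y[i] * ((alpha * y) @ K[:, i]) <= 0:
            alpha[i] += 1.0

pred = np.sign(K @ (alpha * y))  # classifies all four XOR points correctly
```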
< stephentu>
naywhayare: i think he thinks that the manifold assumption is not correct--
< stephentu>
but i sort of agree with your point of view
< stephentu>
like if you were to learn a manifold
< stephentu>
and then run, say, a gaussian kernel
< naywhayare>
I wouldn't be surprised if the manifold assumption isn't always right
< stephentu>
it seems quite redundant
< naywhayare>
but it only needs to be approximately right :)
< stephentu>
what i don't understand is
< stephentu>
why did it die down
< stephentu>
like does it not work well in practice
< stephentu>
did neural nets take over everything
< naywhayare>
manifold learning? it died because you can't apply it to more than like 10k points
< stephentu>
ah
< stephentu>
so scalability
< stephentu>
kind of like sdps
< stephentu>
haha
< stephentu>
why do i always like things that don't scale
< naywhayare>
well, I think there is a solution out there somewhere :)
< naywhayare>
but I mean, scalability has (kind of) been achieved for kernel methods
< stephentu>
how so?
< naywhayare>
embeddings, for instance
< naywhayare>
Nystroem approximations (sample your input points)
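[editor's note: the Nystroem idea is to sample m landmark points, compute only the n-by-m cross-kernel, and build finite-dimensional features whose inner products approximate the full kernel matrix. A minimal sketch with a Gaussian kernel; all names and parameter values are illustrative:]

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Gaussian kernel matrix between the rows of A and B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def nystroem_features(X, n_landmarks, gamma=1.0, seed=0):
    """Feature map Z such that Z @ Z.T approximates the full kernel matrix."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=n_landmarks, replace=False)
    landmarks = X[idx]
    K_mm = rbf_kernel(landmarks, landmarks, gamma)  # small m x m kernel
    K_nm = rbf_kernel(X, landmarks, gamma)          # n x m cross-kernel
    # Z Z^T = K_nm K_mm^{-1} K_mn, via an eigendecomposition of K_mm.
    w, V = np.linalg.eigh(K_mm)
    w = np.maximum(w, 1e-12)
    return K_nm @ V / np.sqrt(w)

X = np.random.default_rng(1).standard_normal((300, 2))
Z = nystroem_features(X, n_landmarks=100, gamma=0.1)
err = np.abs(Z @ Z.T - rbf_kernel(X, X, gamma=0.1)).max()
```

The point of the trick: downstream linear methods on Z cost O(n m^2) instead of the O(n^3) of working with the full kernel matrix.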
< stephentu>
do these techniques not work for manifold learning
< naywhayare>
sampling often doesn't work for manifold learning because of the lack of out-of-sample extensions
< naywhayare>
so if I put in new points, it's sometimes difficult (depending on the method) to map them to the unfolded manifold or whatever
< naywhayare>
there is a paper that suggests how this can be done, but I don't remember the exact details
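[editor's note: one published approach (a Nystroem-style formula, in the spirit of Bengio et al.'s out-of-sample extensions for spectral embeddings; it is unclear whether this is the paper meant above) embeds a new point by projecting its centered kernel row onto the training eigenvectors. A sketch for kernel PCA, with illustrative names throughout:]

```python
import numpy as np

def gaussian_kernel(A, B, gamma=1.0):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def kpca_fit(X, n_components=2, gamma=1.0):
    n = len(X)
    K = gaussian_kernel(X, X, gamma)
    # Double-center the kernel matrix: Kc = H K H with H = I - 11^T/n.
    H = np.eye(n) - np.full((n, n), 1.0 / n)
    Kc = H @ K @ H
    w, V = np.linalg.eigh(Kc)
    # Keep the top components (eigh returns eigenvalues in ascending order).
    w, V = w[::-1][:n_components], V[:, ::-1][:, :n_components]
    return {"X": X, "K": K, "V": V, "w": w, "gamma": gamma}

def kpca_embed_train(model):
    # Embedding of the training points themselves: sqrt(lambda) * V.
    return model["V"] * np.sqrt(np.maximum(model["w"], 0))

def kpca_out_of_sample(model, X_new):
    # Embed unseen points without recomputing the decomposition:
    # center the new kernel row consistently, then project onto V.
    X, K, V, w = model["X"], model["K"], model["V"], model["w"]
    k = gaussian_kernel(X_new, X, model["gamma"])
    k_centered = k - k.mean(1, keepdims=True) - K.mean(0) + K.mean()
    return k_centered @ V / np.sqrt(np.maximum(w, 1e-12))
```

For a training point the formula reproduces its training embedding exactly; the manifold-learning difficulty is that many methods (MVU, Isomap variants) don't define such a kernel row for unseen points in the first place.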
< stephentu>
i see
< stephentu>
the fact that you aren't really learning an f
< stephentu>
but just mapping the points in 1:1
< stephentu>
interesting
< stephentu>
question: how were you going to use MVU in your research
< naywhayare>
at the time, I wasn't sure what I wanted my research to be
< stephentu>
ah, so i'm like you 3 years ago now
< stephentu>
lol
< stephentu>
any advice
< naywhayare>
5 years :(
< naywhayare>
:)
< naywhayare>
my advice is just to find something that you enjoy
< naywhayare>
performing a random walk through the field until you find something sufficiently interesting is a good way to do it, in my opinion
< naywhayare>
you get good breadth like that
< stephentu>
haha that's pretty much been what i'm doing
< naywhayare>
anyway, one of my interests was to find out how to scale nonlinear dimensionality reduction techniques
< naywhayare>
but after failing to get LRSDP+MVU working I kind of moved different ways, because I simultaneously noticed some improvements and nice abstractions for problems like nearest neighbor search
< naywhayare>
it kind of snowballed from there, and then suddenly I had more ideas than time...
< stephentu>
cool
< stephentu>
in his papers I think he says he uses some variant of LRSDP
< stephentu>
i guess he had some magic sauce?
curiousguy13 has joined #mlpack
curiousguy13 has quit [Ping timeout: 244 seconds]
< naywhayare>
I have his old code
< naywhayare>
it's incomprehensible
< naywhayare>
it's actually in the git history if you dig far enough, under like u/nvasil/mvu/ or something like that
< naywhayare>
anyway, I think that maybe he got it to converge for one particular set of parameters once
< naywhayare>
I don't think the results are bullshit but I think, as you've noticed, that LRSDP is so finicky and not well understood that it's basically unreproducible without the magical set of parameters and penalty schedule
< naywhayare>
I never got his original code to converge either...
< naywhayare>
at the same time, it wasn't documented at all, so I may have been just using it wrong
curiousguy13 has joined #mlpack
curiousguy13 has quit [Ping timeout: 272 seconds]
naywhayare has quit [Ping timeout: 240 seconds]
naywhaya1e has joined #mlpack
stephent1 has joined #mlpack
stephentu has quit [Ping timeout: 244 seconds]
jbc_ has joined #mlpack
dhfromkorea has joined #mlpack
dhfromkorea has quit [Remote host closed the connection]
karthikabinav has joined #mlpack
jbc_ has quit [Quit: jbc_]
jbc_ has joined #mlpack
jbc_ has quit [Ping timeout: 265 seconds]
vedhu63w has joined #mlpack
karthikabinav has quit []
Kallor has quit [Remote host closed the connection]
< vedhu63w>
Hi! Does mlpack contain test data on which i could probably check the results of the algorithms
< stephent1>
vedhu63w: not like sklearn has
< stephent1>
vedhu63w: i assume you're looking for something like MNIST
jbc_ has joined #mlpack
< vedhu63w>
or maybe the one that developer uses for testing?
< stephent1>
for testing, each module typically has a suite of test cases
< stephent1>
and those typically have toy datasets
< stephent1>
you can see in the tests folder if you are curious
< vedhu63w>
ohh! thanks
jbc_ has quit [Ping timeout: 245 seconds]
vedhu63w has quit [Quit: Leaving]
vedhu63w has joined #mlpack
Kallor has joined #mlpack
jbc_ has joined #mlpack
stephent1 has quit [Ping timeout: 264 seconds]
jbc_ has quit [Ping timeout: 245 seconds]
babel42 has joined #mlpack
< babel42>
hello
Kallor has quit [Remote host closed the connection]
jbc_ has joined #mlpack
jbc_ has quit [Ping timeout: 265 seconds]
jbc_ has joined #mlpack
< zoq>
babel42: Hello!
jbc_ has quit [Ping timeout: 272 seconds]
jbc_ has joined #mlpack
< babel42>
Hi zoq. I'm interested in participating in gSoC'15 can you help me get started?
jbc_ has quit [Ping timeout: 252 seconds]
< zoq>
babel42: The best way to get started is to download mlpack and compile it from source, then use it for some simple machine learning tasks. The tutorials might prove helpful: http://www.mlpack.org/tutorial.html
< zoq>
Once you've got a basic feel for mlpack programs and source, you can take a look at the list of open tickets; you might find something interesting: https://github.com/mlpack/mlpack/issues.
< zoq>
Most of them are marked with a difficulty, so that might help you figure out some issues that you can handle. We are always interested in new algorithms, so if you're interested in some particular field I think we can figure something out.
< zoq>
Also, be aware that Google hasn't selected orgs yet. We've participated in the past years, but this is no guarantee they'll select us again.
< babel42>
I like ML and am doing a course on it this semester, so I thought this would be good practice
< zoq>
Yeah, it's a great opportunity.
Kallor has joined #mlpack
babel42 has quit [Ping timeout: 246 seconds]
jbc_ has joined #mlpack
jbc_ has quit [Ping timeout: 256 seconds]
Kallor has quit [Remote host closed the connection]
jbc_ has joined #mlpack
jbc_ has quit [Ping timeout: 265 seconds]
hritikj has joined #mlpack
hritikj has quit [Ping timeout: 246 seconds]
jbc_ has joined #mlpack
jbc_ has quit [Ping timeout: 272 seconds]
jbc_ has joined #mlpack
stephentu has joined #mlpack
jbc_ has quit [Quit: jbc_]
< stephentu>
naywhaya1e: ever thought of using travis CI for builds?
< stephentu>
then we can hook up each PR
< stephentu>
to see if the tests pass
< naywhaya1e>
stephentu: thought of it, yes, had time to set it up, no :(
< naywhaya1e>
definitely a good idea to test each PR
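[editor's note: a minimal `.travis.yml` for a CMake-based C++ project like mlpack might look like the sketch below; the apt package list and the test binary name are assumptions, not mlpack's actual configuration:]

```yaml
language: cpp
compiler:
  - gcc
# Assumed dependency set; mlpack needs at least Armadillo and Boost.
install:
  - sudo apt-get update -qq
  - sudo apt-get install -qq libarmadillo-dev libboost-all-dev libxml2-dev
script:
  - mkdir build && cd build
  - cmake ..
  - make -j2
  - bin/mlpack_test
```

With the GitHub integration enabled, Travis then builds every pull request and reports pass/fail on the PR itself.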
< stephentu>
naywhayare: i have had some experience w/ travis CI
< stephentu>
i can take a stab at it
< naywhayare>
sure, feel free :)
< naywhayare>
do you need any permissions or anything?
< stephentu>
naywhayare: we'll see
< stephentu>
i'll ping you
< stephentu>
naywhayare: but alas, today i must do a pset