verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
stephentu has joined #mlpack
< stephentu> naywhayare: there are two kinds of ML algorithms
< stephentu> ones that you can prove shit about
< stephentu> and ones that work in practice
< stephentu> if you try to do both your life really sucks
< stephentu> haha
< naywhayare> stephentu: truth! :)
< naywhayare> one of the things that has really interested me lately is deriving runtime bounds that depend on easily calculable measures of the dataset
< naywhayare> worst-case analysis is somewhat pointless: you never have a worst-case dataset in practice (well... depending, but generally, yeah, datasets are well-behaved to some extent)
< naywhayare> so if I can come up with a measure of "goodness" of a dataset for a particular task, and I know how algorithms fare with respect to that measure, I can make an informed decision about which algorithm to use
< naywhayare> in the nearest neighbor search literature, at least, there is not very much in this vein right now
< naywhayare> I'm hopeful that I can find some time to think about this type of stuff in the future :)
< naywhayare> what is the algorithm that you are working on? a matrix completion algorithm? :)
< stephentu> naywhayare: that'd be an awesome thing to have, seems quite difficult for optimization algorithms
< stephentu> naywhayare: this stupid idea to try to make stochastic algorithms parallelizable and still equivalent to some serial order
< stephentu> naywhayare: there are enough matrix completion algorithms
< naywhayare> ah, okay
< stephentu> the craziest one i've seen being
< naywhayare> well hopefully the submission goes well :)
< stephentu> gradient descent on an orthogonal group manifold
< stephentu> as if anybody is actually going to implement that
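(For reference, a "gradient descent on an orthogonal group manifold" step usually amounts to projecting a Euclidean gradient onto the tangent space at the current iterate and then retracting back onto the manifold. The sketch below, in NumPy, shows one common way to do that with a QR-based retraction; it is only an illustration of the general technique, not any specific paper's algorithm, and the function name is made up for the example.)

    import numpy as np

    def orthogonal_group_step(Q, euclidean_grad, step_size):
        """One hypothetical gradient step constrained to the orthogonal group.

        Q is the current iterate (square, with Q.T @ Q = I) and
        euclidean_grad is the gradient of the objective at Q taken in
        ordinary Euclidean space."""
        # Project the Euclidean gradient onto the tangent space at Q:
        # tangent vectors at Q have the form Q @ A with A skew-symmetric.
        A = Q.T @ euclidean_grad
        riemannian_grad = Q @ (0.5 * (A - A.T))

        # Step in the ambient space, then retract back onto the manifold
        # via a QR factorization (one standard choice of retraction).
        Y = Q - step_size * riemannian_grad
        Q_new, R = np.linalg.qr(Y)
        # Flip column signs so the diagonal of R is nonnegative, which
        # makes the retraction deterministic.
        signs = np.sign(np.sign(np.diag(R)) + 0.5)
        return Q_new * signs

The retraction that keeps every iterate exactly on the manifold is the part that makes methods like this awkward to implement, let alone parallelize.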
< naywhayare> heh
< stephentu> the idea here is
< stephentu> what if we didn't have to write a new NIPS/ICML paper
< stephentu> every time we wanted to parallelize some ML algorithm
< stephentu> although it's unclear to me if that's a desirable goal
< stephentu> since we'll have fewer things to work on
< naywhayare> maybe, but it's worth noting that most abstractions of that type ("free parallelization!") come at some cost, and someone could come in and do a better job by hand
< naywhayare> still, I think it would be a great thing to have, if it works
< stephentu> oh ya
< stephentu> it has
< stephentu> not just some cost
< stephentu> but
< stephentu> A LOT of cost
< stephentu> that's why i've been raging
< stephentu> trying to figure out clever ways to cut the costs
< stephentu> there really is no free lunch :(
< naywhayare> yeah; at the very least, if you can say "the cost is overridden if you have enough processors", then that's good enough
< stephentu> well in our case we're hoping it's "the cost is overridden if your gradients are super expensive"
< stephentu> like if the gradient is ComputeDigitsOfPi(10000)
< naywhayare> ah, okay, I see
< stephentu> then i think we're good
< stephentu> or like mining bitcoins
< naywhayare> I've had to spend a lot of space in the NIPS paper I'm working on outlining the use case
< naywhayare> it's a k-means algorithm, and it's fast only when k is large and the number of points is also large
< naywhayare> but justifying that k is actually large in practice... that took some digging to find out that people were actually doing that
< stephentu> interesting
< naywhayare> I feel like I have to devote an entire page of the submission to justifying that, though; otherwise a reviewer goes, without searching, "yeah but nobody is actually doing that"
< stephentu> i applaud you for still innovating on k-means
< stephentu> i would have thought everything was done
< naywhayare> that's what I thought too; personally, I don't think there's any huge innovation
< naywhayare> I've taken off-the-shelf dual-tree nearest neighbor search techniques, observed that k-means is nearest neighbor search, and put the two together
< naywhayare> plus some other exploitation of the problem structure to get additional speedup; for example, sometimes the nearest centroid of a point can't change between iterations
< naywhayare> you need low-dimensional data (like, below 50 dimensions), large datasets (several hundred thousand points and up), and large k (500? 1000? in that ballpark or larger) to see any speedup though
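(The two observations above, that the k-means assignment step is just nearest neighbor search of each point against the centroids and that a point's nearest centroid often can't change between iterations, can be sketched in a few lines. The version below uses brute-force search as the fallback and a simple Hamerly-style distance-bound prune; it is only an illustration of the idea, not the dual-tree algorithm being discussed, and all of the names are invented for the example.)

    import numpy as np

    def assignment_step(points, centroids, owners, upper, lower):
        """One k-means assignment pass, phrased as nearest neighbor search
        of every point against the set of centroids.

        owners[i] is the index of point i's current nearest centroid,
        upper[i] is an upper bound on its distance to that centroid, and
        lower[i] is a lower bound on its distance to every other centroid."""
        for i in range(len(points)):
            # Prune: if the upper bound on the distance to the current
            # owner already beats the lower bound on every other centroid,
            # the nearest centroid cannot have changed, so skip the search.
            if upper[i] <= lower[i]:
                continue
            # Otherwise do exact nearest neighbor search over the centroids
            # (a real implementation would use a tree here, not brute force).
            dists = np.linalg.norm(centroids - points[i], axis=1)
            order = np.argsort(dists)
            owners[i] = order[0]
            upper[i] = dists[order[0]]
            lower[i] = dists[order[1]]
        return owners, upper, lower

    def loosen_bounds(upper, lower, owners, centroid_movement):
        """After the centroids move, loosen the bounds by how far each
        centroid moved so they remain valid for the next pass."""
        upper += centroid_movement[owners]
        # No non-owner centroid moved by more than the largest movement,
        # so subtracting it keeps the lower bound valid.
        lower -= centroid_movement.max()
        return upper, lower

In this style of bound-based pruning, points for which the prune fires never have to search the centroids at all, which is where most of the per-iteration savings come from.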
< naywhayare> we'll see if the reviewers think it's interesting :) last year, they didn't, but I've since improved the algorithm significantly
< stephentu> naywhayare: sounds cool
< stephentu> good luck!
< stephentu> i'll be going to NIPS regardless
< stephentu> so we can hang out in the cold
< naywhayare> :)
< naywhayare> I should be there too
< naywhayare> I'll be at ICML in France also, if you are going
< naywhayare> anyway, I'm getting pretty tired... gonna call it a night