#mlpack on 2016-06-29 — irc logs at libera.irclog.whitequark.org

2015-01-15 23:05 verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/

01:07 tham has joined #mlpack

02:09 Mathnerd314_ is now known as Mathnerd314

04:35 kwikadi has quit [Remote host closed the connection]

04:40 kwikadi has joined #mlpack

05:00 < lozhnikov> marcosirc: You're right, thanks. I'll try to do that.

06:07 nilay has joined #mlpack

06:31 Mathnerd314 has quit [Ping timeout: 264 seconds]

07:01 mentekid has joined #mlpack

07:05 < mentekid> rcurtin: So everything got hashed to bucket 0. I would have never seen that... Cool, thanks :)

07:18 mentekid has quit [Ping timeout: 244 seconds]

08:18 mentekid has joined #mlpack

08:21 < mentekid> rcurtin: I think we should do the same thing in returnIndicesFromTables, right? There's a similar problem there I think

08:21 < mentekid> let me look at the code

08:24 tham has quit [Quit: Page closed]

09:05 < mentekid> rcurtin: I think there's still some bug in the LSH code, my tests crash... Here's a backtrace: http://pastebin.com/Psvadsgp

09:09 < mentekid> (I've added some markers every few lines of code to isolate the error)

09:21 < Karl_> rcurtin_: sorry for not getting back. I got stuck with other things yesterday. I think my kernel isn't proper... I get negative eigenvalues

09:44 < Karl_> zoq: if you want a beta tester let me know how to get the svd-pca code...

09:47 < Karl_> zoq: or was it just the normal pca method?

10:47 < zoq> Karl_: Thanks, I'll get back to you once it is finished.

11:27 < lozhnikov> marcosirc: rcurtin: I opened a PR that contains some changes proposed by Marcos Pividori (RectangleTree::NumDescendants() optimization).

12:04 sumedhghaisas has joined #mlpack

12:34 sumedhghaisas has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

12:40 < lozhnikov> mentekid: Hi, there is a segfault in LSHTest/NumTablesTest. Are you sure that you should use secondHashVectors[j] instead of secondHashVectors(i, j)? (lsh_search_impl.hpp:200 and 202)

12:45 mentekid has quit [Ping timeout: 246 seconds]

12:59 < lozhnikov> rcurtin: The error appears in e6bc4b4.

13:18 mentekid has joined #mlpack

13:38 Mathnerd314 has joined #mlpack

14:04 marcosirc has joined #mlpack

14:12 < marcosirc> lozhnikov: great, thanks.

14:15 < rcurtin> lozhnikov: marcosirc: odd, I tested it on my system, I guess I did not run valgrind and now I pay the price :)

14:16 nilay_ has joined #mlpack

14:19 < mentekid> rcurtin: I fixed what lozhnikov but I still get a segmentation fault at LSHTrainTest

14:19 < mentekid> the other tests seem to run fine :/

14:23 < mentekid> actually... In Train(), shouldn't secondHashTable be cleared when Train is called?

14:28 < rcurtin> mentekid: I'm an idiot, I have the fix, hang on

14:30 nilay_ has quit [Ping timeout: 250 seconds]

14:44 < rcurtin> actually, I don't quite have the fix, this is more complex than I thought

15:07 < rcurtin> okay, fixed in eea2aa4, sorry for the issue

15:21 < mentekid> ah thanks :) I'll finish the style changes and push the final multiprobe tests

15:21 < mentekid> sorry multiprobe changes*

15:29 < rcurtin> sounds good

15:39 travis-ci has joined #mlpack

15:39 < travis-ci> mlpack/mlpack#1112 (master - eea2aa4 : Ryan Curtin): The build is still failing.

15:39 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/e6bc4b41704e...eea2aa43b9b9

15:39 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/141105382

15:39 travis-ci has left #mlpack []

16:01 < rcurtin> Karl_: no worries, if you can show the code for the kernel, I can take a glance and see if I see anything wrong

16:08 mentekid has quit [Ping timeout: 264 seconds]

16:35 travis-ci has joined #mlpack

16:35 < travis-ci> mlpack/mlpack#1115 (master - eaa7182 : Ryan Curtin): The build was broken.

16:35 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/809ed4bf33ce...eaa7182ebed8

16:35 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/141120814

16:35 travis-ci has left #mlpack []

16:49 nilay_ has joined #mlpack

16:53 mentekid has joined #mlpack

17:13 sumedhghaisas has joined #mlpack

17:14 < sumedhghaisas> @marcosirc: Hey marcos...

17:16 < marcosirc> sumedhghaisas: Hi!

17:18 < sumedhghaisas> sorry about the delay...

17:18 < sumedhghaisas> I read through the paper paper... you are right

17:18 < sumedhghaisas> Will spill trees we cannot guarantee the error...

17:19 < sumedhghaisas> But I guess Ryan is right...

17:20 < sumedhghaisas> Considering the popularity of Spill trees... I think we should implement it...

17:20 < sumedhghaisas> we need to decide on the implementation...

17:22 < marcosirc> Yeah, I agree.

17:22 < sumedhghaisas> do you think we should implement a separate command line for defeatist search??

17:23 < marcosirc> Ok, I have been thinking on the implementation.

17:23 < marcosirc> Mm I don't think we should implement it as a separate command line program.

17:24 < marcosirc> Maybe we can include it as a flag to the main mlpack_knn program...

17:24 < marcosirc> It would be clearer this way, I think.. For benchmarks, etc.

17:25 < marcosirc> we could print an error if epsilon value is specified for spill trees...

17:27 < sumedhghaisas> hmmm...

17:27 < marcosirc> But I don't have a strong preference... maybe we can start working implementing spill trees

17:27 < marcosirc> and onces it is ready, we decided.

17:28 < sumedhghaisas> flag does sound a viable option to me...

17:28 < sumedhghaisas> yeah I agree...

17:28 < marcosirc> yeah, maybe it will be confussing...

17:28 < sumedhghaisas> We can also decide when spill tree implementation is ready...

17:29 < marcosirc> Ok.

17:29 < marcosirc> Regarding spill trees implementation.

17:29 < marcosirc> I think it will be similar to binary space trees.

17:30 < marcosirc> However, we need to manage the list of points differently. We are going to have overlapping nodes, so we can not use range of indexes of the main dataset's matrix as we do with binary trees.

17:31 < marcosirc> I am thinking of having a general dataset instance (as we do with binary trees), and leaf nodes will hold a vector of indexes pointing to columns of that matrix.

17:32 < marcosirc> (This is what I mentioned in the last email)

17:32 < marcosirc> I think this will be the simplest/most efficient approach.

17:34 < sumedhghaisas> yes... it does look simple...

17:34 < sumedhghaisas> give me some time to think on it...

17:35 < marcosirc> ok, sure!

17:56 < rcurtin> marcosirc: I agree, I think vector of indices is the easiest way to go here

17:57 sumedhghaisas has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

17:58 < marcosirc> rcurtin: ok, thanks!

18:15 sumedhghaisas has joined #mlpack

18:36 sumedhghaisas has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

18:36 sumedhghaisas has joined #mlpack

18:50 mentekid has quit [Ping timeout: 258 seconds]

18:56 mentekid has joined #mlpack

19:05 < nilay_> zoq: Hello, in the forward pass of bias unit, why do we add input?

19:05 sumedhghaisas has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

19:06 < zoq> nilay_: Do you mean the forward pass of the BiasLayer?

19:06 < nilay_> yes, bias unit is not connected to any input?

19:08 < zoq> It is connected to all units in the next layer.

19:09 < zoq> So, first we call the forward pass of e.g. the LinearLayer and afterwards we use the input from the LinearLayer and add the bias term.

19:09 < zoq> The input in the Forward function is the output of the e.g. LinearLayer

19:09 < zoq> or in general the layer before the bias layer

19:13 < zoq> ah, the bias unit isn't connected with any input units, it's only connected with the units in the following layer

19:15 < nilay_> yes thats why i asked, or is it some convention

19:15 < zoq> That the bias isn't connected with the input?

19:16 < zoq> here is an example: http://ufldl.stanford.edu/wiki/images/thumb/9/99/Network331.png/400px-Network331.png

19:16 < zoq> you can also integrate the bias term into the linear layer

19:17 < nilay_> ok now i understand it is the total output

19:17 < nilay_> bias layer is wrapped over the linearlayer

19:18 < zoq> yes or any other layer

19:18 < nilay_> yeah ok, thanks.

19:19 < zoq> It's uncommon to use a bias term in combination with a convolution layer.

19:20 < nilay_> it is used in the vanilla network though

19:21 < nilay_> convolutional_network_test

19:23 < zoq> yeah, maybe I should say it's uncommon for very deep network :)

19:23 < zoq> *networks

19:25 < nilay_> so i should not put it in the inception layer?

19:27 < zoq> You can do that, but e.g. if the user sets the bias to 0 you can avoid the bias term operation?

19:28 < nilay_> if user sets to zero then output = input.

19:28 < zoq> yes

19:29 < nilay_> is there a reason to not use bias? because bias are useful

19:29 < nilay_> in deep networks

19:31 < zoq> Performance reasons, it's always challenging to figure out how the network should look like for a certain task.

19:33 < nilay_> ok.

19:34 < zoq> Btw. I tested another approach (quic svd) for the pca method, in some cases it looks promising.

19:35 < nilay_> i tried understanding the math of randomized svd, i read a blog but then it referred to a paper of 74 pages :P

19:36 < zoq> FINDING STRUCTURE WITH RANDOMNESS: PROBABILISTIC ALGORITHMS FOR CONSTRUCTING APPROXIMATE MATRIX DECOMPOSITIONS

19:36 < zoq> yeah, right :)

19:36 < nilay_> so did you read it, before implementing this thing, its a lot

19:36 < zoq> the QUIC-SVD paper is shorter: http://www.cc.gatech.edu/~isbell/papers/isbell-quicsvd-nips-2008.pdf

19:38 < zoq> I skipped the proofs :)

19:38 < nilay_> so did you get the idea of the error this(randomized svd) technique has compared to normal svd

19:41 < zoq> By reading some other realted papers. Right, now I'm not sure if I do something wrong, it looks like the QUIC-SVD method doesn't work if m=n

19:55 < nilay_> so do we need to integrate r-svd with PCA::Apply or replace it. (if the error is less we might as well replace it?)

20:00 < nilay_> or we still take components according to eigVal so it is correct always

20:01 < zoq> I think what we could do here is to change the PCA method and let the user define which method he likes to use, right now we use exact svd, randomized svd is just an approximation. In case of edge boxes an approximation is totally fine.

20:02 < nilay_> yes what i don't get is what do we lose by doing randomized svd as compared to when we do normal svd.

20:03 < zoq> precision, in case of randomized svd, we just use parts of the full data matrix.

20:04 < zoq> Probably I can work out a proof of concept ... perhaps in the next hours

20:05 < zoq> I think in that case I'll have to figure out why the quic svd method only works when m < n.

20:08 < zoq> maybe rcurtin can provide any insight?

20:09 < rcurtin> hm, it has been a while since I thought about it

20:09 < rcurtin> in this case, m is the number of returned eigenvectors, and n is the number of dimensions in the dataset?

20:10 < rcurtin> ah I guess the matrix being decomposed is m x n

20:10 < zoq> yeah, right

20:10 < rcurtin> but the first paragraph of the paper says quic-svd works for m >= n, but not m < n

20:10 < zoq> right

20:11 < rcurtin> when m < n, we can just transpose the matrix and then once the SVD is done, we switch V and U

20:11 < rcurtin> so I think maybe I don't understand what the issue is

20:11 < rcurtin> maybe I am looking at the wrong part of the paper

20:12 < zoq> maybe I used the wrong dimension, not sure right now, but I used m=n and it didn't work

20:13 < rcurtin> hm, hang on, let me take a look at the code

20:14 < rcurtin> what happens if you change quic_svd_impl.hpp:29 to be >= instead of just > ?

20:14 < zoq> it's not urgent, there is another bug in my randomized svd implementation ...

20:15 < zoq> I think I already tested >=, let's check again

20:15 < rcurtin> yeah, if that does not work, can you open a bug on github?

20:15 < rcurtin> if you want you could assign it to siddharth, but I don't know if he will be able to do anything, I am not sure how much time he has

20:15 < rcurtin> I dunno if he'll even see an email, I haven't heard from him in a while :)

20:15 < rcurtin> but I can take a look into it when I have some time (maybe a week or two, maybe more?)

20:17 < zoq> :) I'll open a bug if I get too frustrated with the code.

20:18 < rcurtin> yeah; the primary quic-svd code is in core/tree/cosine_tree/, not in methods/quic_svd/

21:31 nilay_ has quit [Ping timeout: 250 seconds]

21:39 marcosirc has quit [Quit: WeeChat 1.4]

22:23 mentekid has quit [Ping timeout: 272 seconds]