#mlpack on 2014-06-05 — irc logs at libera.irclog.whitequark.org

2014-05-21 16:24 naywhayare changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/

05:38 < jenkins-mlpack> Yippie, build fixed!

05:38 < jenkins-mlpack> Project mlpack - nightly matrix build build #476: FIXED in 1 hr 37 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20nightly%20matrix%20build/476/

05:38 < jenkins-mlpack> * andrewmw94: add the missed files.

05:38 < jenkins-mlpack> * andrewmw94: a few miscellanious small changes. Added to CMake.

06:30 udit_s has joined #mlpack

07:20 cuphrody has quit []

08:40 udit_s has quit [Read error: Connection reset by peer]

09:07 udit_s has joined #mlpack

12:00 govg has joined #mlpack

12:00 govg has quit [Changing host]

12:00 govg has joined #mlpack

12:13 govg has quit [Ping timeout: 265 seconds]

12:19 govg has joined #mlpack

12:19 govg has quit [Changing host]

12:19 govg has joined #mlpack

12:25 andrewmw94 has joined #mlpack

12:26 govg has quit [Ping timeout: 265 seconds]

12:48 sumedhghaisas has joined #mlpack

13:33 Anand has joined #mlpack

13:44 < Anand> Marcus : I have made some changes. Have a look now and let me know.

14:03 udit_s has quit [Ping timeout: 245 seconds]

14:04 < naywhayare> andrewmw94: sumedhghaisas: sorry about all the Jenkins emails... they should have stopped after last night, and now you should only get emails if your commit breaks something

14:04 < naywhayare> I'm still not sure how Jenkins got your correct emails

14:05 < sumedhghaisas> yeah its fine. :)

14:06 < sumedhghaisas> isnt jenkins connected to my account?? cause I have my email there...

14:06 < naywhayare> jenkins is separate from trac, and it manages user accounts separately

14:06 < naywhayare> I couldn't find a way to integrate the two

14:06 < sumedhghaisas> ohhh....

14:15 udit_s has joined #mlpack

14:31 Anand has quit [Ping timeout: 240 seconds]

15:39 Anand has joined #mlpack

15:39 < andrewmw94> naywhayare: is it ok if I change some things in the allknn_main.cpp file, such as requiring the leafSize to be greater than 0 rather than greater than or equal to 0?

15:44 < naywhayare> andrewmw94: sure, I wonder why the check was >= 0...

15:45 < andrewmw94> yeah, I wanted to check it doesn't use that to represent the default value or something

15:47 < naywhayare> as far as I know it doesn't

15:50 < andrewmw94> also, looking through the code, it appears that the single tree traverser won't work correctly if all of the points are in the root node (ie. there is only one node)

15:51 < andrewmw94> obviously using a tree with so few points is foolish but someone could try to do it

15:56 < naywhayare> I think you're right; probably worth adding a check at the beginning of the traverser or something

15:57 < naywhayare> but I wouldn't worry too much about it... that is pretty unstable code and will probably be overhauled soon, and then overhauled again, and then overhauled again...

16:55 < marcus_zoq> Anand: The code looks good, currently you are writing the tests?

16:57 < Anand> Ok, great! Yes, I am writing the tests. Will push them by tomorrow. I was just doing a dry run for the new code manually to make sure things work fine

16:57 < marcus_zoq> Anand: Sounds like a good plan :)

16:57 < Anand> :)

17:21 govg has joined #mlpack

17:28 < jenkins-mlpack> Starting build #1934 for job mlpack - svn checkin test (previous build: FIXED)

17:34 Anand has quit [Ping timeout: 240 seconds]

17:53 sumedhghaisas has quit [Ping timeout: 240 seconds]

18:01 < jenkins-mlpack> Project mlpack - svn checkin test build #1934: SUCCESS in 33 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1934/

18:01 < jenkins-mlpack> andrewmw94: require kd-trees to have leafSize of at least 1. Add an assert to ensure that the SingleTreeTraverser isn't called on a tree with only one node.

18:21 < andrewmw94> naywhayare: In the BSP tree's singleTreeTraversal code, a fixed number of children, so when you descend you can just get the score for each child. However, with the R tree, you don't know how many children you will have. I think the most sensible way to descend the tree, at least of the obvious methods, is to get the score for each child node, then sort this list, then descend the tree, testing to see if you can prune each sibling b

18:25 < naywhayare> message clipped at "prune each sibling b"

18:25 < naywhayare> but guessing at what you might have written, I think your strategy is reasonable

18:25 < naywhayare> as long as the number of children doesn't get too large, the sorting of the scores won't take too long

18:26 < naywhayare> I think that std::sort is guaranteed to be an O(n log n) sort... but I'm not sure, let's see...

18:26 < naywhayare> (arma::sort would probably be what you want to use, but I think it uses std::sort)

18:28 < naywhayare> (actually, you'd want to use arma::sort_index(), and it does use std::sort, which as of C++11 is required to be worst-case O(n log n), but for C++03 just needs to be expected O(n log n))

18:30 govg has quit [Ping timeout: 240 seconds]

18:39 < andrewmw94> I'm confused.

18:39 < andrewmw94> Wouldn't we want to sort the distances, not the arma::*

18:39 < andrewmw94> objects

18:40 < andrewmw94> doesn't score return the minimum distance from the query point to the MBR (assuming nearest neighbors)

18:50 sumedhghaisas has joined #mlpack

18:53 < sumedhghaisas> naywhayare: residue in NMF is calculated as...

18:53 < sumedhghaisas> WH = W * H;

18:53 < sumedhghaisas> norm = sqrt(accu(WH % WH) / nm);

18:53 < sumedhghaisas> if (iteration != 0)

18:53 < sumedhghaisas> {

18:53 < sumedhghaisas> residue = fabs(normOld - norm);

18:53 < sumedhghaisas> residue /= normOld;

18:53 < sumedhghaisas> }

18:54 < sumedhghaisas> this is failing when the matrix contains negative entries....

18:55 < sumedhghaisas> exactly how does this work?? is this same as normal RMSE calculation??

19:08 < naywhayare> andrewmw94: sorry, I meant, sort the output of Score(), but store it in an arma::vec before sorting for convenience

19:09 < naywhayare> that's a normal RMSE calculation, but NMF will never converge if you are trying to decompose an input matrix that has negative entries

19:09 < naywhayare> sumedhghaisas: sorry, forgot to address the above message to you

19:10 < sumedhghaisas> okay... But I am using SVD...

19:11 < naywhayare> okay, so what is happening? how is it failing?>

19:11 < sumedhghaisas> what is that acccu function??

19:11 < naywhayare> equivalent to sum(sum(WH % WH))

19:11 < naywhayare> accu(WH % WH) returns the sum of the squared elements of WH

19:11 < sumedhghaisas> umm... its giving normal results for positive matrices...

19:12 < sumedhghaisas> but not with negative entries...

19:12 < sumedhghaisas> okay... there must be some error in my code... I will check again...

19:12 < naywhayare> yeah, that RMSE calculation looks fine to me

19:18 < udit_s> naywhayare: hey !

19:18 < naywhayare> udit_s: hello there

19:19 < udit_s> I've just finished the decision stump...

19:19 < naywhayare> okay, good to hear

19:20 < udit_s> Was hoping you could have a look at it - I'll start optimizations now and documentation tomorrow.

19:20 < naywhayare> sure... do you mind if I wait until after my paper deadline and look at it this weekend?

19:20 < naywhayare> 27.5 hours left...

19:21 < udit_s> sure, though I will be joining you on Saturday quite late...

19:21 < naywhayare> okay, sounds good

19:21 < naywhayare> next week I will be a much better mentor than I have been...

19:22 < udit_s> it's actually fine. no really. :)

19:23 < udit_s> I have implemented the bucket binning we talked about (conceptually) - sorting and then gathering into buckets- ranges and all.

19:23 < udit_s> assuming completely non-categorical data

19:24 < naywhayare> great! I'm glad to hear it worked out

19:24 < udit_s> anyways I'll brief you in a mail about the features, and everything else I've modfied.

19:24 < udit_s> all the best for your paper !

19:24 < naywhayare> ok, sounds good; thanks!

19:54 udit_s has quit [Quit: Leaving]

21:32 < sumedhghaisas> naywhayare: Finally ... BatchSVD is working ...

21:33 < sumedhghaisas> But its too sensitive to the learning parameter...

21:34 < sumedhghaisas> Probably IncrementalSVD will be better...

21:35 < sumedhghaisas> NMFALS is giving residue of 9 * e-11

21:35 < sumedhghaisas> BatchSVD is giving 6 * e-9...

21:35 < sumedhghaisas> not that bad... advantage would be BatchSVD can also decompose matrix with negative entries...

21:36 < sumedhghaisas> and regularization can be applied....

21:56 < naywhayare> sounds good to me; how do the runtimes compare?

21:57 < sumedhghaisas> umm... SVDBatch more time... can be justified as its a gradient descent variant... BatchWithMomentum will improve that time...

21:57 < naywhayare> okay

21:57 < sumedhghaisas> Should I keep BatchSVD as an update rule??

21:58 < sumedhghaisas> Maybe I will add momentum to it... So BatchSVD will be its special case with momentum equals zero...

21:59 < naywhayare> that sounds like a good idea

21:59 < sumedhghaisas> okay... I should get t it... Paper finished??

21:59 < naywhayare> not even close

22:00 < sumedhghaisas> :(

22:00 < naywhayare> yeah...

22:00 < sumedhghaisas> proofs over??

22:00 < naywhayare> I think I have the proofs correct; I just have to run the numerical experiments...

22:02 sumedh_ has joined #mlpack

22:05 sumedhghaisas has quit [Ping timeout: 252 seconds]