naywhayare changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
< jenkins-mlpack> Project mlpack - svn checkin test build #1953: SUCCESS in 1 hr 17 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1953/
< jenkins-mlpack> * Ryan Curtin: Remove backup file from emacs or some other inferior editor that isn't vim.
< jenkins-mlpack> * Ryan Curtin: Significant refactoring of NMF tests. The sparse tests were generally invalid
< jenkins-mlpack> because NMF is not guaranteed to produce a unique decomposition. Instead, now
< jenkins-mlpack> we test the Frobenius norm. Also, the tolerances needed to be significantly
< jenkins-mlpack> adjusted because of the new convergence criteria in the AMF class. Lastly, the
< jenkins-mlpack> SparseNMFRandomDiv test and SparseNMFDefaultTest have been removed because those
< jenkins-mlpack> update rules seem to often result in NaNs when sparse matrices are used as
< jenkins-mlpack> input.
< jenkins-mlpack> * Ryan Curtin: Make a note that this set of update rules often creates lots of NaNs when sparse
< jenkins-mlpack> matrices are used.
< jenkins-mlpack> Starting build #1954 for job mlpack - svn checkin test (previous build: SUCCESS)
< jenkins-mlpack> Project mlpack - svn checkin test build #1954: SUCCESS in 1 hr 12 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1954/
< jenkins-mlpack> Ryan Curtin: First pass: comment standardization, fix header guard names, move .cpp to .hpp
< jenkins-mlpack> because it's all templated functions.
shiva3791 has joined #mlpack
< shiva3791> Hi. I have a question. Can I use LogisticRegression with sparse matrices?
govg has quit [Ping timeout: 244 seconds]
govg has joined #mlpack
govg has quit [Client Quit]
< naywhayare> shiva3791: currently it is not possible to do that, but I don't think it would be too hard to refactor the LogisticRegression class to also take arma::sp_mat (sparse matrices)
< jenkins-mlpack> Starting build #1955 for job mlpack - svn checkin test (previous build: SUCCESS)
udit_s has joined #mlpack
< jenkins-mlpack> Project mlpack - svn checkin test build #1955: SUCCESS in 1 hr 12 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1955/
< jenkins-mlpack> Ryan Curtin: Use bool instead of int.
udit_s has quit [Quit: Leaving]
< jenkins-mlpack> Starting build #1956 for job mlpack - svn checkin test (previous build: SUCCESS)
sumedh__ has joined #mlpack
sumedh_ has joined #mlpack
sumedh__ has quit [Ping timeout: 272 seconds]
sumedh_ has quit [Quit: Leaving]
< jenkins-mlpack> Project mlpack - svn checkin test build #1956: SUCCESS in 1 hr 14 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1956/
< jenkins-mlpack> saxena.udit: Perceptron Added
Anand has joined #mlpack
shiva3791 has quit [Ping timeout: 246 seconds]
Anand has quit [Ping timeout: 246 seconds]
govg has joined #mlpack
andrewmw94 has joined #mlpack
govg has quit [Quit: leaving]
Anand has joined #mlpack
oldbeardo has joined #mlpack
< Anand> Marcus : I have added the base py file for logistic regression. I see that the code produces a parameters.csv. What exactly does that file contain?
< oldbeardo> naywhayare: please reply to my mails whenever you get time
< naywhayare> oldbeardo: please be patient. I am doing my best to catch up and I have not forgotten your emails
oldbeardo has quit [Ping timeout: 246 seconds]
oldbeardo has joined #mlpack
< oldbeardo> naywhayare: okay, was just reminding, I'll be patient :)
oldbeardo has quit [Quit: Page closed]
< andrewmw94> naywhayare: I have a version of the R tree that seems to be constructed correctly uploaded now. I'm going to get lunch soon, but I'd appreciate it if you could look through it sometime when you are free, paying attention to memory leaks (I keep thinking in terms of Java). Also, I'm wondering if you could help me with the method softDelete() in rectangle_tree_impl.hpp:111.
< andrewmw94> Basically, I want to use it to delete nodes that I made copies of, but I need to save some of the data such as the points and I don't want to delete the child nodes. I'll take out all of the debugging code I have in it once you read through it and make the requisite changes. No rush though. I'll try to write up the heuristics for X trees and R* trees while I wait
< jenkins-mlpack> Starting build #1957 for job mlpack - svn checkin test (previous build: SUCCESS)
< naywhayare> andrewmw94: have you written unit tests for the R tree construction?
Anand has quit [Ping timeout: 246 seconds]
Anand has joined #mlpack
sumedhghaisas has joined #mlpack
< sumedhghaisas> naywhayare: Hey ryan, you free??
sumedh_ has joined #mlpack
sumedhghaisas has quit [Ping timeout: 240 seconds]
< jenkins-mlpack> Project mlpack - svn checkin test build #1957: SUCCESS in 1 hr 14 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1957/
< jenkins-mlpack> * andrewmw94: bug fix. had n_rows when I needed n_cols
< jenkins-mlpack> * andrewmw94: bug fixes for memory leaks
< jenkins-mlpack> Starting build #1958 for job mlpack - svn checkin test (previous build: SUCCESS)
Anand has quit [Ping timeout: 246 seconds]
< andrewmw94> naywhayare: So I'm looking through the testing code (I haven't written the unit tests) and I don't see any "main" file where I would need to add the names of all the tests I want to run. If I add them to e.g. allknn_test.cpp, will they run automatically?
udit_s has joined #mlpack
< naywhayare> udit_s: your decision stump tests don't actually test anything; they write output to a file
< naywhayare> can you change them so they use BOOST_REQUIRE_* and BOOST_CHECK_* to test things automatically?
< naywhayare> andrewmw94: write a new file, rectangle_tree_test.cpp, which is like allknn_test.cpp
< naywhayare> add the file to the CMakeLists.txt file in the tests/ directory
< naywhayare> then it will get built into the mlpack_test executable
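Concretely, the two steps naywhayare describes amount to something like the following in the tests/ CMakeLists.txt (the exact shape of the file may differ between mlpack revisions, so treat this as a sketch):

```cmake
# Add the new test file to the sources compiled into the mlpack_test
# executable.
add_executable(mlpack_test
  # ... existing *_test.cpp files ...
  rectangle_tree_test.cpp
  # ... existing *_test.cpp files ...
)
```

After rebuilding, the new suite can be run on its own with bin/mlpack_test -t RectangleTreeTest.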
< udit_s> I did when I committed today.
< udit_s> naywhayare: I pushed code today. I had that fixed.
< udit_s> naywhayare: or let me check that.
< naywhayare> udit_s: I don't see any changes to the decision stump test, maybe you forgot to commit that one?
< naywhayare> andrewmw94: anyway, supposing you name your test suite RectangleTreeTest, you can then run those tests with 'bin/mlpack_test -t RectangleTreeTest'
< udit_s> naywhayare: oops. yeah, I did. let me just get that.
< naywhayare> sure, sounds good
< sumedh_> naywhayare: did you find any bug in SVDBatch code??
< naywhayare> sumedh_: I am trying to catch up from last week; I have not been able to do that yet
< sumedh_> ohh okay... Cause I checked again and again... still 1.8 :(
< sumedh_> naywhayare: Can we go ahead and write a test with 1.8?? the performance is better than current NMF... accuracy-wise at least... but taking a longer time...
< andrewmw94> naywhayare: should I put all of them there or should I put the traits in tree_traits_test.cpp?
< naywhayare> sumedh_: have you tried any other datasets?
< naywhayare> andrewmw94: yes, put all of the tests in a new file, something like rectangle_tree_test.cpp
< naywhayare> tree_test.cpp is really long and should probably be split into a few files, so it's probably better to not make it even longer
< sumedh_> naywhayare: yeah... GroupLens100k... But it's not there in the paper.... they have used subsets of MovieLens1M
< naywhayare> okay; what about the netflix dataset?
< udit_s> naywhayare: done.
< naywhayare> udit_s: thanks
< udit_s> naywhayare: also, about the perceptron. You'll see I've left the update weights as a template, which can be extended further as required. I guess that's fine. I was having trouble getting the gradient descent to converge.
< udit_s> naywhayare: so, I'll start working on the adaboost aspect now ? Reading up and designing ?
< sumedh_> naywhayare: netflix dataset?? It's too big... MovieLens1M contains 6000 users.. the netflix dataset contains 480,000 users and 20,000 movies... I don't think I have enough computation power to process that dataset :(
< udit_s> naywhayare: and I also wanted to talk about the mid term evaluation.
< naywhayare> sumedh_: ok
< naywhayare> sumedh_: I will try and get to the RMSE issue soon. like I said (and as you can see) I am completely overloaded right now
< naywhayare> udit_s: I saw your email about the perceptron, but I have not been able to respond
< naywhayare> since you're a little ahead of schedule, I wouldn't mind if you took a little time to implement another learning algorithm for the perceptron
< sumedh_> naywhayare: yeah I guess I will be up tonight... so you can ping any time you are free... After the tests for SVDBatch are done I will commit the SVDIncrementalLearning code :)
< naywhayare> sumedh_: can you take some time and add a template parameter to the AMF code, for the convergence criterion?
< naywhayare> currently, you're terminating when the change in residue is below tolerance
< naywhayare> but before, it terminated when the residue itself was below a tolerance
< naywhayare> we should allow users to choose either of these things (or the slightly weirder SVDBatch condition which is based on the validation RMSE)
< naywhayare> so a template parameter is probably the right thing to do here
< naywhayare> so the while() loop in the AMF::Apply() function should be something like 'while(!ConvergencePolicy::IsConverged(...))'
< sumedh_> naywhayare: yes... I wanted to discuss that... I have currently some working code for that...
< sumedh_> yes... sort of the same... I am using terminate function there...
< naywhayare> ok, that sounds good. probably as parameters the IsConverged() policy will need V, W, and H
< naywhayare> maybe more? not sure
< sumedh_> yes... you are right... V, W and H will suffice...
< naywhayare> okay. if you can spend a little time developing that abstraction and at least the two convergence policies we are already using, I'd appreciate that
< naywhayare> make sure it passes the tests before you check it in. I recently overhauled a lot of the NMF tests
< sumedh_> okay I will do that right away...
< jenkins-mlpack> Project mlpack - svn checkin test build #1958: SUCCESS in 1 hr 11 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1958/
< jenkins-mlpack> Ryan Curtin: Minor changes to test. const-correctness and comment normalization for Doxygen.
< jenkins-mlpack> Starting build #1959 for job mlpack - svn checkin test (previous build: SUCCESS)
udit_s has quit [Quit: Leaving]
< sumedh_> naywhayare: How do I run a specific test with mlpack_test... like nmf_test or cf_test?
< jenkins-mlpack> Project mlpack - svn checkin test build #1959: SUCCESS in 1 hr 12 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/1959/
< jenkins-mlpack> saxena.udit: Decision Stump test fixed
arcane has joined #mlpack
arcane has left #mlpack []
sumedh_ has quit [Ping timeout: 244 seconds]
< andrewmw94> sumedh_: you run bin/mlpack_test -t RectangleTreeTest
< andrewmw94> if RectangleTreeTest is the name of the test
< andrewmw94> the test names are defined inside the source files, though, not taken from the file names, so it would be something like NMFDefaultTest
< naywhayare> you can specify the test suite name and the individual test name
< naywhayare> so for the test suite NMFTest, you'd do mlpack_test -t NMFTest
< naywhayare> but for the individual test NMFDefaultTest which is in the suite NMFTest, you'd do mlpack_test -t NMFTest/NMFDefaultTest