naywhayare changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
sumedh_ has joined #mlpack
sumedhghaisas has quit [Ping timeout: 240 seconds]
< jenkins-mlpack>
Ryan Curtin: Fix #334 by ensuring vector accesses don't go out of bounds.
udit_s has quit [Ping timeout: 264 seconds]
udit_s has joined #mlpack
sumedh_ has quit [Quit: Leaving]
sumedhghaisas has joined #mlpack
govg has quit [Ping timeout: 252 seconds]
< sumedhghaisas>
naywhayare: hey ryan, you there??
govg has joined #mlpack
sumedhghaisas has quit [Ping timeout: 240 seconds]
udit_s has quit [Ping timeout: 245 seconds]
udit_s has joined #mlpack
udit_s has quit [Ping timeout: 240 seconds]
udit_s has joined #mlpack
govg has quit [Ping timeout: 272 seconds]
govg has joined #mlpack
govg has quit [Changing host]
govg has joined #mlpack
udit_s has quit [Quit: Leaving]
govg has quit [Ping timeout: 240 seconds]
govg has joined #mlpack
oldbeardo has joined #mlpack
< oldbeardo>
naywhayare: just sent you a mail
govg has quit [Quit: leaving]
< naywhayare>
oldbeardo: great! I will take a look shortly
< oldbeardo>
naywhayare: good thing I could get it done before your vacation :)
udit_s has joined #mlpack
andrewmw94 has joined #mlpack
Anand_ has joined #mlpack
< naywhayare>
udit_s: ok, so perceptrons
< naywhayare>
I'm looking at the API you wrote in your proposal
< udit_s>
I think I'll probably have to change it, though the main skeleton will remain the same. So, like always, it takes input and inputLabels.
< udit_s>
Make a weight vector matrix.
< naywhayare>
right; and in this case, input should be real-valued (not categorical) and inputLabels can only handle two classes; is that right?
< udit_s>
We could go for multi-class perceptron. And yeah, the input will be real-valued, like in decision_stumps.
< naywhayare>
it's your call on whether to do multi-class perceptron
< udit_s>
multiple classes can be handled quite easily too.
< udit_s>
just differently.
< naywhayare>
I'm trying to understand the differences between the binary perceptron and the multi-class perceptron now
< udit_s>
and because we're handling multiple classes in the decision stump, keeping that aspect consistent would be better.
< naywhayare>
right, I agree
< udit_s>
should I send you some links ?
< naywhayare>
no, I think I get it. the wikipedia article section, I think, is a bit confusing
< naywhayare>
my understanding is that the basic idea is to construct several perceptrons, each of which recognizes an individual class
< naywhayare>
then, to do classification, you run your input vector through all of these perceptrons, and whichever one outputs the highest prediction is taken to be the class label
< naywhayare>
does that seem about right?
< udit_s>
almost. Instead of multiple perceptrons, you just have multiple weight vectors, one for each class.
< naywhayare>
oh, ok. I see now
< naywhayare>
that seems like a straightforward generalization then
< udit_s>
yeah.
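(A minimal sketch of that classification step, assuming the per-class weight vectors are stored as columns of one Armadillo matrix; the function and variable names here are hypothetical, not mlpack's actual perceptron API.)

    #include <armadillo>

    // Score each class with its own weight vector (a column of `weights`)
    // and take the class with the highest score.
    size_t ClassifyPoint(const arma::mat& weights, // dimensions x numClasses
                         const arma::vec& point)   // dimensions x 1
    {
      arma::vec scores = weights.t() * point;
      arma::uword predictedClass;
      scores.max(predictedClass); // index of the largest score
      return (size_t) predictedClass;
    }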
Anand__ has joined #mlpack
Anand__ has quit [Client Quit]
Anand__ has joined #mlpack
< udit_s>
now,
< udit_s>
about our update criterion, and the bias ...
Anand_ has quit [Ping timeout: 246 seconds]
< udit_s>
while reading up, I came across different implementations,
< udit_s>
some use a bias weight with value 1.
< udit_s>
others don't, or ignore it.
< naywhayare>
it seems to me like maybe the bias should be a user parameter given in the constructor
< udit_s>
and I've also come across multiple update criteria.
< oldbeardo>
naywhayare: multilayer perceptron sounds like softmax regression
< udit_s>
will the user input be an option for bias, or the value of the bias?
< naywhayare>
oldbeardo: yes, but the focus here is single-layer perceptron :)
< naywhayare>
udit_s: I think the value of the bias; then if the user doesn't want bias, they just specify 0
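(A hypothetical constructor along those lines; only the idea of passing the bias as a value comes from the discussion, and the class and parameter names are illustrative.)

    #include <armadillo>

    class Perceptron
    {
     public:
      // `bias` is the value of the bias term; passing 0 disables it.
      Perceptron(const arma::mat& data,
                 const arma::Row<size_t>& labels,
                 const double bias = 1.0);
      // ...
    };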
< udit_s>
okay. then the update criterion. apparently, there are several update criteria, with no information on which one converges the earliest.
< naywhayare>
we can do this just like the AMF/NMF code then -- make it a template parameter, and implement one (or a few) update criteria
< naywhayare>
you have it written like that in your proposal
< udit_s>
okay.
< udit_s>
also, I'm a bit confused as to how the update will proceed.
< naywhayare>
assuming that each of these algorithms is iterative, then the update only needs to provide an updated weight vector
< naywhayare>
i.e. UpdateRule::Update(weights, ...) (where ... is whatever other parameters the update rule needs) should just take the existing weights vector and update its value using a single application of the update rule
< udit_s>
say a 'run' is going through the input set once. so in one run, for *each* input vector, you update the weight matrix if required, and then restart the run; am I correct? also, I think there should be a lower and upper limit to the number of runs.
< udit_s>
or should the perceptron stop only on convergence ?
< naywhayare>
a limit on the number of runs (or iterations) should be a parameter, yeah
< naywhayare>
I would write UpdateRule::Update() to perform the update for every input vector, not just one
< naywhayare>
because there may be some update algorithms that don't use that type of loop-over-every-input-vector approach (which is kind of like stochastic gradient descent)
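(A rough sketch of such an update policy, meant to be plugged in as a template parameter in the same spirit as the AMF/NMF code; the rule shown is the standard multi-class perceptron update, and the class and function names are hypothetical.)

    #include <armadillo>

    class SimpleWeightUpdate
    {
     public:
      // One sweep over the whole dataset: update the weight matrix in
      // place, but only on misclassified points.
      static void Update(arma::mat& weights,
                         const arma::mat& data,
                         const arma::Row<size_t>& labels)
      {
        for (size_t i = 0; i < data.n_cols; ++i)
        {
          arma::vec scores = weights.t() * data.col(i);
          arma::uword predicted;
          scores.max(predicted);

          if (predicted != labels[i])
          {
            // Reward the correct class's weight vector, penalize the
            // wrongly predicted one.
            weights.col(labels[i]) += data.col(i);
            weights.col(predicted) -= data.col(i);
          }
        }
      }
    };

A Perceptron<SimpleWeightUpdate> class could then call Update() once per iteration, up to the iteration limit.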
< naywhayare>
I'd like to grab some lunch... can we continue this in a few hours? (I think we've covered most everything?)
< udit_s>
okay, so update weights every time you go over an incorrect classification while training, in one run, and then repeat.
< naywhayare>
yeah, that seems reasonable to me
< udit_s>
yeah, I'll have dinner as well. I'll see if anything else comes up.
< naywhayare>
ok. I'll be back in about two hours
< udit_s>
okay.
Anand__ has quit [Ping timeout: 246 seconds]
< oldbeardo>
naywhayare: you saw the code?
udit_s has quit [Ping timeout: 255 seconds]
Anand_ has joined #mlpack
udit_s has joined #mlpack
< Anand_>
Marcus : I am adding metrics to weka today. I will try to follow the same design as scikit.
andrewmw94 has left #mlpack []
< marcus_zoq>
Anand_: Great, sounds like a plan.
< jenkins-mlpack>
Starting build #1947 for job mlpack - svn checkin test (previous build: STILL UNSTABLE -- last SUCCESS #1944 1 day 9 hr ago)
< Anand_>
Marcus : How will I get the predicted labels in weka? I do not find any function that predicts the class of an instance.
< Anand_>
Also, I will need to include the weka src path into the file to call weka functions. Right?
< marcus_zoq>
Anand_: And the function you need to use is called: 'distributionForInstance(Instance instance)'.
< Anand_>
'distributionForInstance(Instance instance)' returns the probabilities. It is not the actual classifier function like the predict function we used in scikit
< Anand_>
I need to use the classifyInstance for this
< Anand_>
Maybe I will add functions to NBC.java to return the probabilities and the predicted labels
< marcus_zoq>
Anand_: Right, to get the actual classes, it's line 86. So you need to save the results in a file to use them.
< marcus_zoq>
Anand_: I think there isn't another way. Because you can't use weka with python?
< marcus_zoq>
Anand_: You can compile the source code with 'make scripts WEKA_CLASSPATH=<location to the weka.jar file>'.
< Anand_>
So, I will need the weka jar?
< Anand_>
And where will the .class files be?
< marcus_zoq>
Anand_: methods/weka methods/weka/src/; I use the following command on the build server: make scripts WEKA_CLASSPATH=".:/opt/weka/weka-3-6-9:/opt/weka/weka-3-6-9/weka.jar"
< marcus_zoq>
Anand_: And right you need the weka.jar file.
< Anand_>
ok
Anand_ has quit [Ping timeout: 246 seconds]
sumedhghaisas has joined #mlpack
sumedhghaisas has quit [Client Quit]
sumedhghaisas has joined #mlpack
< sumedhghaisas>
naywhayare: hey ryan... you there?
< naywhayare>
sumedhghaisas: yes, I am here now
< sumedhghaisas>
the WH matrix and the original seem to differ... even though the residue is very small
< oldbeardo>
naywhayare: any feedback on the code?
< jenkins-mlpack>
Starting build #1948 for job mlpack - svn checkin test (previous build: STILL UNSTABLE -- last SUCCESS #1944 1 day 11 hr ago)
< naywhayare>
sumedhghaisas: in what way do they differ?
< naywhayare>
oldbeardo: I am solving a bug with the NMF tests first
< oldbeardo>
naywhayare: it would be great if you could give me feedback today, since you would be sort of unavailable for the next 10 days
< naywhayare>
oldbeardo: yes, it will be done before I leave, don't worry
< sumedhghaisas>
naywhayare: I mean the entries don't seem to match... should I paste the output here?
< oldbeardo>
naywhayare: okay, thanks
< sumedhghaisas>
naywhayare: you unavailable for next 10 days??
< naywhayare>
sumedhghaisas: show me the code you are using, not the output
< sumedhghaisas>
okay, I will just send you the code by mail... okay?
< naywhayare>
well, sort of; I will do my best to be unavailable. I sent you (and everyone) an email about it
< naywhayare>
sure, email is fine
< sumedhghaisas>
naywhayare: Okay I have sent you the amf_impl.hpp...
< sumedhghaisas>
by the way... What bug with NMF tests??
< naywhayare>
the tests are written poorly -- NMF isn't guaranteed to return a unique factorization
< naywhayare>
so I'm rewriting the tests to not check the individual values of the W and H matrices since they aren't guaranteed to be the same for two different runs of NMF
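(For illustration, a check of that form: given W and H from any run of the factorization, test the reconstruction error rather than the individual entries; the function name and tolerance here are arbitrary.)

    #include <armadillo>
    #include <cassert>

    void CheckReconstruction(const arma::mat& V,
                             const arma::mat& W,
                             const arma::mat& H)
    {
      // W and H are not unique across runs, but their product should still
      // be close to V, so test the relative reconstruction error instead.
      const double residue =
          arma::norm(V - W * H, "fro") / arma::norm(V, "fro");
      assert(residue < 0.1);
    }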
< naywhayare>
anyway, you sent me amf_impl.hpp, but where is the code that shows the entries don't seem to match?
< sumedhghaisas>
run the code; it will print some values at the end... basically test values... entries for the matrices V and WH...
< sumedhghaisas>
yes... this will print the real value and computed value...
< naywhayare>
the RMSE calculation seems okay, and it's definitely true that some of the WH values will be different than the V values
< naywhayare>
the residue can't take the test points into account, so the residue might be tiny but the difference between the V and WH values for the test points may be larger
< naywhayare>
how is the RMSE performance?
< sumedhghaisas>
RMSE is really bad...
< sumedhghaisas>
wait I will just paste the values here...
< sumedhghaisas>
2683 2192.77
< sumedhghaisas>
186 1829.49
< sumedhghaisas>
2102 3336.24
< sumedhghaisas>
1663 963.512
< sumedhghaisas>
595 2486.42
< sumedhghaisas>
1892 649.105
< sumedhghaisas>
3 0.602375
< sumedhghaisas>
4 1.40428
< sumedhghaisas>
1 1.93011
< sumedhghaisas>
1.11958e+06
< naywhayare>
you should be taking the sqrt of the RMSE, too, don't forget that part
< naywhayare>
I don't know what the values you printed mean
< sumedhghaisas>
the first column is actual entries and second is computed entries...
< sumedhghaisas>
the last value is RMSE...
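(For reference, the RMSE with the square root included might be computed like this; the argument names are hypothetical.)

    #include <armadillo>
    #include <cmath>

    double RMSE(const arma::vec& actual, const arma::vec& predicted)
    {
      // Mean of the squared errors, then the square root.
      return std::sqrt(arma::accu(arma::square(actual - predicted)) /
                       actual.n_elem);
    }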
< naywhayare>
I thought the movielens dataset was full of ratings from 1 to 5, but you are showing values as high as 2683
< naywhayare>
hang on, you are setting 'r_test(i, count) = temp' but you should do 'r_test(i, count) = V(i, temp)'
< naywhayare>
and then V(i, temp) = 0
< naywhayare>
you should be able to do this code entirely without the I matrix; I think that's the problem
< naywhayare>
instead of checking if(I(i, temp) == 1) you can just do if(V(i, temp) != 0)
< naywhayare>
remember that the I matrix is only representing which values of V are not zero -- which is information you can already get directly from V, so there's not much reason to maintain I at all
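(A sketch of that approach, reusing the chat's i/temp names and inventing the rest: the held-out rating is read straight out of V and then zeroed there, so no indicator matrix I is needed.)

    #include <armadillo>

    void ExtractTestRatings(arma::mat& V,
                            const arma::umat& testIndices, // 2 x n: (i, temp)
                            arma::vec& testRatings)
    {
      testRatings.set_size(testIndices.n_cols);
      for (size_t count = 0; count < testIndices.n_cols; ++count)
      {
        const arma::uword i = testIndices(0, count);
        const arma::uword temp = testIndices(1, count);

        // A nonzero entry of V already tells us the rating exists.
        if (V(i, temp) != 0)
        {
          testRatings[count] = V(i, temp); // keep the true rating for testing...
          V(i, temp) = 0;                  // ...and remove it from the training matrix.
        }
      }
    }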
< sumedhghaisas>
Ohh, for testing I have just passed I into the update rule...
< sumedhghaisas>
I think there is some problem in my main... hang on...
< sumedhghaisas>
naywhayare: ohh my god... I am using the raw csv file directly as the matrix... I have to compute the matrix from the raw csv, right?
< sumedhghaisas>
I did that for GroupLens...
< naywhayare>
yeah, take a look at how it is done in cf_test.cpp
< sumedhghaisas>
yeah I am using that code only...
udit_s has quit [Read error: Connection reset by peer]
< jenkins-mlpack>
saxena.udit: Fixed armadillo issues, along with removing uninitialized and unused variables
< naywhayare>
ok, if you are using that code, then that should be fine
< naywhayare>
but either way, I don't think you need to have an I matrix at all, and I think that is the source of your problems
< sumedhghaisas>
no... I compute the matrix and store it for later use... I forgot to do that for MovieLens... I will compute it now...
sumedhghaisas has quit [Ping timeout: 264 seconds]
udit_s has joined #mlpack
< naywhayare>
udit_s: thanks for fixing the build :)
< udit_s>
Awesome.
< udit_s>
:)
udit_s has quit [Quit: Leaving]
< oldbeardo_>
naywhayare: you there?
< naywhayare>
oldbeardo_: yeah, I was going to leave in a few minutes, but I'm here for now
< naywhayare>
I can hang around as long as you need (well... for a few hours :))
< oldbeardo_>
heh, it won't take that long :)
< oldbeardo_>
so, I saw your mail, the part where you talk about ExtractSVD(), it's not test code, that is how it is in the algorithm
< oldbeardo_>
also about the columns > rows part, I agree
< naywhayare>
oh... I see, ok. I misunderstood the algorithm
< naywhayare>
so you build the CosineTree to basically get a smaller basis
< naywhayare>
then run actual SVD on the smaller basis
< oldbeardo_>
yes, that's right
< naywhayare>
you should include a flag that allows the user to specify whether or not to use arma::svd() or arma::svd_econ(), then
< naywhayare>
because sometimes arma::svd() can be slow, but svd_econ() produces approximate results in much less time (if I remember right)
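(A minimal sketch of such a flag; only the ExtractSVD name comes from the discussion, the signature and flag name are hypothetical.)

    #include <armadillo>

    // Let the caller choose between the full and the economical SVD of the
    // smaller basis.
    void ExtractSVD(const arma::mat& basis,
                    arma::mat& U, arma::vec& s, arma::mat& V,
                    const bool useEconSvd = false)
    {
      if (useEconSvd)
        arma::svd_econ(U, s, V, basis); // faster; reduced-size factors
      else
        arma::svd(U, s, V, basis);      // full decomposition
    }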
< oldbeardo_>
okay, will do that, otherwise the new code looks fine right?
< naywhayare>
seems fine to me, as long as it passes the tests
< naywhayare>
I don't dig into code too hard until we have tests that it passes; then I can start trying to make simple speed modifications, avoiding temporaries, etc.
< oldbeardo_>
right, will you be available tomorrow and the day after?