#mlpack on 2014-07-09 — irc logs at libera.irclog.whitequark.org

2014-05-21 16:24 naywhayare changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/

01:05 sumedh_ has quit [Ping timeout: 240 seconds]

01:34 witness___ has quit [Quit: Connection closed for inactivity]

01:46 andrewmw94 has quit [Quit: Leaving.]

04:04 Anand has joined #mlpack

04:48 Anand has quit [Ping timeout: 246 seconds]

05:05 zGz|govg has quit [Ping timeout: 240 seconds]

05:07 zGz|govg has joined #mlpack

05:31 zGz|govg has quit [Ping timeout: 248 seconds]

05:32 witness___ has joined #mlpack

05:32 zGz|govg has joined #mlpack

08:12 Anand has joined #mlpack

08:31 < marcus_zoq> Anand: Hello, I think in line 32 in the 'LogisticRegression.java' file you should return the index.

08:46 < Anand> Marcus: I didn't get you. Are you talking about weka?

08:46 < Anand> Btw, I have fixed the error. Now all tests are succeeding

08:46 < Anand> I have also added Linear regression modified files for scikit and mlpack

08:46 < Anand> You can have a look

08:48 < marcus_zoq> Anand: I'm talking about the 'maxProb()' function the weka code. Right now the function return the max propb but we need the predicted class right?

08:52 < Anand> Ok, I got you. I have made the changes. But, will this always work?? I doubt

08:55 < marcus_zoq> Anand: It shoudl work, if you use the index of the max probability.

08:56 < Anand> But can we always say that the index of the prob will directly denote its class? What if indices are 0,1,2 but classes are 1,2,3?

08:58 < Anand> You go my point, right?

08:58 < Anand> *got

09:01 < marcus_zoq> Anand: Yes, good point, we need to map the predicted classes.

09:02 < Anand> But to map the classes, we need to the classes, first.

09:03 < Anand> And for this, I guess there is no other way but to parse the true labels file once and retrieve all the classes

09:03 < Anand> What do you say?

09:06 < marcus_zoq> Anand: We can iterate through the train set (weka instances) and extract the classes with getClass.

09:09 < marcus_zoq> Anand: Thats what we do in the mlpack nbc code.

09:09 < Anand> Ok. You mean data.instance(i).getClass() ?

09:13 < marcus_zoq> Anand: data.instance(i).classValue() http://weka.sourceforge.net/doc.dev/weka/core/Instance.html

09:14 < Anand> Ok. I will create the mapping.

09:14 < marcus_zoq> Anand: Okay, great!

09:15 < Anand> Meanwhile, please try to build again and have a look at the other code I mentioned, whenever you get time. :)

09:15 < marcus_zoq> Anand: Did you commit the mlpack linear regression?

09:17 < Anand> Yes, I did. I have some doubts regarding the structure there. We never changed the structure for mlpack.

09:18 < marcus_zoq> Anand: Okay I take a look in a few minutes.

09:18 < Anand> Ok, cool!

09:34 witness___ has quit [Quit: Connection closed for inactivity]

10:11 Anand has quit [Ping timeout: 246 seconds]

10:14 Anand has joined #mlpack

10:15 < Anand> Marcus : Added the class mapping too! Have a look

10:19 Anand has quit [Ping timeout: 246 seconds]

10:35 zGz|govg has quit [Ping timeout: 245 seconds]

12:37 andrewmw94 has joined #mlpack

13:24 < naywhayare> andrewmw94: ok, I did some thinking and I think I have an idea

13:25 < naywhayare> it involves... a bit of refactoring. but it's not impossible

13:25 < naywhayare> (at least, I think)

13:26 < naywhayare> so I think the assumption to make is that when we insert a new point into a tree, its given index should be (new number of points in tree - 1)

13:26 < naywhayare> we can have the Insert() function return a size_t with the index of the point, or, just make it clear what the index will be in the documentation

13:27 < naywhayare> for the R tree, then, each node can hold a std::vector<arma::vec*> or arma::mat or some structure that only holds the points that are held in that particular node

13:27 < naywhayare> and it can also hold a std::vector<size_t>, which holds the indices of each of those points

13:28 < naywhayare> then, we use return type overloading to provide both 'size_t Point(const size_t i)' and 'arma::vec& Point(const size_t i)'

13:28 < naywhayare> lastly, we change all the BaseCase() functions to have the signature BaseCase(const size_t queryIndex, const VecType& queryPoint, const size_t referenceIndex, const VecType& referencePoint)

13:29 < naywhayare> and then modify the traversers for the slightly modified BaseCase() signature

13:29 < naywhayare> what do you think of this idea? if you think it's terrible, that's okay :)

13:36 < andrewmw94> I'm not sure I understand it entirely, but it sounds very similar to what we have now

13:36 < andrewmw94> with the addition of holding both the arma::vec* and the size_t in the leaf nodes

13:37 < andrewmw94> I think we discussed this idea about a week ago

13:38 < andrewmw94> let me see if I can find the logs

13:39 < naywhayare> yeah, this is similar to a discussion we had about a week ago

13:39 < naywhayare> the only real thing I came up last night with is the modification to BaseCase() and how Insert() should work

13:39 < naywhayare> I have to walk to campus... I'll be back in about 15

13:40 < andrewmw94> ok

13:53 < naywhayare> alright, back

13:54 < andrewmw94> ok

13:54 < andrewmw94> So I think I can prove that it's not possible to store points the way you do it in Binary Space Trees and have insertion/deletion

13:54 < naywhayare> yes, you are probably right about that

13:55 < naywhayare> the key about the BinarySpaceTree point storage is just that points in a leaf are contiguous

13:55 < naywhayare> if you were to store each point in an R tree leaf in a single arma::mat object (or something like that), you'd end up with much the same effect

13:55 < naywhayare> unless I have overlooked something

13:56 < andrewmw94> yeah. That's what it does now. You just lose the contiguity

13:56 < naywhayare> well, it has contiguity at least for points in a single node

13:56 < andrewmw94> inserting points can be done at the end. Deleting points would just require swapping the point in question with the last point (in the matrix) and then updating the last point's index in the tree

13:57 < naywhayare> hm, I didn't think about what to do with the indices when a point is deleted

13:57 < andrewmw94> The R tree?

13:57 < naywhayare> yeah, I'm talking about the R tree

13:57 < andrewmw94> currently, at least, it doesn't have any contiguity. And I don't think it's possible to have contiguity in one matrix and have arbitrary insertions/deletions

13:58 < andrewmw94> we can get contiguity for about 15 points at a time by storing the matrices in the leaf nodes

13:58 < naywhayare> yeah, and that's probably good enough contiguity

13:58 < andrewmw94> (conveniently, the user can adjust that at will by changing the leaf size and minimumLeafFill)

13:59 < naywhayare> I think when a user calls Delete() it should remove the requested point, but not update any indices

13:59 < naywhayare> updating the indices could take forever...

13:59 < naywhayare> so I guess when a user calls Insert(), if they have deleted lots of things from the tree, the index of the inserted point may be much larger than the number of points currently in the tree

13:59 < andrewmw94> Well, that depends. Can I tell you my thought

13:59 < andrewmw94> thoughts*

13:59 < naywhayare> yeah, go for it

14:00 < andrewmw94> Ok, so I can think of two options.

14:00 < andrewmw94> 1)We can store an arma::mat in each leaf. This gives us contiguity for about 15 points at a time. However, it is unclear how you insert and delete points.

14:01 < andrewmw94> sorry, that should have been, "it is unclear how you index the points."

14:02 < andrewmw94> 2) We store all of the points in a central matrix, and each leaf stores indices to the points it holds. We have the cost of dereferecing, and if we can insert and delete points, we lose the ability to have the data matrix continuous

14:04 < andrewmw94> if it is not continuous, we can insert points at the end, and we can delete points by swapping the last point for the deleted point in the centeral matrix, and then finding that point in the tree and updating it (searching for a point you have an exact match for should be very efficient, so I think that's ok)

14:04 < andrewmw94> Can you think of any other ways you could theoretically do it?

14:04 < naywhayare> those are basically the two ways that I see

14:05 < naywhayare> for idea (1), I think indexing the points can be done by having each leaf hold a std::vector<size_t>

14:05 < andrewmw94> Also, as a side note, can I assume that there are never two points at the exact same location?

14:05 < naywhayare> no, that's probably not a good assumption, unfortunately

14:05 < naywhayare> there are some test datasets I use that can have duplicate points (I think the corel dataset does, actually)

14:06 < andrewmw94> hmm. ok. I still think searching for a point which you have an exact match of will be fairly quick.

14:06 < andrewmw94> but I'm not positive

14:07 < naywhayare> the only thing is that if you delete a point and change the index of another, the user has to have some way to know what point's index changed and what it changed to

14:07 < naywhayare> which is why I thought maybe it was better to just remove deleted points from the tree without modifying any indices

14:08 < naywhayare> and then when you insert a new point, the root node has some variable which is tracking the next index it should use

14:09 < andrewmw94> ahh, I didn't think of that. Why wouldn't the index of the next point always be the index of the column in the data matrix?

14:09 < andrewmw94> (last column/column where the point was added)

14:09 < naywhayare> that depends on whether you are using strategy (1) or (2)

14:10 < naywhayare> if you're using strategy (2), then yeah, the index of the last column in the data matrix works; but then, insertion will require allocation of an entire new data matrix and copying of the whole thing

14:10 < andrewmw94> well, I suspect that the best way to do it is to combine them. Strategy 1 has no obvious way to index points and allow arbitrary insertions/deletions

14:11 < andrewmw94> if by index we mean a way to map it back to a central matrix

14:11 < andrewmw94> but now I'm not sure if that's what you mena

14:11 < andrewmw94> mean*

14:11 < naywhayare> by index I don't necessarily mean a way to map it back to a central matrix

14:12 < naywhayare> which I mean is just some unique identifier for each point in the dataset, whether or not it is contiguous

14:12 < andrewmw94> ok. That should be equivalent. The central matrix that is used now isn't contiguous.

14:17 < andrewmw94> How much do you know about how arma::mat works? Do you know if a matrix stores each arma::vec?

14:18 < naywhayare> no; arma::mat stores a contiguous block of memory, not individual arma::vec objects

14:18 < naywhayare> I wrote a quick little program to demonstrate kind of what I have in mind when a user may be adding points to a tree

14:18 < naywhayare> http://pastebin.com/DHAiipGq

14:18 < andrewmw94> ahh. I think that would work too. Is there a way to get the memory address of an element of the matrix?

14:19 < naywhayare> I guess you could do mat.memptr() + row + mat.n_rows * col

14:19 < naywhayare> but that's a little hackish-seeming when armadillo is meant to provide a nice interface to matrices so you don't have to do pointer arithmetic...

14:21 < andrewmw94> yeah. My idea is that we could use the memory address of the first row of each point (vector) as an input to a Map to get the index out. But if we have to know the index in the matrix, that doesn't work.

14:21 < naywhayare> wouldn't just holding a std::vector<size_t> in each RTree node work?

14:22 < andrewmw94> I think it should but I thought there was a problem with it when I suggested it last week. I'm looking through the IRC logs.

14:24 < andrewmw94> 14:05 < andrewmw94> that should work. We discussed it before and I thought we agreed it would work but it would have bad memory locality

14:24 < andrewmw94> 14:05 < naywhayare> it could have bad memory locality, yeah

14:24 < andrewmw94> 14:05 < andrewmw94> perhaps storing both the vectors and the size_t?

14:24 < andrewmw94> 14:06 < andrewmw94> it's not too much memory and would solve the locality issue

14:24 < andrewmw94> 14:06 < naywhayare> that won't help, it has the same memory locality problem

14:24 < andrewmw94> 14:06 < naywhayare> because you'll have to store a pointer to the vector

14:24 < andrewmw94> 14:06 < andrewmw94> oh, I forgot the base case will use the index rather than the stored point

14:24 < andrewmw94> 14:06 < andrewmw94> duh

14:24 < andrewmw94> 14:06 < naywhayare> either that or you store the vector itself, and that takes a ton of space

14:24 < andrewmw94> 14:07 < naywhayare> the NeighborSearchRules abstractions are modifiable, if you have an idea that I don't, so don't rule that out as a possibility

14:24 < andrewmw94> 14:07 < naywhayare> the key being that it can still work with the other types of trees

14:24 < andrewmw94> 14:08 < naywhayare> unfortunately making all these things work together can get really difficult :-S

14:24 < andrewmw94> http://www.mlpack.org/irc/mlpack.20140701.html if you want more context

14:25 < naywhayare> ok, my memory is jogged

14:25 < andrewmw94> so if I understand your solution correctly, we do what I suggested and change the base case so that it takes both an index and a reference to arma::vec

14:25 < andrewmw94> That seems like it should work. Or am I missing something?

14:26 < naywhayare> yeah, I think that could work

14:26 < naywhayare> it looks like a week ago I said that exactly what I was proposing was a bad idea

14:26 < naywhayare> and now I'm saying exactly the opposite

14:26 udit has joined #mlpack

14:26 udit has quit [Client Quit]

14:27 < andrewmw94> haha. I'll mention it in my blog post :P

14:27 < naywhayare> and it also seems possible that I told you your idea had problems, and then a week later came back with the exact same idea

14:27 udit_s has joined #mlpack

14:27 < andrewmw94> Is it ok if I store the whole vector twice though?

14:28 < andrewmw94> matrix*

14:28 < naywhayare> hm, yeah, that is undesirable

14:28 < andrewmw94> I mean, if I'm using pointers to arma::vec, I might as well just store size_t right?

14:29 < naywhayare> I'm not sure what you mean, could you clarify?

14:29 < andrewmw94> so I'm going to store a vector<size_t> to map to the original indices

14:30 < andrewmw94> and then we also want to store a vector<arma::vec> or a vector<arma::vec*> to handle the points

14:30 < andrewmw94> but I don't think the vector<arma::vec*> is much better than just using the vector<size_t> to find the points

14:31 < naywhayare> I think you could just store an arma::mat instead of a vector<arma::vec> or vector<arma::vec*>, and resize it as necessary

14:31 < naywhayare> I was also struck with another idea

14:32 < naywhayare> suppose we created another matrix class and shoved it into the Armadillo namespace 'cause why not... we could call it 'arma::split_mat' or something, and it wouldn't hold contiguous memory, but instead a vector of vectors

14:32 < naywhayare> this means insert_cols() is O(1) (amortized) and so is shed_cols()

14:33 < naywhayare> this also means we get to have just one matrix object, and we don't have to modify our abstractions

14:34 < naywhayare> it would need to imitate the Armadillo API, but that's not too hard to do, I don't think; we just need col(), row(), element-wise operators, and other things like that

14:34 < naywhayare> this is kind of a half-baked idea right now. there are probably lots of things I am overlooking

14:34 < andrewmw94> So this would replace the centeral matrix or both of them?

14:34 < naywhayare> what do you think? hopefully I communicated the idea at least semi-coherently

14:34 < naywhayare> this would replace the central matrix

14:35 < naywhayare> the whole tree would only need to hold one of these objects

14:35 < naywhayare> and indexing is no longer a problem, since we have just one matrix

14:35 < andrewmw94> and the main advantage is that adding a column to an arma::mat is really slow?

14:36 < naywhayare> yeah; insert_cols() for arma::mat is devastating if the matrix is large

14:36 < andrewmw94> ok. Then I think I understand and I think it's a good idea.

14:37 < naywhayare> so, in this case, write your tree to hold one central matrix, and insert points into it using insert_cols()

14:37 < andrewmw94> ok sounds good

14:37 < naywhayare> if the user is planning to insert lots of points, they should use MatType = arma::split_mat (or whatever we call it)

14:37 < naywhayare> otherwise they can use arma::mat and get memory locality bonuses

14:37 < naywhayare> we (or I) can worry about implementing arma::split_mat later

14:38 < andrewmw94> alright

14:38 < naywhayare> if you wanted to implement it, feel free, but maybe it's a little outside the scope of your project, so that one's up to you

14:38 < naywhayare> almost certainly Conrad Sanderson (the Armadillo guy) will never accept the matrix type as a patch

14:38 < naywhayare> but that's why we have src/mlpack/core/arma_extend/ :)

14:39 < andrewmw94> haha. We'll have to put the lunar cycle stuff in there sometime too.

14:40 < naywhayare> n

14:40 < naywhayare> oops, IRC is not gdb

14:40 < andrewmw94> I'm rather surprised that CMake didn't think it was an issue. I guess it is pretty easy to work around, but you would think using the whole file name would be an obvious thing to do.

14:41 < naywhayare> I imagine whoever wrote the code just needed to get it done fast and thought "hopefully people don't have duplicate filenames"

14:41 < naywhayare> I don't think it would be too hard to dig into the CMake codebase and figure out a solution, but that is absolutely not on my list of "things I am enthusiastic about doing"

14:42 < naywhayare> we have a workaround, at least, so that is good

14:42 < naywhayare> I also get the impression that the CMake developer team is quite a bit overworked, given by the number of backlogged bugs...

14:43 < andrewmw94> yeah. It's a popular project.

14:48 Anand has joined #mlpack

14:48 < Anand> Marcus, I fixed the weka logistic regression code as per your suggestion. However, I don't understand what you wanted to convey regarding the mlpack linear regression code.

14:50 < udit_s> naywhayare: Have you had time to look at the mail I sent you and Marcus ?

14:51 < udit_s> naywhayare: Also, let's talk about the Perceptron Code Review when you're free today ?

14:55 < naywhayare> udit_s: I saw the mail you sent. I hope you are feeling better! I've been looking through the perceptron code, so I'll let you know when I am done

14:55 < naywhayare> currently I am writing one more test for the decision stump

14:58 < udit_s> naywhayare: Thanks. Almost better. About the decision stump, a test for what ? Anything I can help with ? Were you satisfied with the decision stump code changes I had done ?

14:59 Anand has quit [Ping timeout: 246 seconds]

15:02 < naywhayare> yeah, the changes were fine. I just wanted to write one more that I was thinking of, it's no problem

16:16 < naywhayare> udit_s: can you tell me how you generated the dataset for the CorrectAttributeChosen test?

16:25 Anand has joined #mlpack

16:29 < udit_s> naywhayare: I picked up that data set from the example where we were discussing it from.

16:31 < udit_s> naywhayare: page 8 of this pdf: http://202.154.59.182/mfile/files/Information%20System/Data%20Mining%3B%20Concepts,%20Models,%20Methods,%20and%20Algorithms%20(2nd%20Edition)/Chapter%206%20%20Decision%20Trees%20And%20Decision%20Rules.pdf

16:33 < naywhayare> okay, thank you

16:34 < naywhayare> the document there says that Attribute1 is the best attribute to split on, but in your test you are checking that Attribute2 is the one that is split on

16:34 < naywhayare> I think this maybe was a simple zero-indexing off by one error :)

16:35 oldbeardo has joined #mlpack

16:35 < oldbeardo> naywhayare: hey, needed some help

16:35 < naywhayare> oldbeardo: I'm about to step out for lunch, but maybe it's quick?

16:36 < oldbeardo> yeah, I wrote a template specialization for SGD<>::Optimize

16:36 < oldbeardo> I get this error

16:36 < oldbeardo> error: no matching function for call to ‘mlpack::svd::RegularizedSVDFunction::Gradient(arma::mat&, size_t&, arma::mat&)

16:37 < oldbeardo> I have commented the Gradient() function out

16:37 < oldbeardo> but it is not using the specialization for some reason

16:37 < naywhayare> do you mean that you have commented the call to the Gradient() function out inside of your SGD<>::Optimize specialization?

16:37 < naywhayare> can you show me the code you used to declare the specialization?

16:37 < oldbeardo> sure, one sec

16:38 < oldbeardo> http://pastebin.com/C8g7tf7U

16:40 < naywhayare> what file is that in?

16:40 < oldbeardo> regularized_svd_function.cpp

16:41 < naywhayare> just as a test, can you try putting it into sgd_impl.hpp? make it an inline function so that the compiler doesn't complain about it being defined multiple times

16:42 < naywhayare> you might have to add an #include <../regularized_svd_function.hpp> at the top of that file just to make it work

16:43 < udit_s> naywhayare: have a look at how they have arrived at that. It isn't a one-off error.

16:43 < oldbeardo> naywhayare: /usr/local/include/mlpack/core/optimizers/sgd/sgd_impl.hpp: In member function ‘double mlpack::optimization::SGD<DecomposableFunctionType>::Optimize(arma::mat&) [with DecomposableFunctionType = mlpack::svd::RegularizedSVDFunction, arma::mat = arma::Mat<double>]’

16:43 < udit_s> naywhayare: The way they split the second column is different than the way we're doing it.

16:43 < oldbeardo> naywhayare: it is using the Optimize already present in the library

16:44 < udit_s> Needless to say, I checked the maths behind our method for inpBucketSize = 3,4,5 and it was the second column which was the best split.

16:46 < naywhayare> udit_s: your math must be backwards; it is picking the dimension with worst split, not with the best split

16:46 < naywhayare> take a look at this test I have written:

16:46 < naywhayare> actually, hang on, this will be easier if I check it in

16:46 < naywhayare> give me just a moment

16:47 < naywhayare> oldbeardo: I understand it is using the wrong one, but put your specialization into sgd_impl.hpp to see if it picks up the correct one

16:48 < oldbeardo> naywhayare: okay, I'll try that

16:49 < udit_s> naywhayare: Okay. The calculateEntropy returns the least entropy (most negative) according to our method for the second attribute. somewhere around -0.089...

16:50 < udit_s> what is the test that you have written ?

16:50 < naywhayare> udit_s: okay, I checked in a new test in r16793; could you take a look at it?

16:50 < naywhayare> I have made a 4-dimensional dataset

16:50 < udit_s> yeah, hang on.

16:50 < naywhayare> the observations in each dimension come from two gaussians

16:50 < naywhayare> and in some dimensions the gaussians are overlapping, and in others they aren't

16:51 < naywhayare> so the decision stump should choose to split on the dimension with gaussians that are furthest apart

16:51 < naywhayare> but what I found was that it always split on the dimension with the closest gaussians... the worst dimension

16:51 < naywhayare> the change I made to the entropy calculation you can see here: http://www.mlpack.org/trac/changeset/16792

16:52 < naywhayare> the first couple diffs in decision_stump_impl.hpp

16:52 < naywhayare> I think your calculation might actually be returning information gain, which is negative entropy. I'm not certain, which is why I'm asking you to take a look

16:52 < naywhayare> but regardless of whether or not my change is correct, I think something was still wrong either way

16:53 < naywhayare> I need to get lunch now... I am starving

16:53 < naywhayare> oldbeardo: hopefully that idea will work. I will try and think about more things that the problem could be, but my best guess is that the compiler is not finding your specialization where it needs to find it

16:54 < naywhayare> I'll have to think about where it should actually go, or why this is even a problem. C++ can be very weird sometimes...

16:54 < oldbeardo> naywhayare: okay, but the same abstraction works for LRSDP, that's why I asked you

16:54 < naywhayare> yeah, which is why I thought it might have to do with the file that it's in

16:55 < naywhayare> do you have regularized_svd_function.cpp in the CMakeLists.txt?

16:56 < jenkins-mlpack> Starting build #2003 for job mlpack - svn checkin test (previous build: SUCCESS)

16:58 < oldbeardo> I'm not working in the sources code, I have a separate folder for Reg SVD

16:58 < oldbeardo> *source code

17:15 < naywhayare> oldbeardo: can you tell me how you are compiling your code and which command is producing the error?

17:24 < udit_s> naywhayare: If you look at the diff, you've changed the initial value of bestEntropy to -DBL_MAX and then are looking for the maximum entropy by changing the 'if' condition statement from '<' to '>'.

17:25 < udit_s> But if you look at the value of entropy you are returning, it is actually sum(x*p(x)) and not -sum(x*p(x))

17:26 < udit_s> Hence you're getting the worst split and not the bestsplit.

17:27 < udit_s> *it should be sum(p(x)*log(p(x)))

17:28 < udit_s> I feel that is what you might have overlooked.

17:30 < naywhayare> udit_s: sorry, maybe I should have clarified more

17:31 < naywhayare> if you revert the changes I made to the selection algorithm

17:31 < naywhayare> i.e. change > back to < and change -DBL_MAX to DBL_MAX

17:31 < naywhayare> then the stump selects the worst splitting attribute on the test I wrote

17:32 < naywhayare> but the changes I made cause the test to pass

17:33 < naywhayare> so even if what I have changed it to is wrong (which it may very well be), I think there was a problem before I changed things too

17:37 < udit_s> hmm... I'm trying with different bucket sizes, but I don't think that makes a difference.

17:38 < oldbeardo> naywhayare: I compile the code using 'g++ test.cpp regularized_svd_function.cpp -lmlpack -larmadillo'

17:38 Anand has quit [Ping timeout: 246 seconds]

17:40 < naywhayare> oldbeardo: did putting the specialization into sgd_impl.hpp and marking it inline help?

17:41 < naywhayare> udit_s: I tried different bucket sizes but that didn't seem to make much of a difference

17:41 < oldbeardo> naywhayare: didn't try it yet, had gone for a bath

17:41 < naywhayare> ah, ok

17:41 < oldbeardo> naywhayare: I'm going to include everything into the source and then try, it may be easier that way

17:42 < naywhayare> udit_s: I noticed that when I printed the entropy, it was ordered backwards -- the best dimension had smallest (most negative) entropy, the next best dimension had next smallest, etc., and the worst dimension had entropy closest to 0

17:46 < udit_s> naywhayare: hang on, let me check something.

18:07 oldbeardo has quit [Ping timeout: 246 seconds]

18:08 sumedhghaisas has joined #mlpack

18:08 < sumedhghaisas> naywhayare: free for some time??

18:14 oldbeardo has joined #mlpack

18:15 < oldbeardo> naywhayare: I built the source with that part of the code commented out

18:15 < oldbeardo> it works fine with L_BFGS

18:15 < jenkins-mlpack> Project mlpack - svn checkin test build #2003: SUCCESS in 1 hr 19 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2003/

18:15 < jenkins-mlpack> * Ryan Curtin: Another test to make sure the correct splitting attribute is used.

18:15 < jenkins-mlpack> * Ryan Curtin: Fix some formatting, fix backwards entropy splitting, add getters/setters, and

18:15 < jenkins-mlpack> comment a little bit about the internal structure of the class.

18:15 < naywhayare> oldbeardo: but you are trying to use SGD; I don't understand what you mean

18:15 < naywhayare> sumedhghaisas: I am here, what do you need?

18:15 < oldbeardo> naywhayare: now when I include the code, I get the following error

18:15 < naywhayare> oh, ok, you weren't done typing yet, sorry about that

18:16 < oldbeardo> for some reason my messages aren't going though

18:17 < oldbeardo> *through

18:17 < oldbeardo> expected initializer before ‘<’ token

18:17 < naywhayare> you probably need to include regularized_svd_function.hpp, or make a forward declaration of the RegularizedSVDFunction class

18:18 < sumedhghaisas> with the use of reverseStepTolerance... sometimes local optimum is not returned... like RMSE reaches 0.91... but then increases to 0.98 ... cause less than 3 jumps are allowed... so it takes 2 jumps... then again decreases... again 2 jumps... can we somehow store the matrices associated with least RMSE...

18:18 < sumedhghaisas> I know it would be costly...

18:18 < oldbeardo> naywhayare: I have included it

18:18 < oldbeardo> the line that it is referring to is:

18:18 < oldbeardo> template<> double SGD<mlpack::svd::RegularizedSVDFunction>::Optimize(arma::mat& parameters)

18:19 < naywhayare> sumedhghaisas: sometimes that is necessary, to keep the local optimum. I guess there is no other choice if reverseStepTolerance > 1

18:19 < naywhayare> oldbeardo: unfortunately that's not enough information for me to be able to figure it out. can you send me a copy of the file you are using or something like that?

18:20 < oldbeardo> naywhayare: okay, I'll send you all the files

18:20 < sumedhghaisas> oldbeardo: sometimes I get that error cause of interlinked includes...

18:22 < sumedhghaisas> naywhayare: so is it okay to not return that minimum value??

18:22 < naywhayare> sumedhghaisas: what do you mean? it sounds like the best thing to do is store the best matrix, and return that when the algorithm has converged

18:23 < naywhayare> sorry, the best W and H, I mean

18:23 < sumedhghaisas> ohh okay... I thought you were against storing those matrices... sorry...

18:23 < oldbeardo> naywhayare: I sent you the files

18:23 < naywhayare> well, in this case we don't have a choice

18:23 < oldbeardo> sumedhghaisas: I will check that out

18:23 < naywhayare> but I would like to do this only in the ValidationRMSETermination policy

18:23 < naywhayare> so the bestW and bestH matrices should be stored there, not in AMF

18:24 < naywhayare> because not all termination policies require holding the bestW or bestH matrices

18:24 < sumedhghaisas> oldbeaddo: I am not sure... But when SGD is not defined C++ will give such an error...

18:24 < sumedhghaisas> naywhayare: ummm... both validation RMSE and simple tolerance...

18:24 < naywhayare> oldbeardo: I need the test file too

18:25 < sumedhghaisas> naywhayare: Okay I will figure out some way...

18:25 < naywhayare> sumedhghaisas: you're right, we will need to modify both of those

18:25 < naywhayare> sumedhghaisas: thank you; I don't think it should be too hard to do that

18:26 < oldbeardo> naywhayare: got it?

18:27 < naywhayare> oldbeardo: yes, thank you

18:27 < udit_s> naywhayare: I wrote one more test. And a little change/improvement. I think I have it, but I'm just going to be sure, and get back to you in a moment.

18:28 < naywhayare> udit_s: thank you for looking into that. let me know when you check it in

18:34 < naywhayare> oldbeardo: I added #include <mlpack/core/optimizers/sgd/sgd.hpp> to the top of regularized_svd_function.cpp and it seems to be getting further

18:34 < oldbeardo> naywhayare: ah, I had commented that out

18:35 < naywhayare> but the errors are different now, and are not the same ones you were getting with an undefined reference to RegularizedSVDFunction::Gradient

18:37 < oldbeardo> yeah, I got those, have no idea what they mean

18:38 < naywhayare> the issue is that you are trying to access local variables of RegularizedSVDFunction, but you are inside of the class SGD

18:38 < naywhayare> so you may need to refactor a little bit so that you can access the lambda parameter in RegularizedSVDFunction

18:38 < naywhayare> maybe provide RegularizedSVDFunction::Lambda()

18:39 < oldbeardo> I'm also getting errors like these -> /home/mlpack/trunk/src/mlpack/../mlpack/core/optimizers/sgd/sgd.hpp:85:13: error: ‘size_t’ does not name a type

18:40 < naywhayare> now there is an interesting one

18:40 < naywhayare> what is the first error in that chain of errors? or is that the only one?

18:41 < oldbeardo> this is the first one

18:41 < naywhayare> oh... <mlpack/core.hpp> wasn't included in sgd.hpp

18:41 < naywhayare> just add that to your list of includes, before including sgd.hpp

18:42 < naywhayare> I'm going to modify sgd.hpp to include that

18:42 < naywhayare> (committed in r16794)

18:43 < oldbeardo> right, now I get the manageable errors, I will get back in some time

18:46 < andrewmw94> naywhayare: I have another question that can hopefully resolved with a C++ feature. The RectangleTree class takes several template parameters: SplitType and DescentType are the ones I need to change for the R* trees and X trees. I want to pass a RectangleTree<> class to methods in each of these (SplitType and DescentType) classes. However, as far as I know, that means these classes need to have templates for each other, which put

18:46 < andrewmw94> How do you deal with stuff like that?

18:47 < naywhayare> template<typename TreeType> class SplitType ?

18:47 < naywhayare> or template<typename TreeType> SplitType(TreeType& node) { // constructor }

18:47 < naywhayare> the second is better if you only need the tree for the constructor

18:48 < andrewmw94> But if I do that, when I declare an R Tree, wouldn't I need to do: RectangleTree<SplitType<RectangleTree<SplitType...

18:49 < jenkins-mlpack> Starting build #2004 for job mlpack - svn checkin test (previous build: SUCCESS)

18:49 < naywhayare> in the second case, no, the RectangleTree<...> gets inferred by the compiler

18:51 < naywhayare> also, I think we have some confusion, maybe

18:51 < naywhayare> in the changeset you just committed, it seems like the R tree nodes now each hold their own dataset

18:52 < andrewmw94> they each hold a reference to the central dataset and a local dataset

18:52 < naywhayare> ok; can you clarify what each is for?

18:53 < naywhayare> I know we've gone back and forth on this several times, so confusion is likely (especially since I'm involved and I can never keep anything straight)

18:53 < andrewmw94> The central dataset is the original matrix, so it is used to keep track of the indices

18:53 < andrewmw94> The local dataset holds say 8 to 20 points contiguously

18:54 < naywhayare> oh, ok, I see

18:55 < andrewmw94> It should be possible to remove the central dataset entirely I think, but I kind of want to keep it because I have the start of an idea of how to make it contiguous

18:55 < naywhayare> so when tree.Insert(vec&) is called, then insert_cols() is called on the main dataset, and the local dataset has the vector appended?

18:55 < andrewmw94> (continuing my previous comment) if the user specifies that they don't want to have a dynamic tree

18:56 < naywhayare> I guess I was under the impression that the best thing we can do is store one central matrix, and later write an arma::split_mat class that holds a vector of arma::vec

18:56 < andrewmw94> (in answer to yours) currently, you would add the datapoint to the matrix yourself, and then call tree.Insert(index) to insert it

18:57 < naywhayare> okay

18:57 < andrewmw94> Yeah, I think the arma::split_mat idea is good. But what if we had the option to build a static tree? Then the central dataset can be remapped in a reasonable time at the end of tree construction I think.

18:58 < naywhayare> true

18:58 < naywhayare> maybe this is something the user should do after construction? some function like RectangleTree::LinearizePoints(std::vector<size_t>& oldFromNew /* the mappings */)

18:59 < naywhayare> and it would only really make sense for the user to call that if they were using a static tree with arma::mat not arma::split_mat

18:59 < naywhayare> or we could even use templates to detect when arma::mat is being used and then assume that the user isn't interested in inserting/deleting points, and linearize it

18:59 < andrewmw94> Yeah. That's the idea. I'm not sure how to handle the point remapping in treetraits then

18:59 < andrewmw94> since it is sometimes true sometimes false

19:00 < andrewmw94> I guess you could just say it modifies the dataset and have it be slightly slower if it really didn't

19:00 < naywhayare> hm, we'll have to make a decision... if arma::mat always implies remapping of points, then we just partially specialize TreeTraits to the case where RectangleTree is using arma::mat

19:01 < naywhayare> I think that's reasonable to say, because if you're using arma::mat it doesn't really make any sense to insert and delete since it will take forever to call insert_cols() on the big dataset

19:01 < naywhayare> (in fact we could even use templates to disable Insert() and Delete() when arma::mat is being used, or issue some huge warning or something)

19:01 < andrewmw94> yeah, I think this should work.

19:02 < naywhayare> I think we can worry about the mapping later, though; that should be straightforward to work in according to the ideas you've proposed after we have the whole thing working and cleaned up

19:02 < naywhayare> also, I got to the bottom of the kd-tree issue

19:02 < naywhayare> it took me a little bit of paper... http://ratml.org/misc_img/tree_debugging.jpg

19:03 < naywhayare> I traced out the entire path down to the query node and reference node, then kept track of all the relevant pruning decisions and Score() calls (and some other ones too...)

19:03 < naywhayare> I ended up collecting way more information than necessary, but I don't ever mind because it often results in interesting insights

19:04 < naywhayare> for instance, note that for whatever reason, on this dataset, the children at the top of the tree are very unbalanced

19:04 < naywhayare> I've written down the first point in the node, and the count of points in the node

19:04 < naywhayare> so the first left child has 36858 points... and the first right child has about 900

19:04 < naywhayare> the next right child has about 200

19:04 < naywhayare> and there continues like 30 or 40 levels of the tree where the right child has very few points in it

19:05 < andrewmw94> interesting.

19:05 < andrewmw94> I guess there's always the possibility of getting a skewed tree. But that's a really big difference

19:05 < naywhayare> using median split instead of mean split would produce a perfectly balanced tree, but I wonder how it would perform in practice in comparison to mean split

19:06 < naywhayare> I think the corel dataset is special because it appears to be a sparse dataset

19:06 < naywhayare> so points only have nonzero values in a handful of the 32 dimensions

19:06 < naywhayare> my guess is that the mean in a dimension ends up being a very small number, with the majority of points having values of 0

19:07 < naywhayare> so there are tons of points to the left of the mean, but only a handful of outliers to the right

19:07 < andrewmw94> ahh.

19:07 < andrewmw94> Perhaps it would work better to sample n points randomly and use the median of those?

19:07 < andrewmw94> but that still doesn't deal with seleting a dimension

19:08 < andrewmw94> I guess you could choose the dimension that has the most even split with that sample

19:08 < naywhayare> the dimension is selected as the dimension with maximum variance; but the actual splitting is done in the MeanSplit class

19:08 < naywhayare> and it's templatized, so it'd be really easy to figure something else out

19:09 < naywhayare> I suppose the dimension selection could be templatized too...

19:09 < naywhayare> either way, the actual bug doesn't have anything to do with the weird splits, it's an invalid assumption in neighbor_search_rules_impl.hpp

19:09 < naywhayare> I haven't yet figured out the best way to fix it

19:10 < andrewmw94> I would actually move it to the SplitType code. (the dimension selection)

19:10 < naywhayare> anyway, back to the original topic, any confusion on how we should proceed with the RectangleTree?

19:10 < naywhayare> and yeah, maybe the dimension selection should be in the SplitType code

19:10 < andrewmw94> I'm still not sure on how to do the templatized function for TreeDescent.

19:11 < andrewmw94> It isn't a constructor, so I don't think the second option would work.

19:11 < naywhayare> hm, let me think for a minute

19:12 < naywhayare> could you just templatize all the functions? template<typename TreeType> RTreeSplit::SplitLeafNode(TreeType* node)

19:12 < naywhayare> and assume that TreeType is RectangleTree<...>

19:13 < andrewmw94> Basically, I want to pass a node of the tree to the function DescentType::EvalNode() so that it can look at the children. The problem is that if I templatize DescentType, then I need to include SplitType in the template. And SplitType needs to have DescentType included in it's template. I can do what you suggested there (I think, I'm still not fluent with templates) but when I declare the tree won't I have issues?

19:14 < andrewmw94> RectangleTree<RTreeSplit<RectangleTree<RTreeSplit...

19:14 < naywhayare> I think you should be able to drop all references to RectangleTree using the idea I proposed

19:15 < naywhayare> so DescentType::EvalNode() becomes template<typename TreeType> DescentType::EvalNode(TreeType* node)

19:15 < naywhayare> and SplitType::SplitLeafNode() becomes template<typename TreeType> SplitType::SplitLeafNode(TreeType* node)

19:15 < naywhayare> and you're just depending on the compiler to do all the type deduction for you

19:15 < andrewmw94> but when I build the R tree in allknn_main.cpp, don't I need to fill out the template?

19:16 < naywhayare> yeah, but it'll just be RectangleTree<SplitType, DescentType, MatType> (or... something like that)

19:16 < naywhayare> and each of those template parameters don't take any template arguments

19:16 < naywhayare> their functions do, but the values of those template arguments get deduced by the compiler

19:16 < andrewmw94> ahh.

19:16 < andrewmw94> I'll give that a try then.

19:16 < andrewmw94> thanks

19:17 < naywhayare> yeah; take a look at this example:

19:17 < naywhayare> http://pastebin.com/1fXenZw5

19:18 < naywhayare> specifically, line 12 -- you don't need to explicitly specify the type to the template function DoSomeThings(), because the compiler can already deduce it from the argument passed to DoSomeThings()

19:18 < naywhayare> so this is basically the same idea... let the compiler deduce the tree type by what's passed to SplitNode::SplitLeafNode() and DescentType::EvalNode()

19:18 < andrewmw94> alright sounds like it should work.

19:18 < naywhayare> it should, and it will make all your function signatures a lot cleaner too :)

19:19 < naywhayare> let me know if you have any problems with that approach

19:19 < andrewmw94> ok thanks.

19:19 < naywhayare> udit_s: you said you had added a new test to the decision stump?

19:20 < udit_s> yeah, I just pushed the code, r16796

19:22 < udit_s> so basically what we were calculating was gain. So I changed the constructor to calculate the least negative gain (take care of the minus sign), hence the gain < bestgain condition.

19:22 < naywhayare> ah, ok! I had thought maybe the issue was something like that, but I wasn't exactly sure

19:22 < udit_s> I was going through the Classify function.

19:23 < udit_s> and I couldn't understand the line 93.

19:24 < udit_s> why are you taking it the row as the splitAttribute row from test?

19:24 < naywhayare> that's the value that we need to place in one of the bins

19:25 < naywhayare> for test point i, we want the attribute (or dimension) splitAttribute, which is the attribute/dimension that the stump is built on

19:25 < naywhayare> or did I misinterpret what that should be?

19:26 < naywhayare> (I made a very simple style fix in r16797, by the way)

19:29 < udit_s> Yeah, I had left it there for a your clarity. Btw, what is the analogue of git diff in git svn ?

19:30 < naywhayare> svn diff :)

19:30 < naywhayare> you can specify revision numbers too... svn diff -rXXXXX

19:31 < naywhayare> that will give you the changes between revision XXXXX and trunk

19:31 < naywhayare> or you can get the changes between revisions X and Y... svn diff -rX:Y

19:33 < naywhayare> there may be a lot of lines that look identical in diffs... those are usually my editor stripping whitespace from the end of lines

19:34 < udit_s> oh. okay.

19:36 < sumedhghaisas> naywhayare: okay making a copy at each iteration is very costly...

19:36 < sumedhghaisas> so I implemented a different solution...

19:37 < sumedhghaisas> everytime the index starts to increase I make a copy in termination policy...

19:37 < sumedhghaisas> and keep a note that copy is made...

19:37 < naywhayare> udit_s: my last comment (at least for now) about the decision stump is that I think the if statement at line 47 (if (isDistinct<double>(data.row(i)))) is a situation that will happen extremely infrequently with continuous data

19:38 < naywhayare> so I think that's going to slow things down a lot; it's an O(N) loop each time it's called

19:38 < naywhayare> sumedhghaisas: ok, that sounds good

19:38 < udit_s> ( :D - "at least for now" )

19:39 < udit_s> okay, but it is an edge case, is it not ?

19:39 < naywhayare> what do you mean? will SetupSplitAttribute() have problems if all the points are the same?

19:39 < sumedhghaisas> naywhayare: then some case handling here and there... and we can save the minimum in minimum number of copies... :)

19:40 < sumedhghaisas> group lens taking only 2 copies...

19:41 < naywhayare> sumedhghaisas: great, good to hear that

19:41 < sumedhghaisas> but only one thing... I am storing W and H after increase is detected ....

19:41 < udit_s> naywhayare: Ah- that, no it won't.

19:41 < sumedhghaisas> so the minima is not stored...

19:42 < sumedhghaisas> but jumps are not that high... so its almost equivalent to the minima...

19:42 < naywhayare> udit_s: ok; so I think we can do one of two things here: we can drop the call to isDistinct<> since it happens so infrequently, or, we can rewrite isDistinct to take a lot less time, something like this:

19:43 < naywhayare> double val = featureRow[0]; for (size_t i = 1; i < featureRow.n_elem; ++i) { if (featureRow[i] != val) return false; } return true;

19:43 < naywhayare> (sorry that it's all on one line, I just did that to keep it compact for IRC)

19:43 < naywhayare> the cost should be two comparisons for most datasets (it gets to the i = 1 iteration of the for loop, and featureRow[0] != featureRow[1] so it terminates

19:43 < naywhayare> )

19:44 < naywhayare> sumedhghaisas: that's okay, I think that's still better than copying it every single time

19:44 < udit_s> yeah, we could do that. This would be definitely better (faster) than what we have now.

19:45 < naywhayare> ok, would you like to make that change?

19:45 < naywhayare> also we should change it from isDistinct to IsDistinct so it's in line with the rest of mlpack function names (I know, it is a very trivial change... :))

19:45 < udit_s> Yeah, I'll do that.

19:45 < naywhayare> ok, thank you

19:46 < udit_s> Anything else ? Or will look at Perceptron after this ?

19:46 < oldbeardo> naywhayare: when I include the specialization in sgd_impl.hpp I get this error : /home/mlpack/trunk/src/mlpack/../mlpack/core/optimizers/sgd/sgd_impl.hpp:117:12: error: ‘svd’ is not a member of ‘mlpack’

19:48 < naywhayare> udit_s: yeah, I am looking at the perceptron now. usually I think it's easier to do a code review in email, so I am writing one up and I'll send it to you (and CC marcus) soon

19:48 < naywhayare> oldbeardo: I thought you had it working with the specialization in regularized_svd_function.cpp ?

19:49 < naywhayare> I got it to compile and run after fixing the issues where SGD was trying to access internal variables of RegularizedSVDFunction

19:49 < naywhayare> (or, more specifically, I just removed those bits to make sure I could get it to compile...)

19:49 < oldbeardo> well, the source code builds, but the test function does not

19:50 < naywhayare> really? I had it compiling fine... let me send you what I have

19:50 < oldbeardo> it gives the same Gradient() missing error

19:50 < naywhayare> ok, sent... be careful, the overload is no longer correct because I simply removed the lambda parameter from the two lines it appeared in

19:51 < naywhayare> I compiled like this:

19:52 < naywhayare> g++ -o test test.cpp regularized_svd_function.cpp -I/home/ryan/work/fastlab/mlpack/trunk/build/include/ -L/home/ryan/work/fastlab/mlpack/trunk/build/lib/ -lmlpack -larmadillo -I/usr/include/libxml2 -lxml2

19:52 < naywhayare> and it compiled fine; you'll have to change the name of the directories given to -I and -L, but I think that should work for you

19:56 < oldbeardo> naywhayare: the code you sent me has the optimizer as L_BFGS, the errors arise when SGD is used as AugLagrangian is used for LRSDP

19:56 < naywhayare> oh. I see...

19:56 < naywhayare> hang on...

19:58 < oldbeardo> I added this in the initialization list in regularized_svd_impl.hpp -> rSVDFunc(data, rank, lambda), optimizer(rSVDFunc, 0.01, iterations * data.n_cols),

19:58 < oldbeardo> and declared the same in the header file

19:59 < naywhayare> okay, now I am reproducing the same problem you were

20:00 < oldbeardo> yeah, this doesn't make sense to me, the same thing works for LRSDP

20:02 < naywhayare> okay, I made it compile, by providing a forward declaration of the specialization in regularized_svd_function.hpp

20:02 < naywhayare> I also had to include sgd.hpp in that file

20:03 < oldbeardo> okay, let me try it out

20:04 < oldbeardo> how do you declare a specialization?

20:05 < naywhayare> the same way you declare a function --

20:05 < naywhayare> namespace mlpack {

20:05 < naywhayare> namespace optimization {

20:05 < naywhayare> template<>

20:05 < naywhayare> double SGD<mlpack::svd::RegularizedSVDFunction>::Optimize(arma::mat& iterate);

20:05 < naywhayare> }; // namespace optimization

20:05 < naywhayare> }; // namespace mlpack

20:09 < oldbeardo> okay, it compiles, not running properly though for some reason

20:09 < jenkins-mlpack> Project mlpack - svn checkin test build #2004: SUCCESS in 1 hr 20 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2004/

20:09 < jenkins-mlpack> * andrewmw94: R tree now has dataset and indices

20:09 < jenkins-mlpack> * Ryan Curtin: Include mlpack/core.hpp.

20:09 < jenkins-mlpack> Starting build #2005 for job mlpack - svn checkin test (previous build: SUCCESS)

20:09 < naywhayare> did you use the version I sent you? I knowingly broke that one just to make it compile

20:10 < naywhayare> the specialization is way wrong because it doesn't use the lambda parameter of the RegularizedSVDFunction at all

20:11 < oldbeardo> yes, but the error is for index out of bounds

20:11 < oldbeardo> not a performance issue

20:11 < naywhayare> was it preceded by this message? [WARN ] Cannot open file 'GroupLens100k.csv'; load failed.

20:12 < naywhayare> I didn't put that file in the tarball I sent you

20:12 < oldbeardo> heh, no, I put the file in the folder

20:12 < naywhayare> I would start over with the implementation you sent me earlier today... any changes I made were only to make it compile, and like I said, I probably broke the whole thing

20:13 < oldbeardo> yeah, I will try it out with the source version

20:17 < udit_s> naywhayare: I've committed changes in r16801. So that's Decision Stump, for now.

20:17 < naywhayare> thanks! I think I am just about done with the code review

20:17 < naywhayare> the perceptron is way simpler than the decision stump

20:18 < udit_s> Yeah. It is. I don't I really grasped how complex the decision stump could be when I proposed a time line for it. :)

20:18 < udit_s> *think

20:19 < udit_s> And, now I'm off to sleep. I'll look at the mail when I wake up tomorrow. Thanks for everything today.

20:20 udit_s has quit [Quit: Leaving]

20:21 < oldbeardo> naywhayare: the mistake was in the new Optimize() function

20:21 < oldbeardo> rating = data(2, i)

20:21 < oldbeardo> the 'i' should have been 'currentFunction'

20:22 < naywhayare> ah, glad that you found the issue

20:30 < oldbeardo> naywhayare: the implementation works, though it takes as much time as the L_BFGS implementation

20:33 < oldbeardo> slightly more actually

20:37 oldbeardo has quit [Quit: Page closed]

21:28 < jenkins-mlpack> Project mlpack - svn checkin test build #2005: SUCCESS in 1 hr 18 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2005/

21:28 < jenkins-mlpack> * Ryan Curtin: Don't use arma::unique() because it's slow.

21:28 < jenkins-mlpack> * Ryan Curtin: Use bool instead of int for tracking convergence.

21:28 < jenkins-mlpack> * Ryan Curtin: Fix some formatting issues; no functionality change.

21:28 < jenkins-mlpack> * Ryan Curtin: Const-correctness and 80-character lines... very trivial fix, no functionality

21:28 < jenkins-mlpack> change.

21:28 < jenkins-mlpack> * saxena.udit: Entropy calculation improved.

21:28 < jenkins-mlpack> Starting build #2006 for job mlpack - svn checkin test (previous build: SUCCESS)

21:36 < sumedhghaisas> naywhayare: okay that problem is fixed... committing that code...

21:37 < sumedhghaisas> how to test SVDIncremental learning?? regularization is same as SVDBatch ... so need to test that...

21:45 < naywhayare> even if the regularization is the same, there's no guarantee (in the future) that it'll be implemented exactly the same as SVDBatch, so you should write a test for it too

21:46 < naywhayare> you can reuse many of the test cases from SVDBatch, but you will probably have to change the RMSE that it converges too because it's a slightly different algorithm

21:46 < sumedhghaisas> okay... so 2 tests...

21:47 < sumedhghaisas> negative element test and regularization test...

21:49 < naywhayare> we should also have a test to make sure it converges for simple datasets

21:50 < naywhayare> you could randomly generate simple datasets or load them

21:50 < naywhayare> there should also be a test like that for SVDBatch

21:51 < sumedhghaisas> generating random matrix would be better for this test...

21:51 < naywhayare> if you want it to be sparse you can use sp_mat::sprandu() or sprandn()

21:51 < sumedhghaisas> i won't make it too big...

21:52 < sumedhghaisas> yes... I was going to ask that... thanks...

21:53 < sumedhghaisas> very boring match.... :(

22:19 < naywhayare> less exciting than yesterday, huh? :)

22:19 sumedhghaisas has quit [Ping timeout: 240 seconds]

22:47 < jenkins-mlpack> Project mlpack - svn checkin test build #2006: SUCCESS in 1 hr 19 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2006/

22:47 < jenkins-mlpack> * Ryan Curtin: Lengthen comments that weren't 80 columns long. This may be the most trivial

22:47 < jenkins-mlpack> fix ever in my long, decorated history of trivial commits.

22:47 < jenkins-mlpack> * Ryan Curtin: Very minor changes.

22:47 < jenkins-mlpack> * saxena.udit: IsDistinct() improved.

22:47 < jenkins-mlpack> Starting build #2007 for job mlpack - svn checkin test (previous build: SUCCESS)