#mlpack on 2014-07-16 — irc logs at libera.irclog.whitequark.org

2014-05-21 16:24 naywhayare changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/

01:14 sumedh__ has quit [Ping timeout: 240 seconds]

04:03 Anand has joined #mlpack

04:28 Anand has quit [Ping timeout: 246 seconds]

08:30 Anand has joined #mlpack

09:06 Anand has quit [Ping timeout: 246 seconds]

09:09 sumedh__ has joined #mlpack

09:23 Anand has joined #mlpack

09:36 < Anand> Marcus : I am adding a new 'bootstrap' task to run_benchmark.py. Will it be fine>

09:36 < Anand> ?

09:37 < Anand> The RunMetrics(..) method now returns a dictionary containing metric name and result

09:38 < Anand> Also I think we should create a separate 'print' task too to print the table we discussed yesterday. Because, it is not a good idea to print metrics for all libraries and all methods when executing 'print' task of a single method

09:41 < Anand> To do this how should I iterate over the methods to call RunMetrics. In instance.RunMetrics(..), how should I loop over the 'instance'?

09:44 < Anand> Sorry, I meant "'metric' task of a single method" not "'print' task of a single method" above!

09:50 Anand has quit [Ping timeout: 246 seconds]

09:56 Anand has joined #mlpack

10:34 < marcus_zoq> Anand: Hello, yeah let's add a new bootstrap. I think you iterate through all methods, and store the results into a data structure. Afterwards, you can iterate through the data structure.

10:34 < marcus_zoq> Anand: You can write something like: result = instance.RunMetrics(options) results['methodname'] = results.

10:36 < Anand> Marcus : We have this : instance = methodCall(modifiedDataset[0], timeout=timeout, verbose=False)

10:36 < Anand> How to change the method here?

10:40 < marcus_zoq> Anand: Todo what?

10:40 < Anand> To call instance.RunMetrics()

10:41 < Anand> I mean how do we iterate through all methods?

10:49 < marcus_zoq> Anand: The line 'instance = methodCall(modifiedDataset[0], timeout=timeout, verbose=False)' calls the constructor of the specified method. Currently we use the constructor to set the dataset and some other options. You can use this instance to call the metric or the timing method. 'instance.RunMetrics(options)'. The options parameter is a string which contains additional information for the method (e.g rank). The existing code basis already itera

10:52 < Anand> Marcus : You are saying that the current code already iterates through all methods and all libraries?

10:52 < Anand> Ok, then I guess it should be fine.

10:54 < Anand> The data structure for storing the metrics for all methods should be a global then. I can create a dictionary there, probably

10:55 < marcus_zoq> Anand: Yeah, sounds good.

11:20 < jenkins-mlpack> Starting build #2020 for job mlpack - svn checkin test (previous build: SUCCESS)

11:25 < Anand> Marcus : instance.description gives the method name? If not, how can I get the method name for which we ran the metrics?

11:32 < marcus_zoq> Anand: The 'method' parameter contains the method name (e.g. Linear Regression).

11:34 < Anand> So we are taking results for a particular method for all libraries and then moving to a new method, right? (The library for loop is inside the method loop!)

11:42 < marcus_zoq> Anand: Right!

12:05 sumedh_ has joined #mlpack

12:09 sumedh__ has quit [Ping timeout: 264 seconds]

12:14 Anand has quit [Ping timeout: 246 seconds]

12:47 < jenkins-mlpack> Project mlpack - svn checkin test build #2020: SUCCESS in 1 hr 26 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2020/

12:47 < jenkins-mlpack> sumedhghaisas: * added svd incomplete incremental learning tests

12:47 < jenkins-mlpack> * combined functions IsConverged and Step of termination policies into IsConverged

12:54 < marcus_zoq> naywhayare: Is it more reliable to use a static bool instead of forward a pointer (mat or vec) to an overloaded function. That points to the select points from the dataset or a completely new matrix? I know there are some hints in the mlpack future paper...

13:18 < naywhayare> marcus_zoq: I'm not completely sure what you mean, can you explain more?

13:41 < marcus_zoq> naywhayare: Maybe this makes my question more clear; Instead of returning void the SelectionPolicy could return a pointer. Which could be used to choose between point indexes and completely new points. Maybe this is a horrible idea...

14:11 < naywhayare> marcus_zoq: ok, I see what you mean

14:12 < naywhayare> but the difficulty I see there is that if you are selecting points, it would need to return a pointer to an arma::Col<size_t>, but if you were making new points, it would have to return a pointer to an arma::mat object

14:17 andrewmw94 has joined #mlpack

14:21 < marcus_zoq> naywhayare: Right, this is the basic idea, use the return type as decision rule. Is there something wrong, with this idea, except that it is 'against' the design rules?

14:23 < naywhayare> hmmm... how would you use the return type to make the decision? that's the part I don't get

14:33 < marcus_zoq> naywhayare: I thought I can forward the result of the SelectingPolicy::Select(...) to an overloaded function something like: 'Evalutate( arma::mat* ) { /* special treatment for arma::mat */ } Evalutate( ... ) { /* default treatment */ }'

14:35 < naywhayare> ah, I see what you mean

14:35 < naywhayare> that would be a nice solution because it means no static bool is required

14:36 < naywhayare> it doesn't seem like that's overly complex, either, and the SelectionPolicy is simpler than before, so I'd say give it a shot and see if it works

14:42 < marcus_zoq> naywhayare: Okay, so it's not against the design policy because in general we pass the result matrix by reference?

14:43 < naywhayare> yeah, you don't need to pass by pointer if you're using return type overloading; you can just use a reference

14:43 < naywhayare> plus, the design policy guidelines are only valid until they make something impossible. then they have to change :)

15:07 < jenkins-mlpack> Starting build #2021 for job mlpack - svn checkin test (previous build: SUCCESS)

15:08 udit_s has joined #mlpack

15:10 < naywhayare> andrewmw94: rectangle_tree.hpp, line 118

15:10 < naywhayare> the constructor that takes a RectangleTree* is being called implicitly to convert the RectangleTree* to a RectangleTree&

15:12 < naywhayare> it's a bit dangerous to leave that constructor there as-is; some user may end up doing the same thing later and winding up with very weird, difficult to debug results

15:12 < naywhayare> so there are two things that you could do: one is mark the constructor explicit; then the compiler will never try to use it for implicit casts

15:13 < naywhayare> the other is refactoring so that either that constructor isn't necessary, or so that it has a different signature

15:14 < andrewmw94> ok. I really don't know which option is best. That constructor could probably be removed entirely, it just makes the splitting of nodes more messy

15:15 < naywhayare> it's your call. if it's only used internally, you could also mark it private

15:16 < naywhayare> where is it used?

15:17 < naywhayare> oh, sorry, I found it. I was searching in the wrong way...

15:18 < andrewmw94> Is there a reason that the splitType isn't a member class of the BinarySpaceTree?

15:19 < andrewmw94> I just copied that design here, but I think it would work to move SplitType and then make the constructor private

15:19 < naywhayare> it probably doesn't need to be a member of BinarySpaceTree, but it wouldn't be a bad idea if a user could pass an instantiated SplitType object to the constructor

15:19 < naywhayare> for whatever reason, that wasn't done when SplitType was factored out

15:20 < naywhayare> if you wanted to file a bug on trac about it, I wouldn't mind, but I see it as pretty low-priority

15:21 < andrewmw94> yeah, I don't think it makes a difference either way for BSP trees. It's slightly nicer if the R tree matches though, and if that's the case, then I can't make the constructor private.

15:21 < andrewmw94> And I can think of any real cases where the use would want to be able to use that constructor.

15:22 < andrewmw94> can't*

15:22 < andrewmw94> big difference

15:22 < naywhayare> what do you mean by "if the R tree matches"?

15:22 < andrewmw94> I mean it's nice to have the design be as similar as possible

15:23 < andrewmw94> so if SplitType is a member class of one, it would be a member class of the other.

15:23 < andrewmw94> although partially that decision was to allow me to copy BinarySpaceTree whenever I didn't know how MLPACK worked

15:24 < naywhayare> I wouldn't be too worried about that

15:24 < naywhayare> the only part of the API that needs to match is the actual functions that the tree provides

15:24 < andrewmw94> yeah. I just never know whether there was a reason for BSP trees to be like that which will come back and haunt me later.

15:24 < naywhayare> still, templatizing SplitType probably isn't a bad idea, in case there are R tree variants that someone might want to use someday that have different splitting mechanisms

15:25 < andrewmw94> Ahh, SplitType is a template, I just don't have it as a member class

15:25 < naywhayare> well, you don't need to have it as a member class; it's only relevant at construction time

15:26 < naywhayare> so no need to hold on to it in the completed R tree (which is what you get back after calling the constructor)

15:26 < andrewmw94> but I need it to be a member class if I want it to be able to access a private constructor that takes a pointer right?

15:26 < naywhayare> oh! we are talking about different things

15:26 < naywhayare> I'm sorry. everything I've said (or most of it) is irrelevant

15:27 < andrewmw94> ahh. I thought I was just really sleepy

15:27 < naywhayare> no, that's probably me

15:27 < andrewmw94> probably both.

15:27 < naywhayare> ok, I see the problem then

15:27 < naywhayare> so, I would say here that just marking the constructor explicit is good enough

15:27 < andrewmw94> ok. I'll leave it like that for now then.

15:27 < naywhayare> the other option is to provide a constructor that takes everything that's necessary to build a ready-to-go R tree node

15:27 < naywhayare> BinarySpaceTree has a constructor like this, which takes the begin, count, dataset, etc.

15:28 < naywhayare> binary_space_tree.hpp:166, or something like that

15:28 < naywhayare> but that may not be relevant in your situation

15:28 < andrewmw94> yeah. Begin and count aren't the same in the R tree so that doesn't really work.

15:29 < naywhayare> yeah, of course -- I was just suggesting a constructor of a similar manner, which takes everything the R tree node needs to be built by the end of the constructor

15:29 < andrewmw94> I'm not sure if I follow

15:30 < andrewmw94> so the effect would be the same as calling the normal constructor right?

15:30 < andrewmw94> but the difference is that you do some of the processing outside of the constructor and give that as input?

15:30 < naywhayare> that's how that constructor is used in the binary space tree

15:30 < naywhayare> but after a little more reading of what you've done, I'm not sure it's applicable to the rectangle tree

15:31 < naywhayare> so for now you are probably best off by just marking the constructor explicit and moving on

15:31 < naywhayare> eventually when you have it working, I'll make a pass over it and we can talk about refactoring (which is usually pretty straightforward) then

15:32 < andrewmw94> Yeah. I'm not sure if something like that's a good idea. It seems like it's begging the user to insert the node somewhere in the tree, which breaks the guarantee that the leaves are all on the same level.

16:32 < jenkins-mlpack> Project mlpack - svn checkin test build #2021: SUCCESS in 1 hr 24 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2021/

16:32 < jenkins-mlpack> Ryan Curtin: The size parameter is unused outside the constructor; also, commit this change

16:32 < jenkins-mlpack> to trunk this time...

16:32 < jenkins-mlpack> Starting build #2022 for job mlpack - svn checkin test (previous build: SUCCESS)

17:08 Anand has joined #mlpack

17:21 Anand has quit [Ping timeout: 246 seconds]

17:24 Anand has joined #mlpack

17:54 < sumedh_> naywhayare: you there??

17:57 < naywhayare> sumedh_: yeah, I am here

17:58 < sumedh_> okay I there a function to get the total non-zero entries in a matrix??

17:58 < sumedh_> *is

17:58 < naywhayare> for a sparse matrix? yeah

17:58 < naywhayare> so, sp_mat X; X.n_nonzero

17:58 < sumedh_> okay and for normal matrix?? iterators??

17:59 < naywhayare> for a normal matrix I think the best you can do is 'accu(X != 0)'

17:59 < sumedh_> iterators will be faster right??

17:59 < naywhayare> for a normal matrix? no, that should be the same

17:59 < naywhayare> accu() should use iterators or whatever is fastest internally

18:00 < naywhayare> but don't use accu(X != 0) for a sparse matrix, because n_nonzero already tells you how many nonzero elements there are

18:00 < sumedh_> in this X is the matrix right??

18:00 < naywhayare> yes

18:01 < sumedh_> okay... So I will use template specialization for sp_mat

18:01 < jenkins-mlpack> Project mlpack - svn checkin test build #2022: SUCCESS in 1 hr 29 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2022/

18:01 < jenkins-mlpack> * saxena.udit: Minor changes to the macros in the *main.cpp files.

18:01 < jenkins-mlpack> * saxena.udit: Minor changes

18:11 < sumedh_> naywhayare: oops... I guess I cannot explicitly specialize a function when enclosing class is not specialized can I ??

18:14 < naywhayare> no, I don't believe you can do that

18:14 < naywhayare> what are you trying to specialize?

18:15 Anand has quit [Ping timeout: 246 seconds]

18:15 < sumedh_> basically CompleteIncrementalTermination cass is templatized with Termination policy...

18:15 < sumedh_> its initialize function has to be templatized with matrix type...

18:16 < naywhayare> yeah, so,

18:16 < naywhayare> template<>

18:16 < naywhayare> template<typename TerminationPolicy>

18:16 < naywhayare> void CompleteIncrementalTermination<TerminationPolicy>::Initialize(sp_mat& X) ?

18:16 < naywhayare> that doesn't work/

18:16 < naywhayare> *?

18:17 < sumedh_> no I am getting this error...

18:17 < sumedh_> /home/sumedh/mlpack_test/amf/termination_policies/svd_complete_incremental_learning.hpp|60|error: prototype for ‘void mlpack::amf::CompleteIncrementalTermination<TerminationPolicy>::Initialize(const sp_mat&)’ does not match any in class ‘mlpack::amf::CompleteIncrementalTermination<TerminationPolicy>’|

18:18 < sumedh_> /home/sumedh/mlpack_test/amf/termination_policies/svd_complete_incremental_learning.hpp|17|error: candidate is: template<class TerminationPolicy> template<class MatType> void mlpack::amf::CompleteIncrementalTermination<TerminationPolicy>::Initialize(const MatType&)|

18:18 < sumedh_> template<typename TerminationPolicy>

18:18 < sumedh_> template<>

18:18 < sumedh_> void CompleteIncrementalTermination<TerminationPolicy>::Initialize(sp_mat& X)

18:18 < sumedh_> right??

18:18 < sumedh_> I have tried both...

18:19 < naywhayare> that should not have a problem

18:19 < naywhayare> where are you putting the specialization in the file?

18:21 < sumedh_> okay one cannot do that... I also specialized the class and it worked...

18:21 < sumedh_> I remembering reading about this in the book...

18:21 < sumedh_> but I was not sure about it...

18:22 < sumedh_> template<typename TerminationPolicy>

18:22 < sumedh_> <sumedh_> template<>

18:22 < sumedh_> <sumedh_> void CompleteIncrementalTermination<TerminationPolicy>::Initialize(sp_mat& X)

18:22 < sumedh_> when I do this I get this error...

18:22 < sumedh_> /home/sumedh/mlpack_test/amf/termination_policies/svd_complete_incremental_learning.hpp|61|error: enclosing class templates are not explicitly specialized|

18:23 < sumedh_> /home/sumedh/mlpack_test/amf/termination_policies/svd_complete_incremental_learning.hpp|61|error: invalid explicit specialization before ‘>’ token|

18:24 < naywhayare> so, it turns out this is not allowed by the standard, as you pointed out

18:24 < naywhayare> I couldn't remember if this case was illegal or if it was another case that was illegal

18:24 < naywhayare> but you can provide an overload for sp_mat in the definition of CompleteIncrementalTermination

18:24 < sumedh_> the first template is assigned to class and the second template is assigned to the function...

18:24 < naywhayare> in addition to the function

18:24 < naywhayare> template<typename MatType> Initialize(const MatType&)

18:24 < naywhayare> you can also provide

18:24 < naywhayare> Initialize(const sp_mat&)

18:24 < sumedh_> thats why it was giving prototype error...

18:25 < sumedh_> yes... I just wanted to conform there is no way around this with templates...

18:26 < naywhayare> there probably is, but it is probably very ugly...

18:27 < sumedh_> I won't implement that in the code... but still I would like to try that out.. do you know how to start??

18:27 < sumedh_> I love templates... so learning them is fun...

18:28 < naywhayare> I would have to think about it for a while, but I'm doing other things right now so I don't have any good advice on where to start

18:30 < sumedh_> okay... no problem... even I have to complete the update rule and termination policy right now :)

18:30 < sumedh_> lets think about that later...

18:32 < jenkins-mlpack> Starting build #2023 for job mlpack - svn checkin test (previous build: SUCCESS)

18:57 udit_s has quit [Quit: Leaving]

19:02 sumedh_ has quit [Ping timeout: 250 seconds]

19:58 < jenkins-mlpack> Project mlpack - svn checkin test build #2023: SUCCESS in 1 hr 25 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2023/

19:58 < jenkins-mlpack> saxena.udit: Adaboost and Perceptron modified (improved constructor), going for tests on one Weak L

21:13 < jenkins-mlpack> Starting build #2024 for job mlpack - svn checkin test (previous build: SUCCESS)

22:40 < jenkins-mlpack> Project mlpack - svn checkin test build #2024: SUCCESS in 1 hr 26 min: http://big.cc.gt.atl.ga.us:8080/job/mlpack%20-%20svn%20checkin%20test/2024/

22:40 < jenkins-mlpack> andrewmw94: point deletion. bug fix. more detailed test.