#mlpack on 2016-05-31 — irc logs at libera.irclog.whitequark.org

2015-01-15 23:05 verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/

00:09 Mathnerd314 has quit [Quit: No Ping reply in 180 seconds.]

00:09 Mathnerd314 has joined #mlpack

02:31 tham has joined #mlpack

02:34 < tham> nilay zoq : I saw your discussion on view_as_windows

02:38 < tham> There are something quite confuse

02:38 < tham> for me

02:39 < tham> I type following codes

02:39 < tham> A = np.arange(3*4*4).reshape(3,2,8)

02:40 < tham> The results looks like 3 channels, 2 rows, 8 columns

02:40 < tham> but not 3 rows, 2 columns, 8 channels

02:43 < tham> The whole results are place at http://pastebin.com/mXJptgqe

02:46 < tham> It do not looks like the codes try to extract 16*16 pixels patch from the input images(I think it is 32*32 from the paper)

02:51 < tham> Am I interpret the codes with the wrong way?

02:52 < tham> keonkim : Hi, do you still have problem about issue #658?

02:54 < tham> I misunderstand the codes; python codes extract features at different locations(but still work); this is the intention of paper?

03:32 < rcurtin> tham: I think maybe Keon is right, it does not make sense to me why he is able to produce a dimension with only one mapping when the dimension takes more than one value

03:32 < rcurtin> so maybe there is a bug in what I wrote... not sure, need to look more

03:32 < rcurtin> I thought I wrote a lot of test cases for DatasetInfo but maybe not enough :)

03:33 < rcurtin> but it is late and I need to sleep, so I will look more tomorrow and comment on the issue (unless you and keon solve the issue before I get to it :))

03:33 < tham> rcurtin : i think it works if you do not tranpose the matrix

03:34 < tham> i ma not sure about the behavior after traspose

03:34 < tham> i can look into the codes and contribute the later on

03:35 < tham> what would you expect the behaviour after transpose?

03:45 < rcurtin> hm good point this may have to do with the transposition being incorrectly handled...

03:45 < rcurtin> let me look into it tomorrow morning, I need to go to bed for now

03:45 < tham> ok, I do not how to fixed it before I confirm the correct behavior after transpose

03:57 nilay has joined #mlpack

04:53 < nilay> tham: the codes calculate features at smp_loc locations for p_size * p_size patch.

05:08 < tham> nilay : I think you are correct

05:09 < tham> I check the doc of opencv, it is the right way to access the pixels

05:09 < tham> thanks

05:46 Mathnerd314 has quit [Ping timeout: 240 seconds]

06:48 < keonkim> tham: thanks for the response! I will further investigate :)

06:49 < keonkim> sorry for the delayed response by the way :/

07:19 < tham> keonkim : don't mind, I am not always online :)

07:20 < tham> debugging

07:20 < tham> load_impl.hpp

07:43 < tham> rcurtin keonkim : I think I fixed the problem, but only for non transpose matrix

07:44 < tham> Deal with transpose matrix is more difficult because it is unnatural to read the data by column

07:45 mentekid has joined #mlpack

07:46 < tham> I think the easiest solution to deal with tranpose file are

07:46 < tham> 1 : read the whole file into a 2d matrix

07:47 < tham> 2 : write the whole file into another temporary file with transpose format

07:49 < tham> 3 : Remove transpose option, provide a file transpose api for the users, this way they could transpose the file, save it, and load the data

07:56 < tham> I put the codes at https://github.com/stereomatchingkiss/mlpack/blob/fix_mapping_issue/src/mlpack/core/data/load_impl.hpp

07:57 < tham> This would not work if you read the matrix with transpose option as one

07:57 < tham> if you turn on transpose option

08:00 < tham> This function is quite long, I suggest we could put some implementation details in the details namespace

08:01 < tham> I have not open the pull request yet

08:01 < tham> because I am not sure how do you want to deal with transpose option?

08:03 < tham> keonkim : you can copy and paste the codes to implement your algo first if you like

08:29 < keonkim> tham: thanks!

08:45 < tham> keonkim : you are welcome, please tell me if there are bugs

09:03 tham has quit [Quit: Page closed]

09:15 nilay has quit [Quit: Page closed]

09:18 boby has joined #mlpack

09:56 boby has quit [Quit: Page closed]

10:54 < mentekid> rcurtin: I am not convinced that my code is actually correct... I just noticed a weird behavior.

10:55 < mentekid> If you print the resulting additionalProbingBins just before exiting GetAdditionalProbingBins, you get the same bin over and over again

10:56 < mentekid> the first 4 or 5 are different but when I request 10 I get 4-5 different ones followed by the same over and over again

11:49 < mentekid> I am probably creating the perturbation vectors wrong

12:40 < rcurtin> mentekid: yeah, I did not really look into the correctness of the code at all yet since there are no tests

12:40 < rcurtin> I think that is your goal for this week if I remember right

12:42 < mentekid> I have implemented a test but apparently it isn't sufficient

12:42 < mentekid> I though I had pushed it

12:42 < mentekid> anyway yeah I had hoped the code would be correct so I could start thinking about optimizations and parllelization, but it's debugging time

13:15 < rcurtin> I thought that the test was for the recall calculation, not the multiprobe

13:15 < rcurtin> maybe I misread the test

13:17 < mentekid> no I ran multiprobe with increasing numProbes (for the same LSH object, so the tables didn't change)

13:17 < mentekid> and expected the recall to improve or stay the same

13:17 < mentekid> but the test passed without the code being correct because that's not a serious requirement... So it didn't really catch the bug I just discovered

13:18 < rcurtin> oh, okay

13:18 < rcurtin> I read it on a phone in the back of a car so it is not surprising that I misunderstood :)

13:19 < mentekid> I think before going onto parallelization I should implement get/set functions for the projection tables, that way we can set our own and make better tests

13:20 < mentekid> I just corrected the bug I mentioned (at least, I think I did) and there were so many (little) mistakes I'm surprised it even ran at all

13:21 < rcurtin> yeah, functions to access the projection table are fine... I thought you had already done that earlier?

13:21 < mentekid> yeah I did that but in a pretty hacky way just for the needs of my thesis

13:21 < mentekid> I should do it properly, so the table sizes etc are checked

13:21 < mentekid> and the functions documented

13:23 < mentekid> I think if I try to parallelize stuff without better tests I will break the universe :P

13:25 < rcurtin> yeah, always better to do tests first :)

13:26 < rcurtin> although testing a parallelized version is not too hard, you can often uncover a lot of bugs by just running with different numbers of threads and checking the output to ensure it is the same

13:26 < rcurtin> but actually I guess in this case since LSH is randomized you would need to set the same seed... but I am not sure of the behavior of the C++ RNGs when using multiple threads

13:26 < rcurtin> so maybe that idea will not work

13:38 < mentekid> actually I think you are right

13:39 tham has joined #mlpack

13:39 < mentekid> if I only parallelize the search part

13:39 < mentekid> then I don't need to re-train, so randomness doesn't affect me

13:40 < mentekid> but that would require having an argument in Search() that defines number of threads (instead of automatically defining them)

13:40 < mentekid> so I could run Search twice, once with numThreads=1 and once with NumThreads = 4 (for example) and check that results are the same

13:40 < mentekid> that should work

13:41 < rcurtin> I'd use openmp and then your user can just set the number of threads with OMP_NUM_THREADS as an environment variable

13:41 < mentekid> oh yeah that's true

13:42 < mentekid> I forgot about that

13:42 < mentekid> but can you also set that from the boost testcases?

13:44 < rcurtin> omp_set_num_threads(int);

13:45 < rcurtin> but, openmp is an optional dependency for mlpack, so any test case should probably be surrounded by #ifdef _OPENMP or something like this

13:45 nihajsk has joined #mlpack

13:45 < mentekid> Cool! I have completely forgotten openMP... I should refresh before starting the implementation

13:46 < mentekid> yeah that makes sense

13:49 < mentekid> did you see my comment about the low recall thing?

13:49 < mentekid> have you also noticed something or is it my code? Though I did test the master branch as well and I think I got similar results

13:51 < mentekid> actually no, wait... That's even weirder... Running with -K 1 returns much lower recall than -K 10

13:51 < mentekid> -K is the number of hash functions per table so larger should mean fewer neighbors found... and L should increase it

13:52 Mathnerd314 has joined #mlpack

13:52 < rcurtin> I am not too surprised about the low recall, I have found in my simulations with LSH that you have to tune it with very large bins to get good neighbors

13:53 < rcurtin> but unfortunately I have to go for a little while so I can't help dig in right now... I will be back in some hours

13:53 < rcurtin> I'll try and check in on my phone while I am out

13:53 < mentekid> I'll try and figure it out too, we can talk later

13:53 < mentekid> thanks for the help :)

13:53 < rcurtin> sure, sorry I am flaky today :(

13:55 < tham> rcurtin : what is your recommendation for issue #658?

13:55 < tham> The matrix after transpose is uneasy to deal with

13:57 < rcurtin> tham: we have to support transposing, because users generally have their data in row-major form but we need it in column major form

13:57 travis-ci has joined #mlpack

13:57 < travis-ci> mlpack/mlpack#857 (master - e36eec5 : Ryan Curtin): The build passed.

13:57 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/1dad2b662d59...e36eec5cb250

13:57 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/134165833

13:57 travis-ci has left #mlpack []

13:57 < rcurtin> I was writing some tests based on Keon's example matrices, but I have to step out, can't commit them yet

13:57 < rcurtin> I saw your commit, I think isdigit() is not applicable in this case because "1.000e+04" is a valid number that can be casted, but will fail the isdigit() test

13:58 < rcurtin> I think the best way to test is to try to extract it into an eT, like "val << token" in the original code

13:58 < rcurtin> when I get back I will finish those test cases

13:59 < tham> rcurtin : you are right, did not notice that

14:16 wasiq has joined #mlpack

14:16 < tham> rcurtin : I can't find a memory efficient way to parse the matrix after transpose, it has to record the data to find out which "row" is numerical vs category

14:17 < tham> I will give the dumbness solution a try first--parse all of the lines and transpose them

14:18 marcosirc has joined #mlpack

15:31 < tham> rcurtin keonkim : finish the dumb solution, now it can support transpose--https://github.com/stereomatchingkiss/mlpack/blob/fix_mapping_issue/src/mlpack/core/data/load_impl.hpp

15:32 < tham> At first I parse all of the row tokens into a std::vector<std::string>

15:33 < tham> After that I transpose the row one by one

15:33 < tham> If the data do not need to transpose, I will read it from the file row by row

15:34 < tham> After rcurtin finish the test case, I will run those test cases one by one

15:38 < tham> fix format--https://github.com/stereomatchingkiss/mlpack/tree/fix_mapping_issue/src/mlpack/core/data

15:40 < tham> however, if rcurtin fixed the issue already, this fix can be omitted

15:55 sumedhghaisas has joined #mlpack

15:58 < rcurtin> tham: I didn't make any fixes, just a test case

15:59 < tham> rcurtin : Then I am waiting for your test cases, atleast old cases work

15:59 nilay has joined #mlpack

15:59 < tham> I run the examples of keonkim, it works too

16:02 < nilay> zoq tham: Hello, in the python codes, in the training data, does one mat file corresponds to one image?

16:03 < tham> nilay : please give me a few minutes, I need to peek at the codes, but in normal case

16:03 < tham> yes

16:04 < nilay> tham: i am talking about this: https://github.com/ArtanisCV/StructuredForests/tree/master/toy/BSDS500/data/groundTruth/train

16:06 < zoq> nilay: The mat file contains the segmentations and boundaries for one particular image, not the image itself.

16:07 < tham> sorry, misunderstand

16:07 < nilay> ok, but it is very confusing, for one image of size (x,y,z) the boundary must be of size (x,y).

16:08 < zoq> nilay: that's right, size size of the boundary and segmentation is (x,y)

16:09 < zoq> nilay: *the size

16:09 < nilay> but here, when he writes in prepare_data function, this loop: for j, (img, bnds, segs) in enumerate(input_data): bnds is a list of 6 or 7 matrices each of size (x,y) (which is the size of image, without the channels).

16:10 < zoq> nilay: yeah, one mat file could contain more than one segmentation and boundary

16:11 < zoq> nilay: But I think the python code just uses the first segmentation and boundary

16:14 < zoq> nilay: Here is an example: http://www.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/BSDS300/html/dataset/images/color/159029.html

16:14 < zoq> nilay: as you can see the dataset contains 6 segmentation matrices.

16:15 < nilay> zoq but in that loop img is one image only, of size (x,y,z)

16:15 < nilay> i tried printing img.shape

16:16 < nilay> zoq: so in the dataset you generated, you have also taken first boundary?

16:18 < nilay> or how can we know

16:19 < zoq> nilay: right, so you have your input image of size (x,y,z), and the mat file contains all segmentations and boundary for that particular image with size of (x, y) but the mat file often contains more than one boundary and segmentation for one particular image, because the authors who generated the dataset tested different parameters e.g. to get more fine-grained segmentations.

16:19 < zoq> nilay: yes, right the dataset I generated only contains the first segmentation nd boundary

16:20 < nilay> zoq: ok then, its all good. :)

16:22 < zoq> nilay: https://gist.github.com/zoq/814de0589c9d0820ca02538416beb560: 'small_images.csv' does contain two images, 'small_segmentation_1.csv' does contain two segmentations one segmentation for each image

16:27 < nilay> zoq: yes that is ok.

16:37 < mentekid> rcurtin: I fixed the bugs, tested everything by hand/eye. It should be working properly now

16:39 < mentekid> I also tested some other datasets, miniboone and phy no weird behavior regarding numProj. I'll test more to see if CorelColorHistogram is alone in this

16:57 sumedhghaisas has quit [Ping timeout: 260 seconds]

16:57 < tham> rcurtin : I think I can write the tests if you don't mind, based on keonkim examples

16:57 < tham> it should be done within a few hours

17:22 < rcurtin> tham: sure, go ahead, I am almost done with lunch and will check in my code theb

17:22 < rcurtin> we can combine our tests

17:23 < tham> rcurtin : ok, I am studying the codes of structure random forest, will finish the test later on

17:23 < rcurtin> sure, sounds good

17:24 < rcurtin> are you considering an RF implementation for mlpack?

17:24 < rcurtin> I have implemented a random forest built on hoeffding trees but I have not finished testing yet

17:28 < nilay> rcurtin: yes we need to implement RF for edge boxes algorithm

17:28 < zoq> rcurtin: The structure random forest, is one part of the edge boxes method nilay implements to get some ROI's. If you already have an random forest in place, based on the hoeffding trees that's even better, that would come in handy.

17:29 < zoq> rcurtin: The plan was to use the hoeffding tree anyway.

17:32 < zoq> rcurtin: Or the modified decision stump by Cloud, I think, either one of this two should work.

17:36 nihajsk has quit [Ping timeout: 244 seconds]

17:36 tsathoggua has joined #mlpack

17:36 tsathoggua has quit [Client Quit]

17:37 nihajsk has joined #mlpack

17:41 < rcurtin> zoq: maybe the decision stump is better, the Hoeffding random forest is not working anywhere near as well as Breiman's typical random forest (which is what scikit implements)

17:42 nihajsk has quit [Ping timeout: 260 seconds]

17:45 < rcurtin> my observations so far are that the Hoeffding random forest sometimes doesn't even outperform a single Hoeffding tree

17:45 < zoq> rcurtin: hm, okay, either way, it would be great to see the code, maybe there is an easy way, to make it work with the decision stump?

17:45 < rcurtin> I need more time to look into this, but right now I don't have any time for it, so I don't know when I'll be able to look into that further

17:45 < rcurtin> https://github.com/rcurtin/mlpack/tree/vfdt/src/mlpack/methods/hoeffding_trees

17:45 < rcurtin> you could definitely use the same ideas I've used there, but you'd need to refactor the decision stump

17:46 < rcurtin> the most important change I had to make was to add a template parameter "SplitSelectionStrategyType", which basically encodes the splitting dimensions the tree is allowed to consider

17:47 < rcurtin> so for Breiman's random forest with single randomly chosen dimension, you use SingleRandomDimensionSplit, which only gives the tree one dimension to split on

17:47 < rcurtin> but if you are using a default Hoeffding tree, AllDimensionSplit is used, which lets the tree split on any dimension

17:50 < zoq> rcurtin: okay great, I'll take a look at the code in the next days, and probably come back with some questions.

17:50 < rcurtin> yeah, it is not production-quality, so maybe there will be many questions to answer :)

17:51 < rcurtin> I still have not fully documented it, because my goal was to get simulations working quickly to see if the idea worked, and at the last point I had time to work on it, it did not work very well

17:52 < rcurtin> I didn't know Cloud wrote a modified decision stump, is his code available anywhere?

17:52 < rcurtin> I thought his ideas were great and would be good changes, I just didn't know if he actually implemented those changes

17:53 < rcurtin> mentekid: it's possible the weird behavior of CorelColorHistogram has to do with some odd properties of the dataset

17:53 < rcurtin> that dataset I think is somewhat high-dimensional but has clusters in it. that doesn't explain why more hash functions would result in better recall, though, unless the hash width is also getting larger when the number of hash functions increases

17:55 < zoq> rcurtin: The last time I check there was some code: https://mailman.cc.gatech.edu/pipermail/ mlpack/2016-April/000991.html ... looks like he delete his fork

17:56 < rcurtin> yeah, I remember seeing that code, I thought it was just a stub outline of the changes he wanted to make, with no actual implementation

17:57 < rcurtin> I guess we could email him and ask if he wrote anything, but I don't see anything else on his github page

17:57 < zoq> rcurtin: I think it worked as a proof of concept.

18:03 < rcurtin> ok, I did not remember that bit... maybe he still has it on his desktop or something... hopefully...

18:04 < zoq> rcurtin: let's see , I'll go and write him an email

18:04 nihajsk has joined #mlpack

18:05 nilay has quit [Ping timeout: 250 seconds]

18:17 nihajsk has quit [Ping timeout: 260 seconds]

18:19 nihajsk has joined #mlpack

18:34 < rcurtin> tham: you beat me to it, sorry I took so long... I was still writing some other test cases but I think you have already written them :)

18:34 < mentekid> rcurtin: Should I make the changes regarding access to LSH hash tables in the same branch as Multiprobe?

18:34 < rcurtin> mentekid: your call, I can easily merge those immediately

18:35 < mentekid> Will there be no conflict if I have 2 different versions one with multiprobe and one with accessors?

18:35 < mentekid> I am confused about how git handles that

18:35 < mentekid> Or I guess if it's different parts of the file it's no problem

18:36 < tham> rcurtin : build fail, need to find the reasons

18:36 < mentekid> since I'm not changing the same function in two different ways or anything

18:36 < rcurtin> mentekid: git might be smart enough to merge it well, but if not, I can figure it out, I am an expert at git merges :)

18:36 < mentekid> cool then, I'll start a branch so we can keep each one focused :)

18:36 < rcurtin> or I guess, do you mean, that you would have the same commit in two different branches?

18:36 < rcurtin> I'd need to merge that by hand, but that's easy to do, I just merge all the commits except the duplicate one

18:37 < rcurtin> github's interface won't do that automatically I don't think, but that's no problem

18:37 < mentekid> I was thinking about starting from master again

18:37 < mentekid> which one is simpler for you?

18:37 < rcurtin> tham: mark TransposeTokens() inline :)

18:38 < rcurtin> mentekid: all the changes in one branch are easier, but I mean, for me they are both pretty simple, so it's no issue to start a new branch from master---go ahead and do that

18:38 < mentekid> ok cool!

18:50 < mentekid> rcurtin: Also, I found the reason behind the weird numProj behavior

18:51 < mentekid> the second hash size is to blame... the default value is 500, so (2nd level) buckets with more points are just filled to capacity and ignore new ones

18:51 < tham> rcurtin : thanks, I forgot the declaration and definition issue

18:51 < rcurtin> tham: no problem. I am adapting one of the tests I wrote to make a more difficult one that you can add to the list of tests

18:51 < rcurtin> I'll open a PR for it when it's done

18:52 < mentekid> so setting it to 1 creates one huge bucket with (theoretically) 68000 points and then only keeps the first 500 of these

18:54 < rcurtin> ahh, okay, this makes sense

18:55 < rcurtin> but I guess, shouldn't there be more than one bucket if we set numProj = 1?

18:55 < rcurtin> or I guess, is it the case that there are many buckets, but each end up with more than 500 points regardless?

18:56 < mentekid> I'm not sure how many buckets are created to be honest, I think that's random

18:57 < mentekid> but if it's less than N/500 then you can expect some of them to be overfilled

18:57 < mentekid> and the spillover points are ignored

18:57 < mentekid> at least, I think that's what is happening...

18:58 < rcurtin> do you think it's worth the time to check?

18:58 < rcurtin> if it takes a while, maybe it is not worth it for just this dataset

18:59 < mentekid> I would expect it to behave similarly for all big datasets

19:00 < mentekid> I didn't run such extreme values for phy so maybe that's why I didn't see it

19:00 < rcurtin> yeah, I agree with your explanation

19:00 < mentekid> I'll run a few big ones at night so we can have a better idea. And I'll look at the code to see where that is happening

19:00 < rcurtin> it might be worth thinking about what better defaults for secondHashSize are, but like I said maybe not worth investigating... we should limit the number of rabbit holes we crawl down :)

19:01 < mentekid> well I enjoy this so much that I'll probably end up exploring it at some point

19:01 < mentekid> I don't know why I'm so infatuated with this algorithm :P

19:02 < mentekid> I think maybe replacing a "magic" 500 with some heuristic relative to reference size and numProj could work

19:02 tham has quit [Quit: Page closed]

19:02 < mentekid> but maybe we should leave that for after the tuning algorithm is done - I'm not sure but they could have a model for the second hash size as well

19:03 < mentekid> quick question - assert doesn't fail on non-debug builds does it?

19:03 < mentekid> so if a user gives me wrong table sizes and I assert they are correct it won't stop them?

19:04 < rcurtin> right

19:05 < rcurtin> when compiling without debug symbols, assert (or Log::Assert) is not called

19:05 < mentekid> ok then I guess it should throw an exception instead

19:05 < rcurtin> Log::Assert is nice because it gives a backtrace

19:05 < rcurtin> yeah, I'd go with a std::invalid_argument

19:12 travis-ci has joined #mlpack

19:12 < travis-ci> mlpack/mlpack#861 (master - 02e31b3 : Ryan Curtin): The build passed.

19:12 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/e36eec5cb250...02e31b3b07f1

19:12 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/134251158

19:12 travis-ci has left #mlpack []

19:27 nihajsk has quit [Ping timeout: 276 seconds]

19:45 < mentekid> does running make mlpack_lsh recompile the lib/ stuff as well? or do they remain the same?

19:51 < rcurtin> depends on whether or not any .cpp file changed

19:51 < rcurtin> if no .cpp file changed, then libmlpack.so is up to date and there is nothing to update there

19:51 < mentekid> what if I changed .hpp files

19:51 < rcurtin> (this is with the exception of, say, lsh_search_main.cpp, since that is not compiled into libmlpack.so)

19:52 < rcurtin> if anything that includes those .hpp files is a .cpp file that is compiled into libmlpack.so, then that will need to be recompiled

19:52 < rcurtin> so like if you change prereqs.hpp, I think everything will be recompiled

19:52 < rcurtin> but if you change something in lsh_search.hpp I don't think anything will need to be recompiled in libmlpack.so, just in mlpack_test and the mlpack_lsh program

19:53 < mentekid> I've written a custom cpp file that uses lsh_search.hpp. I compile it with g++ accesstables.cpp -L lib/libmlpack.so (and flags)

19:53 < mentekid> accesstables.cpp is my file

19:53 < mentekid> now I want to change something in lsh_search.hpp and re-compile libmlpack.so

19:54 < mentekid> so that accesstables "sees" the update

19:54 < rcurtin> so there is one other detail there that is important... whenever you make anything, all the header files in src/ need to be copied to <build-directory>/include/

19:54 < rcurtin> you can do that by 'make mlpack_headers'

19:54 < rcurtin> but also 'make mlpack' will do that, because the mlpack target depends on mlpack_headers

19:54 < mentekid> aha

19:55 < rcurtin> if you modify lsh_search.hpp and then type 'make mlpack', then it should show that it is copying all the headers files with the mlpack_headers target, but then it will not actually compile anything for the mlpack target

19:55 < mentekid> so in order to include my updated version I need to run make mlpack_headers

19:55 < rcurtin> yes, that should do it

19:55 < mentekid> I didn't know that, thanks :)

19:55 < rcurtin> no problem, glad I could help

19:55 < rcurtin> maybe it is worth collecting little things like this and writing them up somewhere

19:56 < rcurtin> there are lots of little undocumented build system tricks like this in mlpack

19:56 < rcurtin> but I dunno how to make all of that useful to someone new to the project... if you present them with a big list of "little tricks" they might not remember any of them because they had not encountered any of the issues where those tricks are relevant yet

19:57 < mentekid> Could some "sample cases" tutorial or wiki help?

19:58 < mentekid> like "Recompiling the library after changing updating header files"

19:58 < mentekid> etc

19:58 < rcurtin> yeah, maybe like some developer FAQ or something like that

19:58 < rcurtin> either as a wiki page or in doc/

19:59 < mentekid> by the way I still can't get it to work... I tried both make mlpack and make mlpack_headers

19:59 < mentekid> it uses the old version of the .hpp file which doesn't throw exceptions

20:00 < rcurtin> can you give your full g++ invocation?

20:00 < rcurtin> do you have -Iinclude/ ?

20:00 < rcurtin> if not it is probably searching in /usr/include/ or /usr/local/include/

20:00 < mentekid> no I compile with g++ accesstables.cpp -L lib/libmlpack.so -lmlpack -larmadillo --std=c++11

20:00 < rcurtin> okay, try adding -I include/, I think that may fix your issue

20:01 < mentekid> ah yes

20:01 < mentekid> perfect

20:01 < mentekid> I thought it was the libmlpack.so file that was kept old

20:02 < rcurtin> nope; actually, the way you were doing it could cause some really big weirdness to happen

20:02 < rcurtin> if it had compiled

20:02 < rcurtin> you would have ended up with a program that was built using headers in /usr/include/ which were maybe old, but implementations in lib/libmlpack.so which might work differently

20:03 < rcurtin> like for instance if, e.g., ARMA_64BIT_WORD was for some reason set in the /usr/include/ version of mlpack but not in the one you built

20:03 < rcurtin> then when you ran it, all calls to anything in libmlpack.so would be with the wrong size arma::uword, and then you would have a difficult-to-debug disaster :)

20:03 < mentekid> ahhh so the whole problem was that I have an installed version of libmlpack

20:03 < rcurtin> yeah, that's not necessarily a problem, but it is something to be aware of :)

20:04 < mentekid> I'm not used to build systems so it only makes sense after you tell me

20:05 < rcurtin> yeah, CMake can be quite complex

20:17 < mentekid> Ok I finished the code, I'll start a PR so you can see what I've done. It's only a few lines of code really

20:24 < rcurtin> sounds good

20:38 marcosirc has quit [Quit: WeeChat 1.4]

20:54 nilay has joined #mlpack

20:55 < rcurtin> mentekid: added comments, did not expect to add so many. I think many of the comments have to do with Pari's original design, not your changes... let me know what you think

21:22 nilay has quit [Ping timeout: 250 seconds]

21:27 travis-ci has joined #mlpack

21:27 < travis-ci> mlpack/mlpack#866 (master - e6d2ca7 : Ryan Curtin): The build passed.

21:27 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/02e31b3b07f1...e6d2ca7bf64b

21:27 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/134285822

21:27 travis-ci has left #mlpack []

21:49 mentekid has quit [Ping timeout: 240 seconds]

22:27 benchmark has joined #mlpack

22:27 benchmark has quit [Client Quit]

22:28 < zoq> welcome back, I thought they blocked the port

22:52 < rcurtin> that's very odd, I wasn't informed that they had changed anything

22:55 < zoq> rcurtin: Maybe they never blocked the irc port, I just thought they do, because of the python irc error.