verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
benchmark has joined #mlpack
benchmark has quit [Client Quit]
Stellar_Mind has joined #mlpack
Stellar_Mind has quit [Ping timeout: 246 seconds]
Stellar_Mind has joined #mlpack
sumedhghaisas_ has quit [Ping timeout: 260 seconds]
govg has joined #mlpack
Stellar_Mind has quit [Ping timeout: 260 seconds]
Stellar_Mind has joined #mlpack
Stellar_Mind has quit [Ping timeout: 246 seconds]
govg has quit [Ping timeout: 246 seconds]
Stellar_Mind has joined #mlpack
gtank has quit [Ping timeout: 258 seconds]
keonkim has quit [Ping timeout: 258 seconds]
gtank has joined #mlpack
zoq_ has joined #mlpack
Stellar_Mind has quit [Ping timeout: 260 seconds]
zoq has quit [Ping timeout: 258 seconds]
keonkim has joined #mlpack
zoq_ is now known as zoq
keonkim has quit [Ping timeout: 240 seconds]
keonkim has joined #mlpack
govg has joined #mlpack
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Ping timeout: 260 seconds]
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Ping timeout: 260 seconds]
a-l-e has joined #mlpack
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Client Quit]
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Client Quit]
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Client Quit]
sumedhghaisas__ has joined #mlpack
sumedhghaisas__ has quit [Client Quit]
sumedhghaisas_ has joined #mlpack
sumedhghaisas_ has quit [Ping timeout: 260 seconds]
govg has quit [Ping timeout: 265 seconds]
govg has joined #mlpack
travis-ci has joined #mlpack
< travis-ci> mlpack/mlpack#1629 (master - 86689aa : Ryan Curtin): The build passed.
< radens> Hello, does mlpack implement AdaBoost + random forests? I have a Python scikit-learn script which I need to port to C++, and I'm evaluating libraries.
< rcurtin> radens: do you mean AdaBoost with random forests as the weak learners?
< radens> rcurtin: I believe so, but my knowledge of machine learning is weak.
< radens> Yes, that sounds right.
< rcurtin> mlpack does implement AdaBoost but not random forests (there is some experimental code for streaming random forests, but I doubt that will be useful)
< radens> In your opinion, can I change the weak learners for AdaBoost, or should I keep looking at other libraries?
< rcurtin> ok, so, for what it's worth, the whole idea of AdaBoost is that it does boosting with a bunch of weak learners, so it doesn't really make sense to use a strong learner instead
< rcurtin> (hang on, phone)
< radens> Yeah, I get that, but that's what the guy in research used, and I'd like to follow his lead. Take your time, I'll be around.
< rcurtin> ok, sorry for the delay, I'm back now
< rcurtin> typically _either_ adaboost or random forests are used, so it is strange to me that you would use both together
< radens> :)
< radens> I'm just the engineer with a monkey wrench.
< rcurtin> :)
< radens> Do you think I could use them together with this library, or is dlib a better option, or something else?
< rcurtin> I don't think that dlib has adaboost, but it may have random forests
< rcurtin> basically, the mlpack adaboost implementation allows you to specify, as a template class, the weak learner
< rcurtin> if you definitely need random forests to be used as your weak learner, then you would need to implement random forests in mlpack (semi-difficult and could be time consuming, I dunno what your timeframe is)
< rcurtin> and then you could plug them in like 'AdaBoost<RandomForest>' and the AdaBoost algorithm should work fine
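For reference, the template mechanism described here looks roughly like this with the perceptron weak learner that already ships with mlpack. This is a minimal sketch; the exact constructor signatures vary between mlpack versions, so check adaboost.hpp for the version you have installed:

    #include <mlpack/core.hpp>
    #include <mlpack/methods/adaboost/adaboost.hpp>
    #include <mlpack/methods/perceptron/perceptron.hpp>

    using namespace mlpack;

    int main()
    {
      arma::mat dataset;         // one point per column (mlpack's convention)
      arma::Row<size_t> labels;  // one label per column of dataset
      data::Load("train.csv", dataset, true);  // true: fatal on failure
      // ... load or compute labels here ...

      // A perceptron whose hyperparameters AdaBoost reuses on each round.
      perceptron::Perceptron<> p(dataset, labels, 2 /* classes */, 1000);

      // The weak learner is the template parameter; a RandomForest class
      // exposing the same interface could be plugged in the same way.
      adaboost::AdaBoost<perceptron::Perceptron<>> ab(dataset, labels, p,
          50 /* boosting rounds */);

      arma::Row<size_t> predictions;
      ab.Classify(dataset, predictions);
    }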
< radens> So mlpack doesn't implement random forests? Or do they just not fulfill the weak learner interface?
< rcurtin> unfortunately there aren't random forests in mlpack at the moment
< rcurtin> what we do have (which was actually written specifically to be an adaboost weak learner) are 'decision stumps'
< rcurtin> which are... part of the way to random forests but not very much :)
< radens> heh
< rcurtin> you could take the existing decision stump code and adapt it into a decision tree (I think someone has worked on this before, but I don't think the code was ever finished); writing random forests on top of that is then not so hard
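The contract a weak learner has to meet here is small: AdaBoost retrains it on every round against reweighted instances and asks it for predictions. A hypothetical RandomForest plug-in would need roughly the shape below. The class is illustrative (it does not exist in mlpack), and the assumption is that the required interface mirrors what Perceptron and DecisionStump already expose; the exact calls AdaBoost makes are in adaboost_impl.hpp:

    #include <mlpack/core.hpp>

    // Hypothetical weak learner: NOT an existing mlpack class.
    class RandomForest
    {
     public:
      // Retrain using per-instance weights; 'other' carries the
      // hyperparameters (e.g. number of trees) of the learner that
      // was originally handed to AdaBoost.
      RandomForest(const RandomForest& other,
                   const arma::mat& data,
                   const arma::Row<size_t>& labels,
                   const arma::rowvec& weights);

      // Write one predicted label per column of 'test'.
      void Classify(const arma::mat& test,
                    arma::Row<size_t>& predictions);
    };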
< radens> Are there other libraries I should look at first?
< rcurtin> looking at dlib, it doesn't seem to have either adaboost or random forests
< rcurtin> there's shogun, but I don't think it has either of those algorithms either (not 100% sure)
< rcurtin> there's also Ross Quinlan's C5.0 package, which is open source and implements a C5.0 decision tree and random forests
< rcurtin> if you were clever you could use the C5.0 decision trees from there, wrap an mlpack weak learner interface around it, and then use them together
< rcurtin> to be honest I think that might be your best bet
< rcurtin> probably C5.0 is a different type of decision tree than what your researcher used, but there are a lot of similarities so it should give comparable results
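A wrapper along those lines would follow the same interface sketch shown above: the wrapper's constructor would hand the training data to the C5.0 tree builder, and its Classify would run the resulting tree over each column of the test matrix. The weighting detail matters, since AdaBoost depends on the weak learner honoring per-instance weights when it retrains each round.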
< rcurtin> I suspect that even if you did use adaboost with a weak learner (like decision stumps or perceptrons, which are already in mlpack ready to go), you'd get comparable performance to adaboost which uses random forests as weak learners
< rcurtin> but I only mention that again because it could be a nice way to save a lot of potential work :)
< radens> rcurtin: thank you, I appreciate your help. I'll go run through a couple tutorials for both libraries and see about trying that.
< radens> I may just start with the stumps.
< radens> Oh, is C5.0 LGPL?
< rcurtin> sure, feel free to come ask for help if you have any problems
< rcurtin> well, that's the other issue, the license is weird for C5.0
< rcurtin> C5.0 is GPL only, so if you're in a company that's almost certainly a non-starter
< radens> Dang.
< radens> I mean, I'm a dyed-in-the-wool open source head, but that will go over like a lead balloon here.
< rcurtin> I know exactly what you mean
< rcurtin> I'm at Symantec and we've done some experiments with C5.0 here before, but whenever the word "GPL" comes out everyone runs
< radens> I'll focus on the weak learner for now and see how performance compares. Thanks!
< rcurtin> it took no small amount of paperwork just to get them to be ok with me contributing to mlpack too :)
< rcurtin> sounds good---like I said, feel free to ping with questions if you have any
< radens> See, that's the nice thing about startups: you don't have to worry about that sort of paperwork.
< radens> Thanks!
< rcurtin> :)
mentekid has joined #mlpack
< radens> rcurtin: Hey, I just installed mlpack and armadillo from homebrew (because the work box runs OS X) and tried to compile it like this and got a whole bunch of linker errors: https://pastebin.osuosl.org/43781/
< rcurtin> sure; I have to go get dinner now, but I'll be back later, and I (and others) check the logs here, so if you leave a question it'll be answered eventually
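For what it's worth, undefined-reference errors at link time in a setup like this usually mean the link line is missing the libraries themselves; with a Homebrew-installed mlpack and Armadillo, something like 'c++ -std=c++11 prog.cpp -o prog -lmlpack -larmadillo' is the usual first thing to try. That is a guess at the common cause; the actual errors in the paste may point elsewhere.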