verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
travis-ci has joined #mlpack
< travis-ci> mlpack/mlpack#239 (master - bbe9cd1 : Ryan Curtin): The build has errored.
< rsv> Hi everyone, I'm trying to implement logistic regression via lasso using the C++ API. I've already managed to get an example working without lasso using the mlpack::regression::LogisticRegression< OptimizerType > class template. I'm confused about how to use mlpack::regression::LARS in the same way.
< naywhayare> rsv: hi there. do you mean that you want to train a logistic regression model with an L1 (and possibly L2) penalty, like LASSO?
< naywhayare> I'm not sure I completely understand what you're trying to do
rsv has quit [Ping timeout: 246 seconds]
rsv has joined #mlpack
< rsv> yes that's exactly what i want to do
< rsv> but i am not sure how to use mlpack::regression::LARS in conjunction with mlpack::regression::LogisticRegression, or if that's even the right idea
< naywhayare> rsv: I don't think it will be too hard to do, but you'll have to either modify or copy some code
< naywhayare> in src/mlpack/methods/logistic_regression/, there are two files, 'logistic_regression_function.hpp' and 'logistic_regression_function.cpp'
< naywhayare> these two files define the LogisticRegressionFunction, which is what the optimizer uses to learn the model
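[For reference, a minimal sketch of that relationship, assuming the mlpack 1.x C++ API (exact signatures may differ across versions): the optimizer interacts with the function only through Evaluate(), Gradient(), and GetInitialPoint().]

    #include <mlpack/core.hpp>
    #include <mlpack/methods/logistic_regression/logistic_regression_function.hpp>
    #include <mlpack/core/optimizers/lbfgs/lbfgs.hpp>

    using namespace mlpack;

    int main()
    {
      // Toy data: 3 dimensions, 4 points (one column per point), 0/1 labels.
      arma::mat predictors(3, 4, arma::fill::randu);
      arma::vec responses("0 0 1 1");

      // The third constructor argument is the L2 penalty (lambda).
      regression::LogisticRegressionFunction f(predictors, responses, 0.1);

      // L-BFGS sees only the function interface, not the model class.
      optimization::L_BFGS<regression::LogisticRegressionFunction> lbfgs(f);
      arma::mat parameters = f.GetInitialPoint();
      lbfgs.Optimize(parameters);

      return 0;
    }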
< rsv> okay
< naywhayare> right now an L2 penalty parameter can be specified in LogisticRegressionFunction, but if you want an L1 penalty, you'll have to modify the code
< rsv> so i have to put the lambda1=0 penalty in the LogisticRegressionFunction by hand basically?
< naywhayare> if you take a look at the Evaluate() function, you can see the objective is the loss plus a regularization term
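[Schematically, with sigma(z) = 1 / (1 + e^{-z}) and parameters theta, the objective being described is the logistic loss plus an L2 term (the exact scaling and treatment of the bias term may differ in the code):]

    f(\theta) = -\sum_i \left[ y_i \log \sigma(\theta^T x_i)
                + (1 - y_i) \log(1 - \sigma(\theta^T x_i)) \right]
                + \frac{\lambda}{2} \|\theta\|_2^2

[A LASSO-style variant would add a \lambda_1 \|\theta\|_1 term to this objective.]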
< naywhayare> wait, are you planning to use lambda1 = 0?
< naywhayare> if so, the code already does what you need; it has L2 regularization
< rsv> isn't lambda1 the L1 penalty that lasso uses?
< naywhayare> oh, right, sorry
< naywhayare> in that case I think you are set; just use the LogisticRegression class or the logistic_regression program with lambda set to the lambda2 value you wanted to use
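[In code, that suggestion looks roughly like this, again assuming the 1.x API, where the third constructor argument is the L2 penalty and L-BFGS is the default OptimizerType:]

    #include <mlpack/core.hpp>
    #include <mlpack/methods/logistic_regression/logistic_regression.hpp>

    using namespace mlpack::regression;

    int main()
    {
      arma::mat predictors(3, 4, arma::fill::randu); // one column per point
      arma::vec responses("0 0 1 1");                // 0/1 class labels

      // Train an L2-regularized model; 0.5 here plays the role of lambda2.
      LogisticRegression<> lr(predictors, responses, 0.5);

      return 0;
    }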
< rsv> what is the default setting? l1 and l2 > 0?
< naywhayare> the default for logistic regression is that there is no support for L1 regularization at all (so l1 == 0), and the L2 penalty is 0 by default but can be set to another value
< naywhayare> the LARS class is a regression algorithm that assumes a linear model, like Y = AX
< naywhayare> but the LogisticRegression class is a different model
< naywhayare> at least in the implementations here, there is no relation between the two
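[For contrast, LARS is used on its own for penalized linear regression; a sketch under the same 1.x API assumptions, where lambda1 is the L1 penalty and lambda2 the L2 penalty:]

    #include <mlpack/core.hpp>
    #include <mlpack/methods/lars/lars.hpp>

    using namespace mlpack::regression;

    int main()
    {
      arma::mat data(3, 10, arma::fill::randu);   // predictors
      arma::vec responses(10, arma::fill::randu); // real-valued targets

      // LASSO corresponds to lambda1 > 0 and lambda2 == 0.
      LARS lars(true /* useCholesky */, 0.3 /* lambda1 */, 0.0 /* lambda2 */);
      arma::vec beta;
      lars.Regress(data, responses, beta); // fits the linear model y = X * beta

      return 0;
    }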
< rsv> ohhh, okay
< rsv> this makes a lot more sense
< naywhayare> yeah, so I think in your case, getting LASSO-like training for logistic regression just means putting the L1 penalty parameter into LogisticRegressionFunction
< rsv> right, basically implementing something similar to the way the code already handles the optional L2 regularization
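[A sketch of that change, with the penalty pulled into a hypothetical helper for clarity; lambda1 and the helper name are illustrative, not part of mlpack:]

    #include <armadillo>

    // Mirrors the structure of the regularization computed in
    // LogisticRegressionFunction::Evaluate(): the existing L2 term plus
    // a new L1 term for LASSO-like behavior.
    double Regularization(const arma::mat& parameters,
                          const double lambda2,  // existing L2 penalty
                          const double lambda1)  // new L1 penalty
    {
      const double l2 = 0.5 * lambda2 * arma::accu(arma::square(parameters));
      const double l1 = lambda1 * arma::accu(arma::abs(parameters));
      return l2 + l1;
    }

[One caveat: the L1 term is not differentiable at zero, so Gradient() would need a subgradient (e.g. lambda1 * sign(theta)) or an optimizer that can handle the nonsmooth term.]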
< naywhayare> yeah
< rsv> perfect, thanks for the help
< naywhayare> sure, let me know if you have any issues
< naywhayare> one thing to note is that there are two overloads of Evaluate() and Gradient()
< naywhayare> one is meant to give the objective or gradient on the entire dataset (for optimizers like L-BFGS)
< rsv> okay
< naywhayare> and the other, which also takes the index of a point, is meant to give the objective or gradient on only a single point in the dataset (for optimizers like SGD)
< naywhayare> if you take a look at the code, it should make sense
< rsv> okay, i see that in logistic_regression_function.cpp
< naywhayare> yeah; just something to be aware of. if you don't implement an L1 penalty in both overloads, then the L1 penalty will only work for one type of optimizer
< naywhayare> (of course, if you only plan to use SGD, then you only need to implement the L1 penalty for the overloads that take an index :))
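[For reference, the two pairs of overloads being described look roughly like this (sketched from the discussion above; exact qualifiers may vary by version). An L1 penalty would need to be added to both pairs to work with both kinds of optimizer:]

    // Objective and gradient over the whole dataset (batch optimizers
    // such as L-BFGS).
    double Evaluate(const arma::mat& parameters) const;
    void Gradient(const arma::mat& parameters, arma::mat& gradient) const;

    // Objective and gradient for the single point with index i
    // (stochastic optimizers such as SGD).
    double Evaluate(const arma::mat& parameters, const size_t i) const;
    void Gradient(const arma::mat& parameters, const size_t i,
                  arma::mat& gradient) const;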