#mlpack on 2019-05-09 — irc logs at libera.irclog.whitequark.org

2018-11-12 22:39 ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"

00:52 Mulx10 has joined #mlpack

00:59 < Mulx10> sreenik, gmanlan, jeffin143, zoq, rcurtin : using '#ifdef OPENCV_INSTALLED' is a good idea.

01:00 < Mulx10> Also, I don't understand what you mean by OpenCV converter?

01:01 < Mulx10> However, if introduction of optional dependency is fine, I shall go ahead with it.

01:04 < rcurtin> Mulx10: I mean that if there is already some way, without implementing a special function, to load data in OpenCV and then convert it to arma::mat, then there is no need for us to implement a function
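
The optional-dependency idea mentioned above can be sketched as a compile-time guard. This is only an illustration: `OPENCV_INSTALLED` is the macro named in the discussion, but how it would be defined (for example by the build system) is an assumption, and `ImageBackend()` and `kHasOpenCV` are hypothetical names that do not come from mlpack.

```cpp
#include <cassert>
#include <string>

// OPENCV_INSTALLED is the macro named in the chat; this sketch assumes the
// build system defines it when OpenCV is found.
#ifdef OPENCV_INSTALLED
constexpr bool kHasOpenCV = true;
#else
constexpr bool kHasOpenCV = false;
#endif

// Hypothetical helper: reports which image-loading path was compiled in.
std::string ImageBackend()
{
#ifdef OPENCV_INSTALLED
  return "opencv";
#else
  return "none";
#endif
}
```

Code that depends on OpenCV would sit inside the same kind of guard, so a build without OpenCV still compiles and simply lacks that feature.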

01:12 Mulx10 has quit [Ping timeout: 256 seconds]

02:15 petris_ is now known as petris

02:19 gmanlan has joined #mlpack

02:29 < gmanlan> rcurtin: you there?

02:29 < rcurtin> yeah, I am finishing debugging these random forest changes

02:29 < rcurtin> I of course introduced some bug that's really irritating to track down

02:29 < rcurtin> I wanted to take the night off after I got it working, but now it's 10:30pm and no end in sight...

02:31 < gmanlan> oh, I'm sorry to hear that

02:31 < rcurtin> "nothing can be simple" :)

02:31 < gmanlan> anything I can do to help?

02:31 < rcurtin> on the up side, I improved the implementation's speed by orders of magnitude in some cases

02:31 < gmanlan> oh that's awesome!

02:31 < rcurtin> nah, I feel like I am close

02:31 < rcurtin> I also ended up adding a few more parameters to the random_forest binding

02:32 < gmanlan> that's great

02:32 < gmanlan> ok, so to cheer you up, I made some progress with the python bindings on windows

02:32 < rcurtin> so I'm happy I took the time to fix this. the implementation still isn't as good as it could possibly be, but it seems to be faster than scikit in many cases (in part because it is easily parallel)

02:32 < rcurtin> great to hear!

02:32 < gmanlan> (faster than scikit makes me so happy)

02:33 < gmanlan> I will be adding my findings tomorrow to the python binding issue we were working on, but may need your guidance on how to change a couple of cmakes

02:33 < rcurtin> like I said, I know there's more there. but I don't have the time at the moment to accelerate it further

02:33 < rcurtin> sure, I'll try to help while I'm waiting for things to compile :)

02:33 < gmanlan> and to continue with the news... I have started working on an entirely new website

02:33 < gmanlan> based on what we discussed last time via email

02:34 < rcurtin> that sounds good, is your plan to use the existing website as a base?

02:34 < rcurtin> I would imagine it's mostly just new content that needs to be dropped into place

02:34 < rcurtin> but I'm not totally sure what you had in mind

02:34 < gmanlan> yes, I'm keeping the current jekyll site, but heavily customized

02:35 < gmanlan> I'm trying to bring material and flat design to the website

02:35 < gmanlan> putting a lot of emphasis on the landing experience (i.e. getting started, multiple download/build options, etc)

02:35 < rcurtin> that sounds great

02:35 < gmanlan> I will be sharing some previews with you soon

02:35 < rcurtin> I am a completely incompetent web developer so I am sure what you will come out with is way better :)

02:36 < gmanlan> :) you did a great job, I'm just trying to bring mlpack to a TensorFlow level of website

02:36 < rcurtin> even the template we have now is taken from the ensmallen website, which was itself taken from the armadillo website...

02:36 < rcurtin> I think my websites are optimized for people who are trying to pretend they still live in the 90s, like myself :)

02:37 < gmanlan> hahahahaa

02:37 < gmanlan> yeah, the template is somewhat basic and there are no suitable templates out there, so I'm keeping it as a base but adding many customizations (which I hope will not be hard to maintain)

02:37 < jeffin143> Wonderful

02:37 < jeffin143> To head it gmanlan :)

02:37 < gmanlan> I don't really expect we would be changing the landing page/getting started sections too often

02:37 < gmanlan> :)

02:37 < jeffin143> Hear*

02:38 < rcurtin> yeah, definitely not, mostly I just need to be able to use sed/awk scripts to update the versions of links, etc.

02:38 < rcurtin> but that should be no problem at all

02:38 < gmanlan> coolo - I'm a little bit of a perfectionist so I will take my time, but bear with me

02:38 < gmanlan> *cool

02:38 < rcurtin> if it's finally time to transition to a black-on-white theme I'm okay with that too

02:39 < rcurtin> I know the feeling---take your time :)

02:39 < jeffin143> No, white on black suits much better; there are a lot of black on white, we can be unique

02:39 < jeffin143> Maybe we could vote for that :)

02:39 < gmanlan> haha, well it depends on the goal

02:40 < gmanlan> if we want to increase adoption from users such as 'the industry' then it's better to be more 'serious'

02:40 < rcurtin> sounds fine to me. I do think the white-on-black is a distinctive thing, but I don't know how easy it would be to accomplish gmanlan's goals like that

02:41 < rcurtin> maybe a less "extreme" light-on-dark can do the job, not sure?

02:41 < gmanlan> I have some UX friends

02:41 < gmanlan> I can ask around for advice

02:42 < rcurtin> :+1:

02:42 < rcurtin> or I guess I should actually go dig up the right Unicode character... 👍

02:42 < rcurtin> (not that it displays in my terminal. I just see a black box)

02:43 < gmanlan> haha, it looks great

02:43 < rcurtin> yeah, I checked the IRC log page to make sure it went through :)

02:44 < gmanlan> rcurtin: are you tagging/saving the .msi files for each release?

02:44 < gmanlan> or better said, the artifacts in general?

02:44 < rcurtin> I think AppVeyor builds them but I don't currently have a process to grab or tag them

02:44 < rcurtin> it should be easy enough to get them, since I think AppVeyor will build the tags too

02:44 < gmanlan> ah ok, we will need that - AppVeyor deletes the stuff after a while

02:46 < rcurtin> oh, let me see if I can quickly get a copy of the 3.1.0 tag one then

02:47 < rcurtin> https://ci.appveyor.com/project/mlpack/mlpack/builds/24113716

02:47 < rcurtin> I grabbed the msi's for now

02:49 < gmanlan> great thanks

02:49 < gmanlan> we may need to add it in /files like the .tars

02:50 < rcurtin> yeah, I think so. not sure exactly how to automate it yet, but we can figure that out later

02:51 < gmanlan> (Y)

02:51 < gmanlan> 👍

02:53 < rcurtin> only thing is, I'm not totally sure that installer works

02:53 < rcurtin> it doesn't seem to want to start in wine, but that's probably wine being broken

02:54 < gmanlan> I tested the .msi from your RF PR and it worked just fine

02:55 < rcurtin> ok, let me update the website then

02:55 < rcurtin> it's a different msi but probably works fine

04:02 < rcurtin> gmanlan: ok, finally, that took way longer than I had hoped. I'm headed to bed now; I think the RF fix is done now (at least for now... hopefully it works right for you :))

04:02 < gmanlan> fantastic rcurtin

04:02 < gmanlan> I will test it first thing tomorrow

04:03 < gmanlan> thanks for all your help!

04:04 < rcurtin> sure, I should have avoided writing the bug in the first place :)

04:05 < gmanlan> :)

04:09 < jeffin143> gmanlan: seems you are very good with Windows OS :-p I am much better off with Linux

04:12 gmanlan has quit [Ping timeout: 256 seconds]

07:46 Mulx10 has joined #mlpack

07:47 < Mulx10> rcurtin : loading images followed by conversion to arma::mat is possible

07:48 < Mulx10> I found this https://stackoverflow.com/questions/26973970/conversion-between-cvmat-and-armamat

07:48 < Mulx10> It would be optional as such, kind of like an extension to data::Load
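
One detail any such conversion has to handle is storage order: OpenCV's `cv::Mat` stores elements row by row, while Armadillo's `arma::mat` stores them column by column. The sketch below shows only that reordering on plain buffers. It uses neither library, and `RowMajorToColMajor` is a hypothetical name, not something taken from the linked answer or from mlpack.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Reorders a rows x cols matrix from row-major storage (the cv::Mat layout)
// to column-major storage (the arma::mat layout).
std::vector<double> RowMajorToColMajor(const std::vector<double>& in,
                                       std::size_t rows,
                                       std::size_t cols)
{
  assert(in.size() == rows * cols);  // buffer must match the given shape
  std::vector<double> out(in.size());
  for (std::size_t r = 0; r < rows; ++r)
    for (std::size_t c = 0; c < cols; ++c)
      out[c * rows + r] = in[r * cols + c];
  return out;
}
```

For the 2 x 3 matrix with rows (1, 2, 3) and (4, 5, 6), the row-major buffer 1 2 3 4 5 6 becomes 1 4 2 5 3 6.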

08:03 Mulx10 has quit [Ping timeout: 256 seconds]

08:06 < jenkins-mlpack2> Project docker mlpack nightly build build #319: STILL UNSTABLE in 3 hr 52 min: http://ci.mlpack.org/job/docker%20mlpack%20nightly%20build/319/

08:17 < jeffin143> Mulx10: did you try that?

08:27 jeffin143 has quit [Ping timeout: 258 seconds]

10:01 pd09041999 has joined #mlpack

11:23 pd09041999 has quit [Ping timeout: 245 seconds]

11:54 pd09041999 has joined #mlpack

13:02 < rcurtin> gmanlan: the msi is now displayed on the homepage; hopefully that helps somewhat with the ease of Windows install

13:11 jeffin143 has joined #mlpack

13:31 frogEye has joined #mlpack

13:31 < frogEye> Hi

13:32 < frogEye> @rcurtin: What type of encoding do we use for categorical data?

13:32 < frogEye> Thanks

13:37 < rcurtin> frogEye: I get more than a hundred emails a day. the backlog is ridiculous

13:37 < rcurtin> I will respond to you when I have a chance. I need to look into some things to put together a good response

13:38 < rcurtin> the categorical encoding itself, if you look at DatasetMapper, just encodes categories as size_t
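
The idea of encoding categories as `size_t` can be illustrated with a toy mapper. This is a minimal sketch and not mlpack's `DatasetMapper`: the class `ToyCategoryMapper` and its methods are hypothetical, and the rule of numbering categories in order of first appearance is an assumption of the sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Toy stand-in (NOT mlpack's DatasetMapper): gives each distinct category
// string the next unused size_t, in order of first appearance.
class ToyCategoryMapper
{
 public:
  // Returns the integer code for a category, creating one if it is new.
  std::size_t Map(const std::string& category)
  {
    const auto it = ids.find(category);
    if (it != ids.end())
      return it->second;
    const std::size_t id = names.size();
    ids.emplace(category, id);
    names.push_back(category);
    return id;
  }

  // Returns the category string for a previously assigned code.
  const std::string& Unmap(std::size_t id) const
  {
    assert(id < names.size());
    return names[id];
  }

  std::size_t NumCategories() const { return names.size(); }

 private:
  std::unordered_map<std::string, std::size_t> ids;
  std::vector<std::string> names;
};
```

With this scheme, mapping "red", "blue", "red" yields the codes 0, 1, 0, and the reverse lookup recovers the original strings.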

13:38 < frogEye> Thanks for the reply

13:48 frogEye has quit [Quit: Page closed]

13:56 < chandramouli_r> zoq: Can I still work on that idea (Reinforcement Learning), to add it as an API to mlpack?

13:59 < zoq> chandramouli_r: Totally

14:43 xiaohong has joined #mlpack

14:52 xiaohong has quit [Ping timeout: 256 seconds]

16:05 < chandramouli_r> so I need some help regarding this project and some clarity about it.

16:06 < chandramouli_r> What about the research opportunity in that project? What should be done for that?

16:06 pd09041999 has quit [Ping timeout: 246 seconds]

16:20 pd09041999 has joined #mlpack

17:38 Toshal has joined #mlpack

17:41 saksham189 has joined #mlpack

17:48 < Toshal> saksham189: Hi,

17:48 < saksham189> Hi

17:52 < Toshal> ShikharJ saksham189: I am fine with any timing. But it would be great if we meet after 9:00 pm as it ensures that I am in my room.

17:55 < saksham189> ShikharJ: Toshal: After 9pm is fine for me. Are you guys free tomorrow?

18:05 < Toshal> saksham189: Yes I am free.

18:17 Toshal has quit [Remote host closed the connection]

18:28 pd09041999 has quit [Ping timeout: 248 seconds]

18:40 pd09041999 has joined #mlpack

18:42 gmanlan has joined #mlpack

18:48 jeffin143 has quit [Quit: AndroIRC - Android IRC Client ( http://www.androirc.com )]

18:57 favr49 has joined #mlpack

18:57 favr49 has quit [Client Quit]

18:58 favre49 has joined #mlpack

18:59 < favre49> rcurtin: http://www.mlpack.org/doc/stable/doxygen/build.html mentions mlpack 3.0.4 instead of mlpack 3.1.0

19:04 < ShikharJ> saksham189: Toshal: Okay, tomorrow after 9pm sounds good.

19:19 < rcurtin> favre49: thanks, I'll see if I can find that bit and fix it tonight

19:37 favre49 has quit [Quit: Page closed]

20:40 pd09041999 has quit [Ping timeout: 245 seconds]

20:41 < gmanlan> rcurtin: I'm still testing the changes you pushed for RF - one question: is it possible to control tree depth in the current implementation? If not, what criterion is being used?

20:57 pd09041999 has joined #mlpack

21:18 < rcurtin> gmanlan: we haven't implemented any support for tracking depth in the tree building procedure, but I don't think it would be particularly hard

21:23 pd09041999 has quit [Quit: pd09041999]

21:29 < gmanlan> ok - I guess it would be a good addition to prevent overfitting when playing with large datasets, I will add a personal note so we consider it in the future

21:29 < saksham189> ShikharJ: Ok, let's have it then.

21:29 < rcurtin> I dunno, I think also minimum_leaf_size is a (roughly) equivalent way to prevent overfitting, but I can agree, it could be nice to add in the future

21:33 < gmanlan> that's the thing, unfortunately I'm not an expert but I'm trying to help move some implementations from other frameworks such as scikit-learn, and because parameters don't match 100%, users go into panic mode

21:34 < gmanlan> it's not necessarily an mlpack problem though - it's a knowledge problem :)

21:47 < rcurtin> gmanlan: yeah, you're right about that, which would be a good reason to also include a max depth parameter

21:47 < rcurtin> I thought about it for a while, the two things are somewhat different ways of regularizing the tree

21:47 < rcurtin> in practice they probably *usually* perform about the same, but not necessarily
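
The two regularizers discussed above can be put side by side in a toy stopping rule. This is a sketch under stated assumptions, not mlpack's tree-building code: `ShouldStopSplitting` is a hypothetical function, `maxDepth == 0` is taken to mean "no depth limit", and the leaf-size test assumes a split is only allowed when both children could hold at least `minimumLeafSize` points.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stopping rule: returns true if the node should become a leaf.
bool ShouldStopSplitting(std::size_t numPoints,
                         std::size_t depth,
                         std::size_t minimumLeafSize,
                         std::size_t maxDepth)
{
  assert(minimumLeafSize >= 1);
  // Leaf-size rule: too few points to give both children enough points.
  if (numPoints < 2 * minimumLeafSize)
    return true;
  // Depth rule: the node is already at the depth limit (0 disables it).
  if (maxDepth != 0 && depth >= maxDepth)
    return true;
  return false;
}
```

The leaf-size rule depends on how many points reach a node, while the depth rule depends only on position in the tree, which is one way to see why the two can behave differently on the same data.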

21:51 < gmanlan> rcurtin: yes - thanks for sharing your thoughts. I wish I had more experience in the tuning area, hopefully soon

22:02 preacher_ has joined #mlpack

22:02 < preacher_> hello

22:02 preacher_ has quit [Client Quit]

22:03 < rcurtin> gmanlan: no problem, even if we add more parameters one could use the hyperparameter tuner to find the best ones :)

22:04 < rcurtin> the hyperparameter tuner is really a cool piece of code; however, it's currently not very discoverable

22:04 < rcurtin> we have a tutorial, but I don't think it makes it very easy to use
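
The general idea behind tuning a parameter such as a minimum leaf size can be sketched as a plain grid search over candidate values. This is not mlpack's hyperparameter tuner and does not use its API: `BestCandidate` is a hypothetical helper, and the objective is assumed to be something to minimize, such as a validation error.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Toy grid search (NOT mlpack's tuner): evaluates every candidate value and
// returns the one with the lowest objective. Ties keep the earliest value.
std::size_t BestCandidate(
    const std::vector<std::size_t>& candidates,
    const std::function<double(std::size_t)>& objective)
{
  assert(!candidates.empty());
  std::size_t best = candidates.front();
  double bestScore = objective(best);
  for (std::size_t i = 1; i < candidates.size(); ++i)
  {
    const double score = objective(candidates[i]);
    if (score < bestScore)
    {
      bestScore = score;
      best = candidates[i];
    }
  }
  return best;
}
```

In practice the objective would train a model with the candidate value and return its error on held-out data; here it is left as a callback so the sketch stays independent of any particular model.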

22:05 < gmanlan> oh, well, I didn't know we have one (I suspected it)

22:53 KimSangYeon-DGU has quit [Ping timeout: 256 seconds]

23:36 gmanlan has quit [Quit: Page closed]