ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
vidhan has joined #mlpack
Poulami101 has joined #mlpack
Poulami101 has quit [Client Quit]
vidhan has quit [Ping timeout: 256 seconds]
KimSangYeon-DGU has quit [Quit: Page closed]
MystikNinja has joined #mlpack
< MystikNinja> Hey all, I'm applying to mlpack for GSoC 2019. I'm interested in implementing a deep learning module along with associated tests and docs. I'd like some advice on preparing my application beyond what is mentioned in the Ideas page.
< MystikNinja> 1. What are you looking for in a successful application?
< MystikNinja> 2. What level of knowledge regarding the literature and background material do you expect? I'm sure everyone will be willing to learn, but I don't know if I will have time to learn enough, if the gap is too large.
< MystikNinja> 3. You ask us to "provide some comments/ideas/tradeoffs/considerations about your decision process". Could you elaborate more on what kind of things you expect us to reason about? Really, the only thing I can think of to select a model to implement is if I feel I can implement it in time. Are there particular models you all would prefer implemented over others?
< MystikNinja> 4. What do you expect someone to know (or learn) if they want to work on this idea i.e. implementing a deep learning module?
MystikNinja has quit [Quit: Page closed]
tnsahr2580 has joined #mlpack
tnsahr2580 is now known as Soonmok
< Soonmok> Hi! I'm trying to implement a GAN application on MNIST data using the mlpack library.
< Soonmok> but I don't understand what the noise function in the GAN code is.
< Soonmok> What should I put in as the noise function?
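For reference, in mlpack's GAN implementation the noise function is simply a callable that returns one random sample each time it is called; the GAN uses it to fill its noise matrix element by element. A rough sketch of a Gaussian noise function, along the lines of what the mlpack GAN tests use (the names here are only illustrative):

    #include <mlpack/core.hpp>
    #include <functional>

    // A GAN noise function is a callable returning a single sample per call.
    // Here we draw from a standard normal distribution.
    std::function<double()> noiseFunction = []() {
      return mlpack::math::RandNormal(0, 1);
    };

This callable is then passed to the GAN constructor along with the generator, discriminator, and noise dimension; a uniform alternative via mlpack::math::Random(-1, 1) would work the same way.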
KumarRIshabh has joined #mlpack
< KumarRIshabh> Hello everyone, I am Rishabh Kumar, a student majoring in Math from India. I am quite interested in the 'QGMM' project and have previously worked with GMMs. I have not worked with mlpack prior to this. Are there any issues or previous builds of QGMM or GMM available in mlpack?
Suryo has joined #mlpack
< Suryo> hello zoq!
< Suryo> I've submitted a pull request to include the PSO code. Sorry for the delay - I've had some pretty tough coursework this semester.
< Suryo> However, I've not included any parallelization so far. I'll do that once the current request has been approved.
< Suryo> Thank you.
Suryo has quit [Quit: Page closed]
< ayesdie> rcurtin: I was about to get started on making bindings for `LinearSVM` and then benchmarking it. Should I wait for the PR to be approved?
KumarRIshabh has quit [Quit: Page closed]
KumarRishabh has joined #mlpack
< rcurtin> ayesdie: nah, no need to wait---but let's open the bindings and benchmarks in a different PR if that's ok
< ayesdie> alright, I'll get into it and make a PR when it's ready for an initial review.
< rcurtin> sounds good, thanks. I hope to be able to review the linear SVM PR in full in the next few days
sk1499 has joined #mlpack
sk1499 has quit [Client Quit]
KumarRishabh has quit [Ping timeout: 256 seconds]
kinshuk has joined #mlpack
KimSangYeon-DGU has joined #mlpack
kinshuk has left #mlpack []
KimSangYeon-DGU has quit [Quit: Page closed]
Bellalau_ has joined #mlpack
cjlcarvalho has joined #mlpack
Bellalau_ has quit [Ping timeout: 256 seconds]
vivekp has quit [Read error: Connection reset by peer]
vivekp has joined #mlpack
Suryo has joined #mlpack
< Suryo> zoq, rcurtin: I have a question. I submitted a pull request for a PSO module and the Travis CI check failed. However, it failed on the SPSA test, which is something I did not touch.
< Suryo> Two tests that I wrote for PSO are passing
< Suryo> What should be done?
Suryo has quit [Quit: Page closed]
< jenkins-mlpack2> Yippee, build fixed!
< jenkins-mlpack2> Project docker mlpack nightly build build #248: FIXED in 3 hr 30 min: http://ci.mlpack.org/job/docker%20mlpack%20nightly%20build/248/
< zoq> Suryo: I'll have to adapt the threshold for the SPSA test, will do that later today.
riaash04 has joined #mlpack
gopal has joined #mlpack
< gopal> help
< riaash04> Hi, I am working on implementing a manifold learning algorithm, Isomap. For this I forked the mlpack repository, cloned it locally, and built it using the CMake commands, and it succeeded. But now, after adding new files, I tried to build it again and it can't find some Boost libraries (program options, unit test framework, serialization).
< riaash04> So I ran apt-get again for all the dependencies, but it still can't find them.
< riaash04> Also, I built it using the process mentioned on this page http://www.mlpack.org/docs/mlpack-3.0.4/doxygen/build.html and it succeeded. But the build of my local mlpack repository is not completing.
< riaash04> Please help. I just want to test the code I am implementing.
< riaash04> Even the CMake configuration is not completing.
gopal has quit [Ping timeout: 256 seconds]
cjlcarvalho has quit [Ping timeout: 268 seconds]
sonu628 has joined #mlpack
yanyan has joined #mlpack
sonu628 has quit [Ping timeout: 256 seconds]
sonu628 has joined #mlpack
yanyan has quit [Quit: Page closed]
yanyan has joined #mlpack
sonu628 has quit [Ping timeout: 256 seconds]
< riaash04> So after deleting the build folder, the CMake configuration is working again.
yanyan_ has joined #mlpack
yanyan_ has left #mlpack []
yanyan has quit [Ping timeout: 256 seconds]
< riaash04> I am doing the following to set up the development environment (I'm very new to open source development): 1) forked and cloned the mlpack repository; 2) used cmake ../ to configure and then built mlpack. Now, after adding any file to the methods directory, do I need to build the whole of mlpack again every time to check the code? I know part of mlpack can be built separately, but how do I build the new files I am adding separately?
Suryo has joined #mlpack
< Suryo> zoq: thanks!! I also saw that adeel has submitted a pull request with some of his earlier code refactored. I think that's for global-best PSO. Mine is local-best. So I guess it'll be good to get both implementations together.
< Suryo> What do you think?
Suryo has quit [Client Quit]
riaash04 has quit [Quit: Page closed]
yanyan has joined #mlpack
yanyan has quit [Ping timeout: 256 seconds]
blank has joined #mlpack
blank has quit [Client Quit]
aman_p has joined #mlpack
sumedhghaisas has quit [Ping timeout: 256 seconds]
pd09041999 has joined #mlpack
< ShikharJ> rcurtin: I think quite a few PRs got closed recently. Is that automation for closing PRs still on?
< rcurtin> yeah, it is, they get closed if the 'keep-open' label is not set
< rcurtin> you could see the ones that got closed with a search like 'is:pr is:closed label:'s: stale'' or similar
< rcurtin> and if it closed some that should have stayed open, feel free to reopen and mark as 'keep open' :)
< ShikharJ> Cool, thanks :)
< rcurtin> sure
< rcurtin> let me know when you're happy with the test wiki page (or if you wanted me to add something to it? I can't remember) and I can update mlpack-bot's text
riaash04 has joined #mlpack
ironmaniiith has joined #mlpack
KRONOS has joined #mlpack
yanyan_ has joined #mlpack
soham has joined #mlpack
ironmaniiith has quit [Quit: Page closed]
ironmaniiith has joined #mlpack
aman_p has quit [Ping timeout: 246 seconds]
< ShikharJ> I wanted to add some stuff, but probably after my exams (which get over in a couple of days).
KRONOS has quit [Ping timeout: 256 seconds]
< rcurtin> sure, no hurry
soham has quit [Ping timeout: 256 seconds]
aman_p has joined #mlpack
aditya has joined #mlpack
aditya has quit [Client Quit]
yogesh01 has joined #mlpack
< riaash04> Hi, how can I make CMake build just the folder that I specify (like a new folder that I add to the methods folder)? Also, can anyone help me understand how to run the specific code that I write, to debug it? I have added a folder with some hpp and cpp files to the methods folder. How can I just run the contents of that folder? I'm very new to open source development. Thanks for the help.
< riaash04> I have cloned and built the mlpack repository on Ubuntu.
< rcurtin> Hi riaash04, have you done any reading about how CMake works? If you've added a new folder, you would need to add it to the relevant CMakeLists.txt files
< rcurtin> I'd suggest you take some time and learn a little about CMake, then read through how we have the project configured in order to understand how to do what you want to do
< riaash04> Yes, I added the folder to CMakeLists.txt and it did get built. I still have to get more familiar with CMake though, I will do that. Thanks.
< rcurtin> sounds like you have gotten it worked out then; that's good to hear. if I can clarify things about mlpack's specific CMake configuration do let me know
rob has joined #mlpack
rob is now known as Guest79291
< Guest79291> How do I compile with NVBLAS? I have already installed CUDA and everything, but linking with -lnvblas, even with -DARMA_DONT_USE_WRAPPER, doesn't make it any faster...
< Guest79291> and arma::config tells me that it's not using BLAS when I check
< rcurtin> Guest79291: nvblas isn't guaranteed to give speedup for every operation... it depends on the workload and the algorithm and the data, etc.
< rcurtin> maybe you are not doing any operations with mlpack that make use of blas functionality?
< Guest79291> I'm making a ton of subvec() calls and resizes on a matrix I loaded in
< rcurtin> not sure that nvblas would help with that
< Guest79291> Wouldn't it print that it was using blas? I'm doing if(cfg.blas){...}
< rcurtin> I'm not familiar with arma::config unfortunately
< Guest79291> alright, well, thank you
< rcurtin> yeah, sorry that I can't be more helpful...
< rcurtin> but anyway the way nvblas works is that it looks at the size of the matrix and the operation
< rcurtin> and estimates whether it would be faster to transfer the matrix to the GPU, perform the operation, and transfer back
< Guest79291> oh, I see
< rcurtin> but it sounds like what you're doing is something like matrix copy/extract operations? in which case that wouldn't give any speedup, I don't think
< rcurtin> but if instead you are doing something more like A*B.t() (or those types of operations), for very large matrices (and if you have a good GPU), nvblas will move the computation to the GPU and it will give some speedup
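To illustrate the distinction rcurtin describes: a dense product like the one below goes through BLAS (dgemm), so NVBLAS can potentially offload it to the GPU for large matrices, while submatrix copies never touch BLAS at all. The sizes here are arbitrary:

    #include <armadillo>

    int main()
    {
      arma::mat A(4000, 4000, arma::fill::randu);
      arma::mat B(4000, 4000, arma::fill::randu);

      // Dense matrix product: dispatched to BLAS (dgemm); with NVBLAS loaded
      // and a large enough problem, this may be shipped to the GPU.
      arma::mat C = A * B.t();

      // Submatrix extraction and resizing: plain memory copies, no BLAS call,
      // so NVBLAS cannot accelerate them.
      arma::mat D = A.submat(0, 0, 99, 99);
      D.resize(200, 200);

      return 0;
    }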
< rcurtin> neither of these things are as good as having the matrix always stored on the GPU
< rcurtin> the bandicoot project will probably be what you are looking for when it is ready, then---it's basically Armadillo with matrices stored on the GPU
< Guest79291> Ah. You're exactly right
< rcurtin> however there is still a lot of implementation work to be done there
< rcurtin> so it is not quite ready yet unfortunately :(
< Guest79291> Well, good luck :)
< Guest79291> I realize now what you're saying, I hadn't actually used the data in mlpack in any way.
< Guest79291> I'm assuming once I feed it into my NN it will speed up
< rcurtin> yeah, nvblas may give acceleration for the NN, but I'm not totally sure... I haven't tried it myself
< rcurtin> I'd imagine larger batch sizes would be helpful with that
< rcurtin> like I said before it also depends on the GPU too... nvblas does its own estimation of what will be faster and what won't, so it takes the GPU model into account (I assume)
sohamt09 has joined #mlpack
< sohamt09> hi!!
pavan has joined #mlpack
pavan has quit [Client Quit]
< zoq> sohamt09: Hello there!
< zoq> Suryo: definitely
< sohamt09> I have a few questions to ask
yogesh01 has quit [Ping timeout: 256 seconds]
Guest79291 has quit [Quit: Page closed]
KimSangYeon-DGU has joined #mlpack
riaash04 has quit [Quit: Page closed]
ironmaniiith has quit [Ping timeout: 256 seconds]
deepak_ has joined #mlpack
< deepak_> HELP
deepak_ has quit [Quit: Page closed]
vivekp has quit [Read error: Connection reset by peer]
robb6 has joined #mlpack
vivekp has joined #mlpack
< robb6> are there any plans to add methods for genetic algorithms? I know one was done in a fork a while ago
< zoq> robb6: If you count NEAT, CNE, DE as well, yes.
< zoq> robb6: CNE and DE are already implemented
< robb6> where are they under methods?
< robb6> I don't see them
< rcurtin> they're part of ensmallen now: http://ensmallen.org/
< rcurtin> (we took all the stuff out of src/mlpack/core/optimizers/ and put it into its own separate library, because we thought it would be more widely usable outside of mlpack)
< robb6> Ah
< robb6> thank you
sohamt09 has quit []
< robb6> is the optimizer tutorial page still up to date?
< robb6> Great. And all I have to do is specify the optimizer during training?
< zoq> Yes and no, not every optimizer will work with every method. But ensmallen will warn you if you do something that shouldn't work in the first place.
< robb6> What optimizers would be able to work on an ANN? For example, optimizing the number of hidden layers/neurons?
< robb6> :( I'm getting an out of bounds error for Mat::operator() when I train my model
< robb6> It's just a normal FFNN with three layers
< robb6> I forgot to specify MSE, sorry
robb6 has quit [Quit: Page closed]
kinshuk has joined #mlpack
robb7 has joined #mlpack
aman_p has quit [Ping timeout: 250 seconds]
< zoq> robb7: Every method that takes differentiable separable functions: https://ensmallen.org/docs.html#differentiable-separable-functions
aman_p has joined #mlpack
< robb7> are there any 2d plotting/graphing libraries that work nicely with armadillo?
< rcurtin> robb7: all I know of is gnuplot, maybe there is something better though...
< rcurtin> the C++ plotting world was not that great last time I checked...
< rcurtin> (but I also don't have much need for plotting so I am not the most knowledgable in this domain)
aman_p76 has joined #mlpack
< kinshuk> Hi all
< kinshuk> From my digging, it looks like mlpack doesn't have a one-cycle learning rate policy yet
< kinshuk> I think it would be a useful addition, since it seems to speed up training of NNs a lot
< kinshuk> (the idea is from the same guy who popularized cyclical annealing)
vivekp has quit [Ping timeout: 246 seconds]
vivekp has joined #mlpack
ac-optimus has joined #mlpack
yanyan_ has quit [Quit: Page closed]
channu has joined #mlpack
channu has left #mlpack []
robb7 has quit [Quit: Page closed]
aman_p76 has quit [Ping timeout: 244 seconds]
aman_p has quit [Ping timeout: 246 seconds]
robb8 has joined #mlpack
< robb8> armadillo is not using my gpu even though I explicitly told it to.
< robb8> 100% gpu power
< robb8> 100% cpu power **
< robb8> would compiling as static help?
< zoq> kinshuk: If you like you can see if it's possible to implement it as another decay policy inside the ensmallen framework.
< robb8> how do I save an already trained NN model to a file and reload it later?
< robb8> thank you
< KimSangYeon-DGU> robb8: It helps you
< KimSangYeon-DGU> :)
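A minimal sketch of the usual approach, using mlpack's data::Save() and data::Load() with Boost serialization (the network definition below is made up, and the exact header paths may differ between mlpack versions):

    #include <mlpack/core.hpp>
    #include <mlpack/methods/ann/ffn.hpp>
    #include <mlpack/methods/ann/layer/layer.hpp>
    #include <mlpack/methods/ann/loss_functions/mean_squared_error.hpp>

    using namespace mlpack;
    using namespace mlpack::ann;

    int main()
    {
      FFN<MeanSquaredError<>> model;
      model.Add<Linear<>>(10, 5);
      model.Add<SigmoidLayer<>>();
      model.Add<Linear<>>(5, 1);

      // ... train the model here ...

      // Serialize the trained network to disk (binary format).
      data::Save("model.bin", "model", model, false);

      // Later, possibly in a different program, create a network of the same
      // type and deserialize into it.
      FFN<MeanSquaredError<>> loadedModel;
      data::Load("model.bin", "model", loadedModel);

      return 0;
    }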
< rcurtin> robb8: I guess nvblas did not think it was profitable to move the computation to the GPU?
< rcurtin> someone else had an issue like this, let me find it:
< rcurtin> but I don't know details of the reporter's system
< rcurtin> I know we've seen speedup with nvblas in the past though; I don't remember if it was for the NN code (actually I think the NN code didn't exist at that time?)
< robb8> i guess I assumed wrongly that it would switch over
< robb8> even at 100% cpu , maybe its not worth it
< rcurtin> yeah, I'm not sure of the algorithms they use internally
< rcurtin> I feel like something like PCA where it's an eigendecomposition or something might ship the work off to the GPU because it's more computationally intensive work
< rcurtin> assuming the data is large enough
< robb8> got it
< rcurtin> I wish I could say "try bandicoot!"... give us a few months (or several?) and then maybe we can :)
< robb8> :)
< rcurtin> it's close to next in my priority list once I finish the website and handle a few other mlpack-related things
< robb8> also, is there a way to use one of the genetic algorithms in ensmallen (like CNE) to optimize the number of neurons? or would I have to implement that myself? what exactly does it change when generating a new random network?
< robb8> does it just use CNE on the weights/biases?
< rcurtin> hmmm, so that would be tricky but possible
< rcurtin> imagine implementing a non-differentiable function, so it has Evaluate()
< rcurtin> and Evaluate() takes in an arma::mat of parameters
< rcurtin> maybe each element in that arma::mat represents the number of neurons at each layer
< rcurtin> and then inside of Evaluate(), you convert each element of the arma::mat() to a size_t, then build the network accordingly, train and test
< rcurtin> and return the MSE (or whatever measure)
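A rough sketch of that idea (the class name, layer choices, and hyperparameters are all hypothetical, the data matrices are assumed to exist, and header paths may differ between mlpack versions):

    #include <cmath>
    #include <mlpack/core.hpp>
    #include <mlpack/methods/ann/ffn.hpp>
    #include <mlpack/methods/ann/layer/layer.hpp>
    #include <mlpack/methods/ann/loss_functions/mean_squared_error.hpp>
    #include <ensmallen.hpp>

    using namespace mlpack::ann;

    // Non-differentiable objective: each element of `params` encodes the size
    // of one hidden layer.  Evaluate() builds the network, trains it, and
    // returns the held-out mean squared error.
    class LayerSizeObjective
    {
     public:
      LayerSizeObjective(const arma::mat& trainX, const arma::mat& trainY,
                         const arma::mat& testX, const arma::mat& testY) :
          trainX(trainX), trainY(trainY), testX(testX), testY(testY) { }

      double Evaluate(const arma::mat& params)
      {
        FFN<MeanSquaredError<>> model;
        size_t inSize = trainX.n_rows;
        for (size_t i = 0; i < params.n_elem; ++i)
        {
          // Round each (continuous) parameter to a usable layer size.
          const long rounded = std::lround(params[i]);
          const size_t hidden = (rounded < 1) ? 1 : (size_t) rounded;
          model.Add<Linear<>>(inSize, hidden);
          model.Add<SigmoidLayer<>>();
          inSize = hidden;
        }
        model.Add<Linear<>>(inSize, trainY.n_rows);

        model.Train(trainX, trainY);

        arma::mat preds;
        model.Predict(testX, preds);
        return arma::accu(arma::square(preds - testY)) / testY.n_elem;
      }

     private:
      const arma::mat& trainX;
      const arma::mat& trainY;
      const arma::mat& testX;
      const arma::mat& testY;
    };

    // Usage with ensmallen's CNE (gradient-free), starting from two hidden
    // layers of 10 neurons each:
    //   LayerSizeObjective f(trainX, trainY, testX, testY);
    //   arma::mat layerSizes("10; 10");
    //   ens::CNE cne;
    //   cne.Optimize(f, layerSizes);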
< robb8> got it, thanks! :)
< rcurtin> it's also possible you could come up with some way to approximate a Gradient() function using finite-differences or something like this
< rcurtin> but... with neural network hyperparameters, there's no guarantee the loss function would even be smooth
< rcurtin> so using a gradient-based optimizer may not work very well
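For completeness, a central-difference approximation of a Gradient() function on top of an Evaluate()-only objective could look like this sketch (the helper name and step size are illustrative; as noted above, there is no guarantee a gradient-based optimizer helps on a non-smooth hyperparameter surface):

    #include <armadillo>

    // Hypothetical helper: numerical gradient for any object exposing
    // double Evaluate(const arma::mat&).
    template<typename FunctionType>
    void ApproximateGradient(FunctionType& f,
                             const arma::mat& params,
                             arma::mat& gradient,
                             const double eps = 1e-3)
    {
      gradient.set_size(arma::size(params));
      arma::mat perturbed = params;
      for (size_t i = 0; i < params.n_elem; ++i)
      {
        perturbed[i] = params[i] + eps;
        const double forward = f.Evaluate(perturbed);
        perturbed[i] = params[i] - eps;
        const double backward = f.Evaluate(perturbed);
        perturbed[i] = params[i];

        // Central difference: (f(x + eps) - f(x - eps)) / (2 * eps).
        gradient[i] = (forward - backward) / (2.0 * eps);
      }
    }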
< robb8> hmm
< robb8> I guess it'll probably be easier to tune it myself
< robb8> I have 256 input neurons, 1 output neuron, doing time series
< robb8> and like 2 hidden layers
< rcurtin> it may be easier to just try a handful of possibilities and see which is best
< rcurtin> but there are lots of possibilities :)
< robb8> ;)
kinshuk has quit [Remote host closed the connection]
< robb8> i guess I have to figure out a working range
kinshuk has joined #mlpack
< rcurtin> I've found with RNNs that adding extra layers of memory (like two layers of LSTMs or something like this) can be really helpful
< rcurtin> of course it makes it take way longer to train...
< robb8> I tried using an RNN (I'm using an FFNN right now), but it took me a while to get the data into cubes
< robb8> also, I had no idea what I was doing in terms of activation functions, so I applied Sigmoid to it, even though I wasn't classifying anything
< robb8> so I just added way more inputs to an FFNN
< rcurtin> ah, ok; same thing though with FFNNs, if you add more layers of depth it takes longer to train but can probably fit better
< robb8> do you think an RNN would be better for time series?
< rcurtin> depends on the task, but generally I'd use RNNs for time series if I could
< rcurtin> but again RNNs can take way longer
< rcurtin> some people have shown really nice results with convolutional FFNNs, and that they can approximate the results of RNNs (and train much more quickly)
kinshuk has quit [Ping timeout: 250 seconds]
< rcurtin> anyway, my rule of thumb for deep learning is "just try a lot of things" because it's really hard to predict how things will perform :)
ac-optimus has quit [Ping timeout: 256 seconds]
< rcurtin> other people may have different rules of thumb... I am not the world's best data scientist anyway :)
< robb8> haha :) thank you
< robb8> also, is there a way to print the MSE or do I calculate that myself?
< rcurtin> you'd have to compute it yourself for now, but one of the other things on my list is callbacks for ensmallen that print this automatically during optimization
< robb8> I'm assuming I do that myself because predict() only takes the input data
< robb8> gotcha
< robb8> thanks
< rcurtin> ah sorry you are just building the network not using ensmallen; in that case, there is a PR to the git master branch that makes Train() return the last value of the loss function
< rcurtin> that was merged recently, so if you are using the git master branch you should be able to do that
< robb8> oh! yeah I just compiled from source
< robb8> thanks :))))
< rcurtin> sure :)
< robb8> hey, I get a suuuuper long error when I try to use mlpack::data::Save... It uses Boost, right? I linked -lboost_system
< robb8> does it use something like -lboost_serialization?
< zoq> yeah
< robb8> thanks
pd09041999 has quit [Ping timeout: 246 seconds]
robb8 has quit [Quit: Page closed]
pd09041999 has joined #mlpack
Mina has joined #mlpack