#mlpack on 2016-06-22 — irc logs at libera.irclog.whitequark.org

2015-01-15 23:05 verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/

00:34 travis-ci has joined #mlpack

00:34 < travis-ci> mlpack/mlpack#1046 (master - 4129a7c : Ryan Curtin): The build passed.

00:34 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/2bd1227d0f41...4129a7c1d743

00:34 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/139364111

00:34 travis-ci has left #mlpack []

01:00 travis-ci has joined #mlpack

01:00 < travis-ci> mlpack/mlpack#1047 (master - bee9567 : Ryan Curtin): The build passed.

01:00 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/4129a7c1d743...bee9567e5c4c

01:00 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/139368115

01:00 travis-ci has left #mlpack []

01:02 marcosirc has quit [Quit: WeeChat 1.4]

01:03 travis-ci has joined #mlpack

01:03 < travis-ci> mlpack/mlpack#1048 (master - a9f5622 : Ryan Curtin): The build passed.

01:03 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/bee9567e5c4c...a9f5622c8a14

01:03 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/139368663

01:03 travis-ci has left #mlpack []

01:27 benchmark has joined #mlpack

01:27 benchmark has quit [Client Quit]

01:34 < rcurtin> wow! now there is some speedup

02:43 benchmark has joined #mlpack

02:43 benchmark has quit [Client Quit]

03:16 marcosirc has joined #mlpack

03:16 marcosirc has quit [Client Quit]

04:20 nilay has joined #mlpack

06:47 Lenish has joined #mlpack

06:50 Mathnerd314 has quit [Ping timeout: 276 seconds]

10:43 mentekid has joined #mlpack

11:32 nilay has quit [Ping timeout: 250 seconds]

12:49 zoq has quit [Ping timeout: 244 seconds]

12:54 zoq has joined #mlpack

13:03 nilay has joined #mlpack

14:00 marcosirc has joined #mlpack

15:10 kwikadi_ has joined #mlpack

15:10 kwikadi has quit [Ping timeout: 244 seconds]

15:10 kwikadi_ is now known as kwikadi

15:18 < marcosirc> Hi @rcurtin

15:18 < marcosirc> About the PR implementing approximate neighbor search.

15:18 < rcurtin> maecosirc: I think the game went very well for Argentina :)

15:19 < marcosirc> Thanks! Yes! I have to say thanks to Messi :)

15:19 < rcurtin> if you'd like to rebase the PR feel free, it will make it easier to merge

15:19 < marcosirc> Ok. I will do it. Thanks

15:20 < rcurtin> yeah, I figure Sumedh will merge it when he is happy with everything

15:34 mentekid has quit [Ping timeout: 250 seconds]

15:35 Mathnerd314 has joined #mlpack

16:02 < marcosirc> rcurtin: I can see in many sides of the code that Accessors/Mutators are implemented using references. Accessor return a const reference.

16:04 < marcosirc> Does it make sense to have both? Maybe, the mutator is enough, as ir can be used to access/modify the member's value.

16:19 < rcurtin> if you are writing a const method, then the const accessor is required

16:21 < marcosirc> Yeah, I understand. Ok.

16:34 mentekid has joined #mlpack

16:36 sumedhghaisas has joined #mlpack

16:48 mentekid has quit [Ping timeout: 252 seconds]

17:21 sumedhghaisas has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

17:21 sumedhghaisas has joined #mlpack

17:43 mentekid has joined #mlpack

18:07 mentekid has quit [Ping timeout: 244 seconds]

18:08 mentekid has joined #mlpack

18:11 < marcosirc> Hi @sumedh

18:12 < marcosirc> sumedhghaisas: I have fixed the conflicts in the PR on approximate knn.

18:14 < sumedhghaisas> @marcosirc Hey Marcos... It was on my agenda for tonight but thank you :)

18:14 < sumedhghaisas> I will merge it then...

18:14 < marcosirc> Ok. Thanks!

18:15 < sumedhghaisas> @marcosirc How are your graphs going?? Any interesting results yet??

18:15 < sumedhghaisas> I will merge as soon as the test tests are done...

18:16 < marcosirc> Hi, yes. If you give me 5 minutes I can upload it to github.io so you can see them.

18:18 < sumedhghaisas> Sure... I would love to look at them...

18:25 tsathoggua has joined #mlpack

18:27 < marcosirc> sumedhghaisas: Thanks. It is available in http://marcospividori.github.io/mlpack-app/

18:27 < marcosirc> you have to select the view: "Metric analysis with multiple parameters for an algorithm/dataset combination"

18:28 < sumedhghaisas> @marcosirc So base cases are decreasing as we predicted...

18:29 < marcosirc> Yes

18:29 < marcosirc> You can also see the runtime with:

18:29 < marcosirc> method: ALLKNN

18:29 < marcosirc> dataset: isolet

18:29 < marcosirc> option: e

18:29 < marcosirc> params: k:3 seed:42

18:29 < marcosirc> metric: Runtime

18:29 < marcosirc> it will show runtime for flann and ann too.

18:30 < marcosirc> Of course we should do benchmarking with bigger datasets!

18:30 < sumedhghaisas> yeah... I started with cloud dataset... mlpack is slowest there...

18:31 < marcosirc> isolet and corel-histogram are the biggest there..

18:32 < sumedhghaisas> mlpack seems to perform better as the dataset size grows :)

18:32 < sumedhghaisas> which is a very good thing

18:32 < sumedhghaisas> I mean perform relatively better...

18:32 < marcosirc> Yes, I agree.

18:33 < marcosirc> Maybe, we should do some test with bigger datasets...

18:34 < sumedhghaisas> Also the graph for isolet dataset... I cannot see a unit for the y-axis... where as for cloud it states minutes...

18:34 < sumedhghaisas> is minutes common??

18:36 < marcosirc> I think you are confussing the basecases metric with the runtime metric...

18:37 < marcosirc> Ahh no sorry, I see what you mean

18:38 < marcosirc> I don't think m means "minutes". I will check the javascript code to see how it sets the units.

18:40 < marcosirc> I am working updating that view. Now you have to set the same parameters for all libraries. I was planning to modify it.

18:40 < marcosirc> I was planning to use a similar approach to : "Dataset metric plots for any algorithm/parameter combination"

18:41 < marcosirc> where you can add as much cases as you want with different parameters.

18:42 < marcosirc> so we could compare, for example, "mlpack-knn ... -k 3 --seed 42 -t cover" against "mlpack-knn ... -k 3 --seed 42 -t spill" etc

18:42 mentekid has quit [Ping timeout: 276 seconds]

18:42 < marcosirc> Would you agree?

18:46 < sumedhghaisas> ohh you mean comparing against trees...

18:47 tsathoggua has quit [Quit: Konversation terminated!]

18:47 < marcosirc> I mean, the objeive of this new view is: "show the metric progress for a specific method configuration, with different values for a specific parameter"

18:49 < marcosirc> Now, we compare, for example, the runtime for ALLKNN with the same configuration for different libraries, and different values for the "-e" parameter.

18:50 < marcosirc> I would like to modify the view, so we can set different parameters and see what happen, in each case, when we only change a specific parameter, like "-e".

18:52 < marcosirc> Once we implemented spill trees. We will want to compare, not only mlpack against flann and ann, but also "mlpack -t kd" against "mlpack -t spill".

18:52 < marcosirc> Do you see what I mean?

18:53 < sumedhghaisas> I like the idea... but I think it will get complicated in this process....

18:53 < sumedhghaisas> do you think we should implement it in the same view??

18:53 < sumedhghaisas> or create another view??

18:53 < sumedhghaisas> let me think...

18:55 < marcosirc> I was implemented this in the same view...

18:58 < sumedhghaisas> hmm... I don't see any other way to compare this though...

18:59 < marcosirc> This is the general idea:

18:59 < marcosirc> Initially you choose: a method, a dataset, and an option. (For example: ALKNN, isolet, "-e")

18:59 < marcosirc> Then you can start adding combinations of: parameters, library. As many as you want.

18:59 < marcosirc> For example:

18:59 < marcosirc> "-k 3 -seed 42 -t spill", mlpack

18:59 < marcosirc> "-k 3 -seed 42 -t kd", mlpack

18:59 < marcosirc> "-k 3 -seed 42", flann

18:59 < marcosirc> "-k 3 -seed 42", ann

19:00 < marcosirc> when you click "Redraw graph", all of this cases are shown in the graphic, for different values of "-e".

19:01 < sumedhghaisas> So first parameter then library then combinations.... then add...

19:02 < sumedhghaisas> I mean...

19:02 < sumedhghaisas> in "Dataset metric plots for any algorithm/parameter combination"...

19:03 < sumedhghaisas> we first select method then combination then library...

19:03 < sumedhghaisas> but I don't think tree option will be valid for library...

19:03 < sumedhghaisas> so we can do library first then populate the combinations...

19:03 < sumedhghaisas> what do you think??

19:05 < marcosirc> Yes, I agree.

19:06 < marcosirc> Now, in "Dataset metric plots for any algorithm/parameter combination", when you select parameters, it updates the list of libraries available.

19:06 < marcosirc> So, when you select parameters like: "-k 3 -seed 42 -t spill", only mlpack will appear as a possible library.

19:07 < marcosirc> But, we can move the library to be chosen first, as you suggested. It looks more intuitive...

19:09 < sumedhghaisas> yeah... I think its more intuitive... So if I understand correctly... we will select the method first... then comparison parameter then library... then combination... correct??

19:15 < marcosirc> We could have fixed options: method, dataset, metric, option (For example ALLKNN, isolet, Runtime, -e)

19:15 < marcosirc> Then, users can add as many (library,parameters) combinations as they want.

19:17 < nilay> zoq: Hi, how are you?

19:17 < nilay> I had a small doubt, when doing 1 x 1 convolution we simply ignore some cells of the input tensor?

19:18 < sumedhghaisas> But if we are comparing for different parameter values... shouldn't we keep the method constant??

19:18 < sumedhghaisas> I mean both method and comparison parameter....?

19:18 < sumedhghaisas> ohh you mean the same...

19:19 < marcosirc> Yeah :)

19:19 < sumedhghaisas> sorry :)

19:19 < sumedhghaisas> I think thats a good plan...

19:20 < marcosirc> Great. Thanks! I am working on this. I think I will finish it in the next days.

19:24 < zoq> nilay: Hello, I can't complain, how are you? I'm not sure why do you think we ignore cells of the input tensor? The 1x1 convolution is used to reduce the filter size.

19:26 < zoq> nilay: http://iamaaditya.github.io/2016/03/one-by-one-convolution/ that's a nice explanation

19:26 < nilay> I am good :)

19:26 < nilay> yeah i am looking at the same link

19:26 < nilay> convolution is applying a filter on an input tensor.

19:27 < nilay> so 1 x 1 convolution reduces filter size, what does this mean?

19:27 < nilay> should i read the network in network paper too?

19:27 < zoq> nilay: yeah, that would be helpful

19:28 < zoq> nilay: the 1x1 convolution is a really neat trick, and it's so simple, "For example, an image of 200 x 200 with 50 features on convolution with 20 filters of 1x1 would result in size of 200 x 200 x 20."

19:29 < zoq> nilay: so if you do convolution with a 3x3 kernel you don't have to do it on the 200x200x50 input matrix, just on the 200x200x20 matrix

19:30 < nilay> so in a sense, we are applying this kernel of size 20 x 20 (20 filters of 1 x 1) on the input image which is 200 x 200 x 50

19:30 < nilay> i think i am wrong

19:31 < zoq> no, you are right you do convolution wit a kernel size of 1x1

19:32 < nilay> from where does then 20 filters come? ( a convolution with kernel size 1 x 1 would just be multiplying the image with a scalar)

19:33 < nilay> we implemented the ConvTriangle function, that is convolution right, with a kernel of size k x k

19:33 < nilay> (while doing the feature extraction part)

19:35 < nilay> ok so now i understand i think, we take a tensor at each location which is of size 50, and map it to a tensor of size 20

19:35 < nilay> using some kernell

19:35 < nilay> kernel*

19:36 < zoq> So if you start with an image of size 200x200x20 and you do convolution with a 1x1 filter the output is 200x200 because you accumulate the result over the dimensions of the input.

19:37 < zoq> so, someting like image[0] * kernel + image[1] * kernel ...

19:37 < nilay> yeah, ok. so we only consider the location (i, j) when quoting the convolution size, the tensor at that location is of any arbitrary length

19:39 < zoq> yes, the output of the convolution is defined by the number of kernels

19:39 < nilay> the animation on the link, is not representative then, it seems we are ignoring some values in that link

19:40 < nilay> so we have 20 kernels of size 1 x 1?

19:40 < zoq> yeah, it's not accurate

19:40 < zoq> yes, right

19:41 < zoq> http://wikicoursenote.com/w/images/8/8f/I1.png on top of that you do a 3x3 convolution and a 5x5 convolution

19:43 < nilay> yes ok.

19:45 < nilay> also for doing backprop for CNNs we don't have to do anything different than in a normal net?

19:45 < zoq> nilay: no, simple backprop

19:46 < zoq> http://arxiv.org/abs/1512.00567 is also a nice paper

19:47 < zoq> https://github.com/mlpack/mlpack/blob/master/src/mlpack/methods/ann/layer/conv_layer.hpp that's how the backward and gradient function of the convolution layer looks like

19:50 < nilay> yeah, i have to look at that paper.

19:51 < nilay> also the inception layer a - e should be made as separate layers?

19:53 < nilay> (a) layer means 1 x 1 convolution, (b) means a 3 x 3 on top of 1 x 1 and so on, right?

19:54 < zoq> just as a single layer

19:55 < zoq> no need to implement the naive version

19:55 < zoq> unless, you like to do that

19:56 < nilay> is my understanding of what (a) (b) .. mean w.r.t to inception layer correct?

19:56 < nilay> because i don't think it is

19:56 < zoq> I'm lloking at page 5 from the going deeper with convolutions paper

19:57 < nilay> yeah

19:57 < nilay> so they talk about 4a to 4d modules

19:57 < nilay> what is 4a?

19:57 < zoq> let's see

19:57 < nilay> inception (4a)

19:58 < zoq> ah, 4a is a different auxiliary network attached to the overall network

19:59 < nilay> so the inception layer is one unit, and there are auxillary classifiers attached over it in (a - e)

20:00 < zoq> the inception layer is what you see in figure b on page 5, but the overall inception network has some auxillary classifier attached to it

20:01 < nilay> yeah , ok

20:02 < zoq> page 7 figure 3 shows the complete inception network, with two auxillary classifiers attached to it

20:03 < zoq> the output of each, classifier is a softmax activation layer

20:04 < nilay> i wonder why we have a - e when we only attach a softmax classifier at the filter concatenation stage in that figure

20:04 Lenish is now known as Rodya

20:05 < zoq> The table on page 6 isn't what figure 3 shows

20:06 < zoq> if you add another inception layer, you probably also add another auxillary classifier

20:08 < zoq> out for dinner, back later

20:08 < nilay> ok

20:55 < sumedhghaisas> @rcurtin: Hey Ryan...

20:55 < sumedhghaisas> our tests are solid... we should boast our coverage... :P

20:55 < sumedhghaisas> I mean on the github readme...

21:06 < rcurtin> okay, if you want to modify the readme do it :)

21:14 < sumedhghaisas> @rcurtin we will need to add --coverage to gcc while compiling... I can try adding a cmake build for it and generate a graphical view using icov ... what do you think?

21:15 < sumedhghaisas> hmm... this seems better... https://coveralls.io/sign-up

21:22 travis-ci has joined #mlpack

21:22 < travis-ci> mlpack/mlpack#1051 (master - 37fda23 : sumedhghaisas): The build passed.

21:22 < travis-ci> Change view : https://github.com/mlpack/mlpack/compare/a9f5622c8a14...37fda23945b4

21:22 < travis-ci> Build details : https://travis-ci.org/mlpack/mlpack/builds/139596617

21:22 travis-ci has left #mlpack []

21:22 sumedhghaisas has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]

21:25 sumedhghaisas has joined #mlpack

21:27 sumedhghaisas has quit [Client Quit]

21:28 sumedhghaisas has joined #mlpack

21:30 sumedhghaisas has quit [Client Quit]

21:30 sumedhghaisas has joined #mlpack

21:44 nilay has quit [Ping timeout: 250 seconds]

23:20 marcosirc has quit [Quit: WeeChat 1.4]