ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
jeffin143 has quit [Remote host closed the connection]
< ShikharJ> zoq: Did you get a chance to take a look at https://github.com/mlpack/mlpack/pull/1761 for the gradient and backward functions? I'd like to get that merged if possible?
akfluffy has joined #mlpack
< akfluffy> zoq: hey, did you get a chance to make that CNN example with an image?
< akfluffy> with 3 channels*
akfluffy has quit [Remote host closed the connection]
< lozhnikov> jeffin143: okay, I'll look through the PR today.
< jenkins-mlpack2> Project docker mlpack nightly build build #359: STILL UNSTABLE in 3 hr 33 min: http://ci.mlpack.org/job/docker%20mlpack%20nightly%20build/359/
< zoq> ShikharJ: I did, but the gradient didn't match with the implementation, have to recheck my calculation, will do that in the next day or two.
< zoq> akfluffy: Not yet, will do this in the next few hours.
< zoq> akfluffy: I pretty much reused the ConvolutionalNetworkTest/VanillaNetworkTest: https://gist.github.com/zoq/87070ff2a4bf769d2264527b2f67b035
< ShikharJ> zoq: Hmm, okay. Please take your time :)
< ShikharJ> sakshamB: Toshal: Are you guys here?
< Toshal> ShikharJ: I am here
< sakshamB> ShikharJ: yes I am here
< ShikharJ> sakshamB: Okay, let's start on MiniBatchDiscrimination and Inception Score.
< sakshamB> ShikharJ: alright
< ShikharJ> I see that you marked a task as 'check inception score', which got me a little confused. What was meant by checking the score? And is there anything left to implement there?
< sakshamB> I’ll push the changes requested by zoq soon.
< sakshamB> hmm I just wanted to check the inception score of images produced by the GAN with the minibatch discrimination layer.
< ShikharJ> sakshamB: I see that Virtual Batch Norm is currently under progress, but I think the changes to BatchNorm should be made in a separate PR please.
< ShikharJ> sakshamB: Is there a baseline score reported somewhere?
< sakshamB> ShikharJ: yes I will do that. I just wanted to confirm whether the issue is valid or not before opening the PR.
< ShikharJ> sakshamB: I'm saying this about BatchNorm because it can be a part of a wider issue, and I don't wish Virtual Batch Norm to get stuck in that issue.
< sakshamB> ShikharJ: yes do you think I should open a separate PR for BatchNorm right now?
< ShikharJ> sakshamB: Yes, and you should also ideally push changes for the other tests that don't use a Linear layer before the layer under test, to check the validity of our hypothesis.
< sakshamB> ShikharJ: I will add the linear layer for the other tests but I am not sure how it would test the validity of the issue. It is possible that other layers also have the Backward implemented incorrectly.
< ShikharJ> sakshamB: Yeah, but if the number of those layers is small, we'll have a better idea where we're going wrong with the implementations.
< sakshamB> ShikharJ: alright sounds good. I will open the PR by today.
< ShikharJ> Toshal: Though I didn't get a chance to review the Radical Test PR, I think it's a nice addition. The Radical test has been failing for quite some time lately.
< ShikharJ> Toshal: The argument about making the different changes related to GANs in different PRs would apply here as well.
< ShikharJ> Toshal: I'm talking in terms of the Serialization PR. I'd appreciate it if your GAN-related changes were made in a separate PR, so we can focus on serialization only.
< ShikharJ> Toshal: I'm convinced of the implementation, so as soon as you shift those changes, I'll approve and merge. This is also because I'll have to see the output of serialized and reloaded GANs.
< ShikharJ> Toshal: Are you there?
< Toshal> ShikharJ: sorry
< Toshal> Okay, I will make a separate PR.
< ShikharJ> Toshal: I should be able to provide you with brief reviews of Label Smoothing and Weight Normalization by today. I still have to read the full papers.
< Toshal> Okay
xiaohong has joined #mlpack
< sakshamB> ShikharJ: Getting back to your previous question regarding the baseline for inception scores, I think that it would be difficult to have a fair comparison, since we will be training our own CNN model for computing the score as compared to using the Inception network. Also, most of the scores reported online are for CIFAR.
< ShikharJ> Toshal: By GAN-related changes, I mean those where the gradient of the generator is changed. Just to be clear, all the rest looks good.
< ShikharJ> sakshamB: Oh, so is it a relative measure?
< Toshal> ShikharJ: Yes, I got that.
< ShikharJ> Toshal: Okay cool, feel free to log off for now. I'll have to run and catch my regular bus, I'll start reviewing when I'm on it :)
< Toshal> Okay, Have a good day.
< sakshamB> ShikharJ: yes it would be best if we compare the inception score for different GAN variations. it would be difficult to get absolute numbers.
< ShikharJ> sakshamB: Okay, that sounds like a good idea to me. Do you mind implementing it based upon the existing models that we have of the different GAN variations? We just need a script for testing, it doesn't need to be a new test that we wish to merge.
< ShikharJ> Or rather, we could merge it into a separate test file for metrics; I'm open to that as well. Since the models are already implemented, it shouldn't be too hard.
< ShikharJ> Just need to find a way to cross test them on Inception.
< sakshamB> ShikharJ: hmm I am not sure about your second idea on “test file for metrics”
< sakshamB> it won’t be possible to train the entire model in the duration of the build and check the inception scores.
< ShikharJ> sakshamB: Hmm, yes, I think your concerns are valid. In that case, we can always push it over to the models repository once the script is final.
< sakshamB> ShikharJ: yes I think we should just have a script to see the inception scores for different variations rather than a test for now.
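For reference, the Inception Score being discussed is conventionally computed as exp(E_x[KL(p(y|x) || p(y))]) over a classifier's predicted class probabilities for the generated images. A minimal numpy sketch of that formula, assuming probs is an N x K matrix of per-image class probabilities from whichever CNN stands in for the Inception network:

    import numpy as np

    def inception_score(probs, eps=1e-12):
        # probs: (N, K) array, row i holding p(y | x_i) for generated image x_i.
        p_y = probs.mean(axis=0, keepdims=True)  # marginal distribution p(y)
        kl = (probs * (np.log(probs + eps) - np.log(p_y + eps))).sum(axis=1)
        return float(np.exp(kl.mean()))          # exp of the average KL divergence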
< sreenik[m]> rcurtin: Just received the stickers with your note. Looks really cool. Thanks :)
< ShikharJ> sakshamB: Okay, I'll leave you to the tasks then.
< ShikharJ> sakshamB: Toshal: Have a fun week.
< sakshamB> ShikharJ: alright have a great week. :D
xiaohong has quit [Ping timeout: 256 seconds]
ImQ009 has joined #mlpack
< rcurtin> sreenik[m]: awesome, glad they made it to you :)
< sreenik[m]> :)
< akhandait> sreenik[m]: Hey
< sreenik[m]> akhandait: Hey!
< akhandait> How did the last week go?
< sreenik[m]> Quite constructive. The weight extraction is complete.
< sreenik[m]> The transfer of weights to mlpack model is also done
< sreenik[m]> To summarize, linear models are getting converted as expected
< akhandait> Great, did you face any of those problems you thought you might face?
< akhandait> sreenik[m]: Awesome!
< sreenik[m]> I did but many of them are solved now
< akhandait> I would suggest opening a WIP PR and keeping it updated as you go
< sreenik[m]> Ah okay sounds reasonable
< akhandait> If the linear models work, then people can maybe leave any other suggestions they have on the PR
< sreenik[m]> Yup
< akhandait> Also, about the parser, when do you think you can open the new PR?
< sreenik[m]> I hope to get convolutions working soon. The issues are:
< sreenik[m]> akhandait: About the parser, I will take just one day to finish it up, whenever you say, I can get it done by the next day
< sreenik[m]> So, the issues are: 1) There is a supported parameter called groups for the conv layer in onnx. Not sure exactly what it does, but it is missing in mlpack.
< akhandait> Okay, can you try to do it by Wednesday night? I was hoping we could get that merged before the first evaluations.
< akhandait> Okay, go on
< sreenik[m]> 2) The maxpool layer does not have *pads* in mlpack (zoq had said he will look into that, so that will be fixed soon)
< sreenik[m]> 3) A couple of mlpack layers, like batchnorm and selu, have a few non-customizable parameters
< sreenik[m]> That's what I remember for now
< sreenik[m]> akhandait: Yes, I can get the PR into mergeable state by Wednesday night
< akhandait> Okay, I think it will be a bit easier to help you with these specific problems once you open a WIP PR.
< akhandait> About the groups parameter, I will check it out and see what we can do about it.
< sreenik[m]> I agree. I'll do it in a couple of hours (after a little code cleaning), in my own repo?
< akhandait> 2) That's great
< akhandait> sreenik[m]: Oh, sure. I first thought that you would open it in the mlpack repo directly, but now I realized that we are planning to create a new repo for this stuff.
< akhandait> So, your profile works
< akhandait> 3) Let me check the batchnorm and selu source quickly
< sreenik[m]> Yes, I will try to set up a repo in my profile, structured the way I think the new mlpack repo will look. That would make the work a lot easier
< akhandait> sreenik[m]: Nice idea.
< akhandait> 3) Sorry, I didn't exactly get the 3rd problem
< sreenik[m]> Say for example, in mlpack's implementation of batchnorm, we can specify epsilon, but in onnx's case we can specify epsilon and momentum (that's what it looks like from the outside, as I have never used momentum in batchnorm before)
< akhandait> Hmm, I get it
< akhandait> I think for frequently used layers like batchnorm, we could make changes in mlpack's source to add the parameters that onnx supports but we don't.
< akhandait> It will only make our layers more flexible
< akhandait> zoq: What do you think?
< sreenik[m]> Looks like a reasonable solution to me, let's see what zoq has to say
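For context, the mismatch sreenik[m] describes can be seen by reading the attributes off an ONNX BatchNormalization node with the onnx Python package; a minimal sketch, assuming a hypothetical model.onnx file (per the discussion above, mlpack's batchnorm only exposes the epsilon half of this pair):

    import onnx

    # Hypothetical file; any ONNX graph containing a BatchNormalization node will do.
    model = onnx.load("model.onnx")
    for node in model.graph.node:
        if node.op_type == "BatchNormalization":
            attrs = {a.name: onnx.helper.get_attribute_value(a) for a in node.attribute}
            # ONNX defines both attributes (defaults apply when absent);
            # the momentum attribute has no counterpart in mlpack's batchnorm layer.
            print("epsilon:", attrs.get("epsilon"), "momentum:", attrs.get("momentum"))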
< akhandait> sreenik[m]: If we decide to go ahead with this, I guess you would have to make a list of a reasonable number of important layers that lack some parameters compared to onnx, which we can then add.
< sreenik[m]> Yes, that's right. Moreover, we should also make a list of the layers we would support. There don't seem to be too many, though
< akhandait> Yeah, we should make a list.
< akhandait> We would also need to add this list in the documentation of our new repo
< akhandait> It should be clear what we support and what we don't
< sreenik[m]> Let me see. I will give the conv a shot today. If I can successfully get the conversion done (assuming the onnx model does not contain unsupported attributes) then I can compile the list by tomorrow noon
< akhandait> Great, add this list in the readme of the new repo
< sreenik[m]> Okay.
< akhandait> Cool, any other updates?
< sreenik[m]> No, just that I guess I would write the blog after I get that json parser PR into a good state
< sreenik[m]> That is, this Wednesday
< akhandait> Sounds good.
< akhandait> So, how are we doing with our timeline?
favre49 has joined #mlpack
< sreenik[m]> Going at a slower pace than I expected when I made the timeline, sadly
< akhandait> Don't worry, that's what happens in most cases. :)
< sreenik[m]> Unexpected errors and segmentation faults and all, but we are probably a week behind
< favre49> zoq: I have some preliminary results - NEAT is producing errors of at most 0.5 in the current XOR test
< akhandait> sreenik[m]: We have a lot of time left to cover stuff up, I think you are doing good right now.
< zoq> favre49: Are you talking about the infinity issue?
< sreenik[m]> akhandait: Thanks for the assurance. I'll finish these things for now as discussed and give you a report on Wednesday night. I'll keep you updated as I complete each of these small tasks
< favre49> Oh no, I fixed that; it was exactly what you suspected. Sorry for not updating you on that
< zoq> sreenik[m] akhandait: have to take a closer look at the discussion.
< favre49> This is the error on the actual XOR test
< akhandait> sreenik[m]: Sure, good luck with it! Good night :)
< zoq> favre49: So it fails half of the time?
< favre49> basically a fitness of ~3.8 on the current test.
< zoq> hm, not sure if this is good or bad :)
< akhandait> zoq: No rush, whenever you are free
< favre49> No I meant it gives a fitness of at least 3.5, or an error of 0.5 as the result of the training, sorry if that was not clear
< sreenik[m]> akhandait: Thanks, good night!
< favre49> zoq: Well actually there are some problems, and I'll try to figure them out. For one, if instead of passing a random input I pass all possible XOR inputs, the results become far more erratic, and it sometimes produces some spectacularly horrible results.
< favre49> I am aiming for a fitness of 3.9, since that's what the python implementation gives on the XOR test
< zoq> favre49: Okay, that is strange, if I remember right you pushed the XOR test?
< favre49> Yup, but I'm yet to push the changes
< zoq> favre49: okay, the maximal fitness is 4 on the XOR task?
< favre49> Yup
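For context, the fitness figures above appear to follow the usual NEAT XOR setup, where a genome starts at the maximum of 4 and loses the per-case error on each of the four input pairs; a rough sketch under that assumption (the activate callable stands in for evaluating whatever network NEAT produced and is purely illustrative):

    # XOR truth table used to score a candidate network.
    XOR_CASES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

    def xor_fitness(activate):
        # activate: callable mapping an input pair to the network's scalar output.
        fitness = 4.0
        for inputs, target in XOR_CASES:
            fitness -= abs(activate(inputs) - target)  # a perfect network keeps fitness at 4
        return fitness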
< zoq> Do you think you can push the changes today or tomorrow?
< favre49> I'll push the debugged code tomorrow, still have a thing or two to fix.
< zoq> okay sounds good
< zoq> Maybe it makes sense to step through the method using GDB.
< favre49> Yup I'll do that
favre49 has quit [Quit: Page closed]
ImQ009 has quit [Read error: Connection reset by peer]
akfluffy has joined #mlpack
< akfluffy> zoq: thanks so much for the example :) just out of curiosity, how long does it take you to run one pass?
gmanlan has joined #mlpack
< gmanlan> rcurtin: you there?
< rcurtin> gmanlan: yeah, just got back from dinner
< gmanlan> hope it was a good dinner
< gmanlan> qq: you suggested renaming openblas to work around the python build... but the script is looking for a .lib, not a .dll.a
< rcurtin> gmanlan: just boring chipotle :) but it was good enough
< gmanlan> :)
< rcurtin> oh, right, so... hm. I think the regular mlpack build should be linking against the .lib, so I guess that would need to be renamed
< rcurtin> is there a libopenblas.lib file somewhere?
< zoq> akfluffy: < 1 second, the complete run took about 10 seconds
< gmanlan> when we use nuget package manager to obtain openblas, there is no .lib
< gmanlan> that .lib would be available if openblas is built from scratch
< rcurtin> gmanlan: huh, in this case, I'm not sure how any of the mlpack programs or libraries are linking successfully against openblas
< rcurtin> I thought that on Windows, if you were linking against something, you needed the .lib available at link time, but the .dll at runtime
< rcurtin> maybe my memory or understanding is wrong...
< gmanlan> that's the .a I believe in this case - let me double check
< rcurtin> huh, but I guess then python is trying to link against openblas.lib but instead should be linking against openblas.dll.a?
< rcurtin> you could try a "cheap hack" and just copy openblas.dll.a to openblas.lib to see if that will work, and if it does, I can try to fight with setuptools to get it to link directly against openblas.dll.a
< gmanlan> well, the .dll.a helps the program link against the dll (dynamic)
< gmanlan> and that's the only thing that nuget downloads when you include the openblas package
< rcurtin> the first answer seems to suggest that maybe the .dll.a and .lib are equivalent... do you want to try the copy/rename trick and see what happens?
< gmanlan> let me try
< rcurtin> or if you're sure it won't work, I can maybe try to get setuptools to use the .dll.a
< rcurtin> from my end I'm not sure what the right way to link is, so I'm just suggesting trying random things, and if we get one to work I can fight with setuptools to make it do the right thing :)
< gmanlan> well, I have built openblas from scratch, so I know the .lib will be there; it's just the way NuGet works that you only get the .dll.a, so let me try the hack
< rcurtin> ah, ok, yeah
< rcurtin> we'll find out I guess...
< gmanlan> don't worry
< rcurtin> the issue is that for setuptools, I pass in a list of libraries and a list of library paths
< rcurtin> and the intention is that it will link like it would on linux... i.e. it takes the library list ["mlpack", "openblas"] and does -lmlpack -lopenblas
< rcurtin> and uses -L to set the library search directories right using the given library paths
< rcurtin> but the thing is, one can also use the gcc linker to specify a library to link against directly by giving the full path, e.g. /path/to/libmlpack.so
< rcurtin> which is what I'd really prefer to do... but I haven't yet figured out how to make setuptools do that
< rcurtin> (my processing of the library locations and names is why the "libopenblas.dll" to "openblas.dll" change was needed: I strip the "lib" from the front of the names, since on linux you'd specify -lmlpack, not -llibmlpack
< rcurtin> so, as a result, setuptools expects "mlpack" not "libmlpack")
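A rough sketch of the setuptools arrangement rcurtin describes: the library names (with the "lib" prefix stripped) and the search directories are passed separately, and setuptools turns them into -l and -L flags rather than accepting a direct path to the library. All names and paths below are placeholders:

    from setuptools import setup, Extension

    ext = Extension(
        "mlpack_bindings",                     # placeholder module name
        sources=["bindings.cpp"],              # placeholder source file
        libraries=["mlpack", "openblas"],      # emitted as -lmlpack -lopenblas
        library_dirs=["/path/to/mlpack/lib",   # emitted as -L search directories
                      "/path/to/openblas/lib"],
    )

    setup(name="mlpack", ext_modules=[ext])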
< gmanlan> I see
< gmanlan> that's a lot of work to keep it compatible across OSs
< gmanlan> "I'm not sure how any of the mlpack programs or libraries are linking successfully against openblas" --> by just specifying the .dll.a in -DBLAS_LIBRARY and -DLAPACK_LIBRARY
< rcurtin> ok, so the linker is just linking directly against the .dll.a then
< rcurtin> ok, so I think the renaming trick will work, and on my end I'll just have to fight with setuptools to make it handle direct library names better
< gmanlan> probably... I'm running a build to double check
< rcurtin> thanks
< gmanlan> rcurtin: it seems it worked
< gmanlan> at least 44 errors went away
< gmanlan> it seems there is an extra decoding error: mlpack\kernel_pca.pyx:83:67: Decoding error, missing or incorrect coding=<encoding-name> at top of source (cannot decode with encoding 'utf-8': invalid start byte)