ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
b1 is now known as mistiry
ImQ009 has joined #mlpack
eddelbuettel[m] has left #mlpack []
< jeffin143[m]> @himanshu_pathak:matrix.org: wow, you have added some nice improvements
< jeffin143[m]> Deep belief, SVM
< R-Aravind[m]> Hello guys
< zoq> R-Aravind[m]: Hello there.
< R-Aravind[m]> I wish to help, so can you tell me what you guys are working on?
< jeffin143[m]> There are many things we are working on
< jeffin143[m]> You can view the GitHub issues
< jeffin143[m]> @walragatver:matrix.org if you have the bandwidth today, could you look at the last two commits?
< jeffin143[m]> Maybe we can finally complete the PR curve today?
< jeffin143[m]> No issue at all if you don't have the time, though.
< rcurtin> zoq: thanks for the automatic video meet-up email, it does a much better job than I was doing :-D
< zoq> rcurtin: :)
< KimSangYeon-DGU[> kartikdutt18, sakshamb189: Are you there?
< kartikdutt18[m]> Hey KimSangYeon-DGU , sakshamb189 , I'm here.
< sakshamb189[m]> Hey I am here
< KimSangYeon-DGU[> Great, can you tell us the progress?
< kartikdutt18[m]> Sure, I converted the weights of the Darknet53 model and pushed all the required changes to the PR. I'm getting nearly the same output for Darknet53; I'm looking into why I'm not getting exactly the same output, which was the case for Darknet19.
< KimSangYeon-DGU[> Do you think it's because of the data type like uint8?
< kartikdutt18[m]> Not really, I am testing with the same tensor as in PyTorch.
< KimSangYeon-DGU[> And does the slightly different output affect the prediction?
< sakshamb189[m]> were you able to find the fix for the preprocessing in pytorch ?
< kartikdutt18[m]> Yes it does; on the test set that I used for Darknet19, Darknet53 got 61% in PyTorch and 53% in mlpack.
< kartikdutt18[m]> <sakshamb189[m] "were you able to find the fix fo"> In python, using np.uint8 gave nearly the same result as ToTensor().
< sakshamb189[m]> so, there was no difference with using np.uint8? I am not sure what you mean
< KimSangYeon-DGU[> Yes, they should be the same I guess
< kartikdutt18[m]> Yes, they were the same to within 1e-2.
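A minimal sketch of the tolerance check being described, assuming both outputs are available as Armadillo matrices; the helper name is hypothetical:

```cpp
#include <armadillo>

// Compare the mlpack and PyTorch outputs element-wise with an absolute
// tolerance of 1e-2, the agreement level mentioned above.
bool OutputsMatch(const arma::mat& mlpackOut, const arma::mat& torchOut)
{
  return arma::approx_equal(mlpackOut, torchOut, "absdiff", 1e-2);
}
```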
< sakshamb189[m]> so we don't really need to use that and we have not been able to figure out an equivalent for that in mlpack? Is that correct?
< kartikdutt18[m]> If we iterate over the input features, cast them as uint8, and divide by 255, it should work. I spent most of yesterday and today reimplementing the Darknet53 model in PyTorch so that the converter can access all of its weights (since it's made up of residual networks), and then making the corresponding changes in the mlpack implementation. So I haven't checked how we would cast it yet, but maybe that could be another PR for preprocessing that we can add with the converter.
< sakshamb189[m]> the PyTorch implementation you are referring to must have the preprocessing as well, right?
< kartikdutt18[m]> The preprocessing is just ToTensor() in PyTorch. In mlpack it would be casting the input as uint8 and dividing it by 255.
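A minimal sketch of this ToTensor() equivalent, assuming the raw pixel values arrive in an Armadillo matrix; the function name is hypothetical:

```cpp
#include <cstdint>
#include <armadillo>

// Truncate each value to uint8 (as np.uint8 would) and scale into
// [0, 1] by dividing by 255, mimicking PyTorch's ToTensor().
arma::mat ToTensorLike(const arma::mat& input)
{
  arma::mat output = input;
  output.transform([](double v)
      { return static_cast<std::uint8_t>(v) / 255.0; });
  return output;
}
```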
< sakshamb189[m]> alright, I see. And what accuracy does the PyTorch model achieve?
< sakshamb189[m]> can you share the link?
< kartikdutt18[m]> darknet19 or 53?
< sakshamb189[m]> both
< kartikdutt18[m]> Sure.
< kartikdutt18[m]> [Darknet19 reference implementation](https://github.com/kartikdutt18/pytorch-darknet19), [Darknet53 reference implementation](https://github.com/kartikdutt18/PyTorch-Darknet53), and [benchmarks for both](https://pjreddie.com/darknet/imagenet/).
< kartikdutt18[m]> On a uniformly sampled test set from Imagenette, Darknet19 achieves an accuracy of 76% in both PyTorch and mlpack.
< sakshamb189[m]> so you trained these models on your own in PyTorch?
< KimSangYeon-DGU[> Oh, you mean that DarkNet19 is the same between mlpack and PyTorch, but DarkNet53 isn't
< kartikdutt18[m]> Oh no, these were open in my Chrome window; they are forks on GitHub.
< kartikdutt18[m]> <KimSangYeon-DGU[ "Oh, you mean that DarkNet19 is t"> Yes.
< KimSangYeon-DGU[> I thought the difference was from ToTensor(), but that's not correct.
< kartikdutt18[m]> I did adapt Darknet53 [here](https://github.com/kartikdutt18/DarkNet-Models-In-PyTorch/blob/master/darknet_53.ipynb) to extract the weights.
< kartikdutt18[m]> <KimSangYeon-DGU[ "I thought the difference is from"> For testing I use tensor csv as input so that it doesn't affect the model performance in any way.
< KimSangYeon-DGU[> Ok
< KimSangYeon-DGU[> I see
< kartikdutt18[m]> We first want to see if the models work correctly so that we can merge the Darknet PR.
< KimSangYeon-DGU[> Then, we can derive that some layer caused the difference
< KimSangYeon-DGU[> on DarkNet53
< kartikdutt18[m]> There are only two layers that are new in Darknet53: a linear layer and a residual layer/network.
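A rough sketch of what such a residual block might look like with mlpack's ann layers, assuming the Residual<> alias (a Sequential<> that adds the identity connection back) and illustrative channel sizes:

```cpp
#include <mlpack/methods/ann/layer/layer.hpp>

using namespace mlpack::ann;

// One Darknet53-style block: a 1x1 bottleneck followed by a padded 3x3
// convolution, with the block's input added back to its output.
Residual<>* MakeResidualBlock()
{
  auto* block = new Residual<>();
  block->Add<Convolution<>>(64, 32, 1, 1);              // 1x1 bottleneck.
  block->Add<BatchNorm<>>(32);
  block->Add<LeakyReLU<>>(0.01);                        // Darknet53's slope.
  block->Add<Convolution<>>(32, 64, 3, 3, 1, 1, 1, 1);  // 3x3, padding 1.
  block->Add<BatchNorm<>>(64);
  block->Add<LeakyReLU<>>(0.01);
  return block;
}
```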
< sakshamb189[m]> I think kartikdutt18 is working on that right?
< kartikdutt18[m]> Yes.
< KimSangYeon-DGU[> Great
< kartikdutt18[m]> A few differences that I already fixed: Darknet19 uses a threshold of 1e-1 whereas Darknet53 uses a threshold of 1e-2 in LeakyReLU, and I got the parameter count to match.
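A small sketch of the parameter-count check mentioned, assuming the model is built as an mlpack FFN (layers omitted here):

```cpp
#include <iostream>
#include <mlpack/methods/ann/ffn.hpp>

int main()
{
  mlpack::ann::FFN<> darknet53;
  // ... Add() each Darknet53 layer here ...
  darknet53.ResetParameters();  // Allocate the flattened weight vector.

  // Compare this count against the PyTorch reference model's count.
  std::cout << darknet53.Parameters().n_elem << " parameters" << std::endl;
  return 0;
}
```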
< sakshamb189[m]> Do you think you can make the fix in the converter as well?
< sakshamb189[m]> in this PR only?
< kartikdutt18[m]> Sure, I can have a separate PR to do that, if that is what you meant.
< KimSangYeon-DGU[> Ok, then kartikdutt18: you'll work on DarkNet53 to figure out where the difference is, and then work on the converter, right?
< KimSangYeon-DGU[> After that, we need to try to make preprocessed input data the same between mlpack and PyTorch.
< KimSangYeon-DGU[> I mean ToTensor() in PyTorch
< sakshamb189[m]> kartikdutt18: you were saying that you were able to find the preprocessing that needs to be done for ToTensor(), so you should be able to implement that now?
< sakshamb189[m]> in this PR only?
< kartikdutt18[m]> I think the converter works for both Darknet19 and Darknet53, and we already have the weights for Darknet19. We expand it for other layers as needed: I added the linear layer today for Darknet53, and when we work on YOLO we can add any other required layers as well. As sakshamb189 said, if we can verify the models are correct with the converter first in order to merge them, would it be better if we first verified and merged all the models, and then had a PR for the converted weights and required preprocessing?
< kartikdutt18[m]> <sakshamb189[m] "in this PR only?"> Sure, I would first have to try that but I think that's doable.
< sakshamb189[m]> alright great :)
< KimSangYeon-DGU[> Ok, Great
< KimSangYeon-DGU[> Is there anything to discuss?
< KimSangYeon-DGU[> I'm done
< kartikdutt18[m]> <kartikdutt18[m] "I think the converter works for "> Could you let me know if this makes sense: once I ensure that Darknet53 is correct and nothing needs to be changed, we merge that, and then repeat for the YOLO models.
< sakshamb189[m]> yup, but if we can fix the weight converter, we should also try to merge that in this PR itself.
< KimSangYeon-DGU[> In my opinion, we can merge this if we verify the model is correct, but we should work on the normalization issue.
< kartikdutt18[m]> Sure, I will first try to get the model working and push all the changes to the converter repo. Then I can try the normalization issue (I think what I said above should work).
< KimSangYeon-DGU[> Yes, the best would be to load and normalize images using mlpack's own functionality
< KimSangYeon-DGU[> without PyTorch
< kartikdutt18[m]> Sure, I will add an ImageNet preprocessor function in the preprocessing directory.
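A sketch of what that preprocessor might look like, assuming mlpack's image loading support (data::Load with an ImageInfo) is available; the function name is hypothetical:

```cpp
#include <cstdint>
#include <mlpack/core.hpp>

// Load an image with mlpack and apply the ToTensor()-style scaling,
// so no PyTorch is needed anywhere in the pipeline.
arma::mat LoadAndNormalize(const std::string& path)
{
  arma::mat image;
  mlpack::data::ImageInfo info;
  mlpack::data::Load(path, image, info, true);  // One image per column.
  image.transform([](double v)
      { return static_cast<std::uint8_t>(v) / 255.0; });
  return image;
}
```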
< KimSangYeon-DGU[> Ok, thanks!
< KimSangYeon-DGU[> Is there anything to discuss?
< kartikdutt18[m]> I think that's it from my side. Hopefully we should be able to merge the Darknet PR in the next couple of days with the preprocessor function.
< KimSangYeon-DGU[> Yes
< sakshamb189[m]> yes alright then let's meet next week. Thanks guys!
< KimSangYeon-DGU[> Thanks for the meeting, have a nice week!
< kartikdutt18[m]> Great, Have a great week.
< himanshu_pathak[> > @himanshu_pathak:matrix.org: wow, you have added some nice improvements
< himanshu_pathak[> > Deep belief, SVM
< himanshu_pathak[> Thanks jeffin143! Just trying to finish them soon. DBN will require a lot of work; kernel_svm also needs some improvement :)
mlpack-meeting has joined #mlpack
< mlpack-meeting> Hello everyone, video meeting in about 30 minutes - https://zoom.us/j/3820896170
mlpack-meeting has quit [Remote host closed the connection]
< rcurtin> awesome, successful chat notification too :)
< abernauer[m]> the password for the meeting is mlpack, correct?
< zoq> abernauer[m]: yes
< shrit[m]> True, it is a good reminder
ImQ009 has quit [Quit: Leaving]
< rcurtin> if any maintainer is willing to review this so we can get it merged I would appreciate it :) https://github.com/mlpack/mlpack/pull/2543
< zoq> rcurtin: Was just waiting for an update on the LDFLAGS comment :)
< rcurtin> yeah, it's a good thing Yashwant caught it; I'm not sure how long I would have spent debugging it :)