ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"
< KimSangYeon-DGU[>
The training time took about an hour and a half
< kartikdutt18[m]>
Hmm, I can't get through even a single epoch in an hour. The model is a bit different from Darknet 19.
< KimSangYeon-DGU[>
Yes, it's a simpler model than Darknet 19.
< KimSangYeon-DGU[>
Maybe there is a bottleneck in the DarkNet implementation.
< kartikdutt18[m]>
What do you suggest?
< KimSangYeon-DGU[>
When you trained, can you use multiple cores?
< KimSangYeon-DGU[>
* When you trained, could you use multiple cores by any chance?
< kartikdutt18[m]>
I am using OpenMP.
< KimSangYeon-DGU[>
Ok
< kartikdutt18[m]>
I'll just check the number of threads.
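(For reference, a minimal standalone sketch of checking how many threads OpenMP will actually use; compile with -fopenmp. This is generic OpenMP code, not part of the training script.)

```cpp
// Quick check of the OpenMP thread count (compile with -fopenmp).
#include <omp.h>
#include <cstdio>

int main()
{
  std::printf("max threads: %d\n", omp_get_max_threads());

  #pragma omp parallel
  {
    #pragma omp single
    std::printf("threads in this parallel region: %d\n", omp_get_num_threads());
  }
  return 0;
}
```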
< KimSangYeon-DGU[>
My laptop spec is i7, 16gb (8 cores)
< kartikdutt18[m]>
I am using 12 threads according to activity manager.
< kartikdutt18[m]>
Should I send you the training subset, or use zoq's machine if that would be faster?
< KimSangYeon-DGU[>
Let me try to train from scratch. Can I reproduce the training by using the DarkNet model PR?
< KimSangYeon-DGU[>
Did you make any change in your local repo?
< KimSangYeon-DGU[>
, not the remote repo
< kartikdutt18[m]>
Let me just push it again ( I think only training params might be different).
< KimSangYeon-DGU[>
Ok, and I found that in the Darknet framework the loading time is very fast. It only loads as many images as the batch size at a time.
< KimSangYeon-DGU[>
The batch size was 128
< kartikdutt18[m]>
We load the whole dataset first and then train the model.
< KimSangYeon-DGU[>
Right
< KimSangYeon-DGU[>
I'll reproduce the small network in mlpack and then we'll see what's happening.
< kartikdutt18[m]>
Sure, Let me know if I need to make the model.
< KimSangYeon-DGU[>
:)
< kartikdutt18[m]>
I can reuse the functions in the DarkNet class.
< KimSangYeon-DGU[>
But your laptop is already busy training the model, so I think it's good to leave it as it is for the time being.
< kartikdutt18[m]>
Sure :)
< KimSangYeon-DGU[>
Without stopping
< KimSangYeon-DGU[>
:)
< KimSangYeon-DGU[>
I was trying to find any Darknet 19 references other than the official one, but I haven't found anything yet. I'll share it with you if I find something.
< kartikdutt18[m]>
Thanks, I'll check if we can load images in parallel as well. Will keep posting updates about training.
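(One possible shape for parallel loading, sketched with OpenMP around mlpack's image `data::Load` overload; the function name and file list are illustrative, and whether concurrent `Load` calls are safe would still need to be verified.)

```cpp
// Illustrative sketch: load each image into its own column of the dataset in
// parallel. Assumes every file exists and has the same width/height/channels.
#include <mlpack/core.hpp>
#include <vector>
#include <string>

void LoadImagesParallel(const std::vector<std::string>& files,
                        arma::mat& dataset,
                        mlpack::data::ImageInfo& info)
{
  // Load the first image serially to fix the number of rows.
  mlpack::data::Load(files[0], dataset, info, true);
  dataset.resize(dataset.n_rows, files.size());

  // Each file goes into its own column, so the writes do not overlap.
  #pragma omp parallel for schedule(dynamic)
  for (long i = 1; i < (long) files.size(); ++i)
  {
    arma::mat image;
    mlpack::data::ImageInfo localInfo = info;  // Per-thread copy.
    mlpack::data::Load(files[i], image, localInfo, true);
    dataset.col(i) = image;
  }
}
```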
< KimSangYeon-DGU[>
Thanks!
ImQ009 has joined #mlpack
< KimSangYeon-DGU[>
kartikdutt18: Let me train the model using Darknet 19 in the Darknet framework first.
< kartikdutt18[m]>
Ok, on Cifar 10?
< KimSangYeon-DGU[>
Yes
< kartikdutt18[m]>
I think you would have to change a few pooling layers (increase padding) because for a 32 x 32 input the size will go to 0.
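(For reference, the output-size arithmetic behind that remark: a 32 x 32 input is exhausted by Darknet-19's five stride-2 pooling stages, so any later unpadded layer would compute a zero-sized output unless padding is increased.)

```cpp
// Spatial output size of a conv/pool stage:
//   out = floor((in + 2*pad - kernel) / stride) + 1.
// For a 32 x 32 input and five 2x2, stride-2 max pools (pad = 0):
//   32 -> 16 -> 8 -> 4 -> 2 -> 1
// so a further unpadded 3x3 convolution or pooling stage would drop to 0.
int OutSize(const int in, const int kernel, const int stride, const int pad)
{
  return (in + 2 * pad - kernel) / stride + 1;
}
```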
< saksham189Gitter>
@kartikdutt18 Did you find any examples of using Darknet on CIFAR dataset online? Do you think we should try with a different dataset?
< KimSangYeon-DGU[>
And the initial loss is `2.419931`
< kartikdutt18[m]>
I didn't find darknet 19 or 53 with cifar 10. There is only darknet small. I think I got nearly the same loss with Xavier Initialization but it didn't change after 2 epochs for various learning rates.
< KimSangYeon-DGU[>
I'm training the model with cifar-10 and Darknet 19.
< KimSangYeon-DGU[>
saksham189 (Gitter): Would it be better for us to change the dataset to imagenet?
< saksham189Gitter>
Yes but I guess the training time would be even longer, right?
< KimSangYeon-DGU[>
I think so
< kartikdutt18[m]>
I think so, yes
< saksham189Gitter>
and ImageNet has 1000 classes, so it's definitely a much harder problem.
< kartikdutt18[m]>
Yes.
sakshamb189[m] has joined #mlpack
< KimSangYeon-DGU[>
I'm training the Darknet-19 model for CIFAR-10 in the Darknet framework. After the first epoch, let me check the validation error.
< KimSangYeon-DGU[>
* I'm training the Darknet-19 model for CIFAR-10 in the Darknet framework. After the first epoch, let me check the validation accuracy.
< KimSangYeon-DGU[>
To make the benchmark
< KimSangYeon-DGU[>
Currently, the loss is not decreasing
< KimSangYeon-DGU[>
* Currently, the loss is not decreasing easily
< HimanshuPathakGi>
But I have to make some changes to support Gaussian in this implementation
< saksham189Gitter>
I think the Gaussian kernel is already implemented in `src/mlpack/core/kernels/` and you would be adding the radial basis function kernel?
< HimanshuPathakGi>
Yup that's why I am adding template parameter KernelType
< HimanshuPathakGi>
I will use that implementation
< saksham189Gitter>
and you would be adding a radial basis function kernel right?
< HimanshuPathakGi>
The radial basis function kernel and the Gaussian kernel are the same thing.
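(For reference, the Gaussian/RBF kernel is k(x, y) = exp(-||x - y||^2 / (2 * bandwidth^2)); a minimal sketch of evaluating it with the `GaussianKernel` already in `src/mlpack/core/kernels/`. The bandwidth value here is arbitrary.)

```cpp
// Minimal sketch: evaluate the Gaussian (RBF) kernel between two points.
#include <mlpack/core.hpp>
#include <mlpack/core/kernels/gaussian_kernel.hpp>
#include <iostream>

int main()
{
  arma::vec a = {1.0, 2.0, 3.0};
  arma::vec b = {1.5, 2.5, 2.0};

  // k(a, b) = exp(-||a - b||^2 / (2 * bandwidth^2)); bandwidth chosen arbitrarily.
  mlpack::kernel::GaussianKernel kernel(0.5);
  std::cout << "k(a, b) = " << kernel.Evaluate(a, b) << std::endl;
  return 0;
}
```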
< saksham189Gitter>
oh ok.. I see
< saksham189Gitter>
Have you written the blog for this week?
< HimanshuPathakGi>
I want to write it after completing the implementation
< HimanshuPathakGi>
So that I have something valuable to add
< saksham189Gitter>
okay sure :) just make sure you share it here when you are done
< HimanshuPathakGi>
Yup I will try to work tonight on it:)
< HimanshuPathakGi>
I have just one concern
< saksham189Gitter>
yup sure, we can discuss right now
< HimanshuPathakGi>
After completing it, I have to compare it with libsvm.
< HimanshuPathakGi>
But there they were using an optimised version of SMO.
< HimanshuPathakGi>
So, I think it will work better than our SVM.
< HimanshuPathakGi>
I think we should discuss this after the implementation; that will be a better approach :)
< saksham189Gitter>
I think we can first implement the basic functionality of the kernel svm and then do the comparison and discuss which optimisations if any we might want to implement. Let me know what you think.
< HimanshuPathakGi>
Yup, that makes sense; we will discuss it after implementing the basic functionality :)
< saksham189Gitter>
also are you following the implementation here
< HimanshuPathakGi>
Yup, but I have to make some changes in it to support the Gaussian kernel as well.
< HimanshuPathakGi>
So the implementation will not be exactly the same.
< saksham189Gitter>
yup sure I see.
< saksham189Gitter>
Let me know when the PR is ready for a review.
< saksham189Gitter>
Is there anything else we should discuss?
< HimanshuPathakGi>
Sure I will tag you for a review :)
< HimanshuPathakGi>
Anything else you'd like to discuss?
< saksham189Gitter>
If there are any blockers you are facing then let me know.
< HimanshuPathakGi>
Yup, right now I am not sure of any; I will ask for help if I get stuck.
< saksham189Gitter>
alright sure. Then we will meet next time. Have a great week ahead.
< HimanshuPathakGi>
Yeah, until next time. Have a nice week.
< HimanshuPathakGi>
:)
< sakshamb189[m]>
kartikdutt18: do we have a meeting right now?
< kartikdutt18[m]>
Hi sakshamb189 , we do.
< sakshamb189[m]>
I guess you are still training the DarkNet model, right?
< kartikdutt18[m]>
Right, Let me post another update on the PR.
< sakshamb189[m]>
Ok sure. Is there any improvement in the validation accuracy?
< sakshamb189[m]>
I think the model might be overfitting on the CIFAR dataset since we have also reduced the training size.
< kartikdutt18[m]>
Both are increasing but the increase is very slow.
< kartikdutt18[m]>
* Both training and validation accuracy; however, they are not good enough for classification.
< sakshamb189[m]>
so what's the final validation accuracy you are getting after 3 epochs?
< kartikdutt18[m]>
The third epoch isn't complete yet. It's at 94%.
< kartikdutt18[m]>
For the second it was a bit over 11%
< sakshamb189[m]>
and the training set size was 12.5k right?
< kartikdutt18[m]>
Yes.
< sakshamb189[m]>
did you check the class distribution in the train set?
< sakshamb189[m]>
Is it uniform?
< kartikdutt18[m]>
Each class has the number of images. (1250)
< kartikdutt18[m]>
* Each class has the same number of images. (1250)
< sakshamb189[m]>
and I am guessing you have a uniform distribution with the validation set right?
< kartikdutt18[m]>
Hmm, I don't think so. I used mlpack's data split and I don't think that gives a uniform distribution.
< kartikdutt18[m]>
It randomly picks indices from the dataset.
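(For context, a minimal sketch of that split with `mlpack::data::Split`: it draws shuffled indices, so per-class proportions in the two parts are only uniform in expectation rather than guaranteed; a stratified split would have to be done class by class.)

```cpp
// Sketch: random (non-stratified) train/validation split with mlpack.
#include <mlpack/core.hpp>
#include <mlpack/core/data/split_data.hpp>

void SplitExample(const arma::mat& dataset, const arma::Row<size_t>& labels)
{
  arma::mat trainData, validData;
  arma::Row<size_t> trainLabels, validLabels;

  // 20% of the columns go to the validation set, chosen by shuffled indices;
  // class balance is not enforced.
  mlpack::data::Split(dataset, labels, trainData, validData,
                      trainLabels, validLabels, 0.2);
}
```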
< sakshamb189[m]>
when we used the entire train set for training we got 76% accuracy on the validation set right?
< kartikdutt18[m]>
On training set.
< kartikdutt18[m]>
We didn't have the validation part there then. I added it in the next trial
< sakshamb189[m]>
alright and right now our train accuracy is around 11% as well .
< kartikdutt18[m]>
Yes.
< sakshamb189[m]>
I am a bit confused, since even if the model is overfitting, the train accuracy should have been higher (compared to our previous trial with the full train set), but right now it is very low.
< kartikdutt18[m]>
I can try a higher learning rate. I tried 0.1, 0.01 and 0.0001 for about 300 iterations each, and only 0.001 led to a decrease in loss.
< kartikdutt18[m]>
Hmm, I don't think this version is overfitting, I think the one with the full dataset did.
< sakshamb189[m]>
Did you make any changes to the implementation after that?
< kartikdutt18[m]>
I tried a few things. I tried different initializations; right now I am using random, since the model didn't change its loss with a different one. Since I was trying high learning rates, I also added a batch norm layer before the last linear layer.
< kartikdutt18[m]>
* I tried a few things. I tried different initializations; right now I am using random, since the model didn't show a decrease in loss with a different one. Since I was trying high learning rates, I also added a batch norm layer before the last linear layer.
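(For reference, a rough sketch of "a batch norm layer before the last linear layer" with mlpack's ANN layers; the 1024-dimensional feature size and the loss/initialization choices here are placeholders, not the actual DarkNet tail.)

```cpp
// Sketch of a classifier tail with BatchNorm just before the final Linear layer.
#include <mlpack/core.hpp>
#include <mlpack/methods/ann/ffn.hpp>
#include <mlpack/methods/ann/layer/layer.hpp>
#include <mlpack/methods/ann/loss_functions/negative_log_likelihood.hpp>
#include <mlpack/methods/ann/init_rules/random_init.hpp>

using namespace mlpack::ann;

void AddClassifierTail(FFN<NegativeLogLikelihood<>, RandomInitialization>& model)
{
  // ... convolutional feature extractor assumed to be added already ...
  model.Add<BatchNorm<>>(1024);    // Normalize the 1024-dim feature vector.
  model.Add<Linear<>>(1024, 10);   // 10 classes (CIFAR-10).
  model.Add<LogSoftMax<>>();
}
```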
< sakshamb189[m]>
why do you think the previous one was overfitting while this one is not?
< kartikdutt18[m]>
The darknet 19 model converged very quickly, If that was the case it should have converged again with nearly the same params. I don't training accuracy of nearly 80% can be achieved on a single pass on the dataset.
< kartikdutt18[m]>
* The Darknet 19 model converged very quickly; if that was the case it should have converged again with nearly the same params. I don't think a training accuracy of nearly 80% can be achieved in a single pass over the dataset.
< sakshamb189[m]>
but we used almost the same architecture again with a smaller training set right ?
< kartikdutt18[m]>
Right.
< kartikdutt18[m]>
I even tried on the full set before training on the subset; I wasn't able to reproduce the results of a few hundred iterations, hence I said it was overfitting.
< sakshamb189[m]>
yeah but the only changes that were made in between were adding a batch-norm layer and changing some hyper-parameters right?
< kartikdutt18[m]>
Yes. I also don't understand how the accuracy was achieved.
< sakshamb189[m]>
IMO we can remove the batch norm layer and try training on the small test set and see if that helps to improve the validation accuracy.
< sakshamb189[m]>
Let me know what you think.
< kartikdutt18[m]>
Sure, but then we can't use the current weights. I can stop the training and make the change. Also, I'll try experimenting with the learning rate. I know mlpack has a hyperparameter tuner class; can we use that?
< kartikdutt18[m]>
and ensmallen has grid search
< sakshamb189[m]>
Alright just save the current weights.
< kartikdutt18[m]>
Ok, the weights will be saved after three epochs, in 15-20 minutes.
< kartikdutt18[m]>
*5-10 mins
< sakshamb189[m]>
yes so we can wait for that to finish and then restart the training :)
< sakshamb189[m]>
then I guess we could go ahead with Xavier? what do you think?
< kartikdutt18[m]>
Sure, If we can find a good learning rate for that we won't have to do a lot of epochs.
< kartikdutt18[m]>
Also, about transferring weights from the Darknet framework: I'll try to develop a proof of concept for it. If it works we can avoid training completely.
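(A rough sketch of where such a proof of concept could start. It assumes, which would need to be verified against the Darknet source, that a `.weights` file is a small integer header followed by raw 32-bit floats in layer order; the function name is hypothetical, and mapping the floats onto mlpack layer parameters is the real work.)

```cpp
// Hypothetical starting point for a Darknet -> mlpack weight-transfer POC.
// Assumed layout (to be verified): 3 x int32 header (major, minor, revision),
// a "seen" image counter, then raw float32 weights in layer order.
#include <mlpack/core.hpp>
#include <cstdint>
#include <fstream>
#include <vector>

arma::vec ReadDarknetWeights(const std::string& path)
{
  std::ifstream in(path, std::ios::binary);

  int32_t header[3];
  in.read(reinterpret_cast<char*>(header), sizeof(header));

  // Newer Darknet versions store the "seen" counter as 64 bits, older as 32.
  in.ignore((header[0] * 10 + header[1] >= 2) ? sizeof(uint64_t)
                                              : sizeof(uint32_t));

  // Everything that follows is raw float32 data.
  std::vector<float> raw;
  float value;
  while (in.read(reinterpret_cast<char*>(&value), sizeof(float)))
    raw.push_back(value);

  // mlpack parameters are doubles, so widen while copying.
  return arma::conv_to<arma::vec>::from(arma::fvec(raw));
}
```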
< KimSangYeon-DGU[>
So sorry...
< sakshamb189[m]>
Alright sure. Is there anything else we should discuss?
< sakshamb189[m]>
KimSangYeon-DGU: it's fine. no worries
< kartikdutt18[m]>
Not really. KimSangYeon-DGU , did you get a validation accuracy?
< KimSangYeon-DGU[>
It'll be done after 2 mins
< kartikdutt18[m]>
Ahh nice.
< KimSangYeon-DGU[>
I had difficulty getting accustomed to the time difference...
< kartikdutt18[m]>
No worries.
< KimSangYeon-DGU[>
8000 images left
< KimSangYeon-DGU[>
So far, the accuracy is 10%
< kartikdutt18[m]>
On training or on validation?
< KimSangYeon-DGU[>
I did 4 epochs
< KimSangYeon-DGU[>
on the test dataset in cifar/test directory
< KimSangYeon-DGU[>
for the 10,000 images
< kartikdutt18[m]>
Hmm, I think the mlpack implementation gives the same result in nearly the same number of epochs.
< KimSangYeon-DGU[>
Yes, I think the image size of CIFAR-10 is too small for the Darknet-19 architecture.
< kartikdutt18[m]>
Would resizing help? I think that would degrade the image quality.
< KimSangYeon-DGU[>
Yes, it degrades the quality and in the end it doesn't help...
< KimSangYeon-DGU[>
The validation accuracy is 10%
< KimSangYeon-DGU[>
Can we find more higher resolution image classification dataset?
< KimSangYeon-DGU[>
* Can we find a higher-resolution image classification dataset?
< KimSangYeon-DGU[>
Imagenet is too much... haha 122GB, as far as I know
< kartikdutt18[m]>
Hmm, I am also not sure if Darknet handles it internally. In the object detection script, I used MinMaxScaler on the images.
< KimSangYeon-DGU[>
I'm not sure either. It needs to be checked.
< KimSangYeon-DGU[>
zoq: As an alternative, we thought about the possibility of migrating Darknet's pretrained weights to mlpack. Does that make sense to you?
< zoq>
KimSangYeon-DGU[: Yes sounds like a good idea to me, I guess we "just" have to make sure the format is the same.
< zoq>
Personally, I would work on the pre-trained model option first; I guess that includes image scaling as well.
< KimSangYeon-DGU[>
Ok, what do you think kartikdutt18?
< kartikdutt18[m]>
I will try developing a POC for pretrained weights.
< KimSangYeon-DGU[>
Ok, if we succeeded to do that, we can do in other models as we said :)
< KimSangYeon-DGU[>
* Ok, if we succeeded to do that, we can also apply it to other models as we said :)
< walragatver[m]>
jeffin143: Hi.
< jeffin143[m]>
walragatver: hi
< jeffin143[m]>
@kimsangyeon-dgu:matrix.org: @kartikdutt18:matrix.org are you done with your meet :)
< jeffin143[m]>
If not we can wait
< KimSangYeon-DGU[>
Yes, we're done :) thanks for asking
< KimSangYeon-DGU[>
kartikdutt18: thanks for the meeting and please ping me if you have anything you want to discuss!
< jeffin143[m]>
Also, if possible, I need your review on the image logging PR too.
< jeffin143[m]>
Also I have updated the blog for this week :)
< jeffin143[m]>
That's everything from my side
< walragatver[m]>
jeffin143: Sorry, I got sidetracked.
< walragatver[m]>
I will give you a review soon.
< jeffin143[m]>
walragatver: no issues :) all good
< jeffin143[m]>
I have opened two PRs for that reason, so that work doesn't stop.
< walragatver[m]>
jeffin143: What's the plan ahead? What would you be implementing next?
< jeffin143[m]>
walragatver: if we are done with image and testing
< jeffin143[m]>
I will go for text and audio next
< jeffin143[m]>
I am planning to complete the image and testing PRs before the first phase ends.
< walragatver[m]>
jeffin143: And I saw your blog. I think from this year participants are allowed to write blogs anywhere
< walragatver[m]>
<jeffin143[m] "I will go for text and audio nex"> Okay
< jeffin143[m]>
> jeffin143: And I saw your blog. I think from this year participants are allowed to write blogs anywhere
< jeffin143[m]>
Yes, but I guess that was true every year, we just went with irc last year :)
< jeffin143[m]>
Sorry, the blog repo*
< RyanBirminghamGi>
jeffin143: I'll also give another pass at reviews soon!
< jeffin143[m]>
<jeffin143[m] "Sry the blog repo*"> Ryan Birmingham (Gitter): sure :)
< walragatver[m]>
jeffin143: Okay, are you sure that this time we are allowed to use the blog repo?
< jeffin143[m]>
I guess so
< walragatver[m]>
Because no one else is using it. And I think there was no mention of the blog repo in the introductory mails this time.
< walragatver[m]>
It might happen that we stop maintaining it. So just confirm it.
< jeffin143[m]>
Maybe I can send it through the mailing list too.
< jeffin143[m]>
walragatver: maybe I can send it through the mailing list too
< walragatver[m]>
jeffin143: It's fine; I would say continue with the blog. Just change the location if something happens. Don't use the mailing list, because it's a communication medium.
< jeffin143[m]>
I will confirm it with Ryan and let you know
< walragatver[m]>
jeffin143: Are you also going to add graph support to the library?
< jeffin143[m]>
walragatver: we don't follow the graph convention in mlpack, right?
< walragatver[m]>
Graphs in the sense of histograms, etc.
< jeffin143[m]>
Yes, histogram and PR curve
< jeffin143[m]>
Text audio
< jeffin143[m]>
Embedding
< jeffin143[m]>
There are 5 more things on my list.
< walragatver[m]>
Okay
< walragatver[m]>
The path ahead would be quite smooth now as the CI and testing implementation is over.
< jeffin143[m]>
walragatver: yes, I am sure we can speed up a little bit :)
< jeffin143[m]>
I can write more tutorials then, maybe, if I am left with some time.
< walragatver[m]>
jeffin143: Have you given any thought to callbacks? Any implementation ideas or anything?
< jeffin143[m]>
walragatver: no , I will try to sketch it this week
< jeffin143[m]>
Maybe then we can schedule a meeting with zoq to get his input as well?
< walragatver[m]>
jeffin143: And what about your joining date? When would you be joining the firm?
< jeffin143[m]>
If a user wants to log something else, he has to create it as a custom function and pass it to the callback.
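(One possible shape for that: an ensmallen-style callback whose `EndEpoch()` hook receives the epoch number and the current objective; the console print stands in for whatever user-supplied logging sink would be plugged in.)

```cpp
// Sketch of a user-defined logging callback in the ensmallen callback style.
#include <iostream>
#include <cstddef>

class LossLoggingCallback
{
 public:
  // ensmallen calls EndEpoch() at the end of every epoch.
  template<typename OptimizerType, typename FunctionType, typename MatType>
  void EndEpoch(OptimizerType& /* optimizer */,
                FunctionType& /* function */,
                const MatType& /* coordinates */,
                const size_t epoch,
                const double objective)
  {
    // A custom sink (e.g. a TensorBoard-style writer) could replace this.
    std::cout << "epoch " << epoch << ": objective = " << objective << std::endl;
  }
};

// Usage sketch: model.Train(data, labels, optimizer, LossLoggingCallback());
```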
< walragatver[m]>
jeffin143: I am not sure about the TF callbacks; I will check them out.
< walragatver[m]>
jeffin143: If it's all about accuracy and loss then it would be quite easy to implement.
< jeffin143[m]>
> jeffin143: And what about your joining. When would you be joining the firm?
< jeffin143[m]>
They contacted me and told me that hiring is paused and they wouldn't be offering me a full-time job.
< jeffin143[m]>
They said they will try a contract offer, so as not to leave me stranded midway.