#mlpack on 2022-09-18 — irc logs at libera.irclog.whitequark.org

2021-07-27 15:44 rcurtin_irc changed the topic of #mlpack to: mlpack: a scalable machine learning library (https://www.mlpack.org/) -- channel logs: https://libera.irclog.whitequark.org/mlpack -- NOTE: messages sent here might not be seen by bridged users on matrix, gitter, or slack

10:03 <EshaanAgarwal[m]> <jonpsy[m]> "Shouldn't be that hard, you need..." <- No it isn't. Almost done. Had semester exams which got over today.

12:50 <rcurtin[m]> @jonpsy can you log in to Jenkins via Github at ci.mlpack.org? if so you can access all the job configurations, etc.

12:51 <rcurtin[m]> and the configurations that shrit mentioned are in https://github.com/mlpack/jenkins-conf

12:51 <rcurtin[m]> s/@//

15:27 <EshaanAgarwal[m]> <EshaanAgarwal[m]> "No it isn't. Almost done. Had..." <- I have done this. Should I share the outputfile for mlpack implementation here ?

15:44 <jonpsy[m]> <rcurtin[m]> "@jonpsy can you log in to..." <- Hey thanks! I wanted to learn about deployments, so I thought this would be a good starter.

15:45 <jonpsy[m]> <EshaanAgarwal[m]> "I have done this. Should I share..." <- Sure

15:58 * EshaanAgarwal[m] posted a file: (4KiB) < https://libera.ems.host/_matrix/media/r0/download/matrix.org/wUekqkPECOJtLzJpsWhZCpxM/outputfile.txt >

16:03 <jonpsy[m]> Good, now show me python's in same format.

16:06 <EshaanAgarwal[m]> jonpsy[m]: sure ! before this i had a question.

16:06 <jonpsy[m]> shoot

16:07 <EshaanAgarwal[m]> `const size_t envSampleSize = environment.InitialSample().Encode().n_elem;... (full message at <https://libera.ems.host/_matrix/media/r0/download/libera.chat/e53eb3df91be3c9ebef741f4648dddad91deaa97>)

16:08 <EshaanAgarwal[m]> > <@eshaanagarwal:matrix.org> `const size_t envSampleSize = environment.InitialSample().Encode().n_elem;... (full message at <https://libera.ems.host/_matrix/media/r0/download/libera.chat/68ba8baf811c8afd533c295e6352ad9aba2ab2e6>)

16:08 <jonpsy[m]> is this done per run?

16:08 <jonpsy[m]> or during ctor only?

16:08 <EshaanAgarwal[m]> and in those implementation this piece of code came with this documentation - // Reset all the networks.

16:08 <EshaanAgarwal[m]> // Note: the actor and critic networks have an if condition before reset.

16:08 <EshaanAgarwal[m]> // This is because we don't want to reset a loaded(possibly pretrained) model

16:08 <EshaanAgarwal[m]> // passed using this constructor.

16:09 <EshaanAgarwal[m]> jonpsy[m]: during initialization of constructor of PPO

16:09 <jonpsy[m]> well, then it shouldn't matter; but ofc you'd remove this line for now

16:09 <jonpsy[m]> since you're loading network params manually

16:10 <EshaanAgarwal[m]> jonpsy[m]: yeah agreed but how is it working ?

16:11 <EshaanAgarwal[m]> EshaanAgarwal[m]: i think that it is not doing what it is supposed to do. since here i am loading model but it was still resetting the network

16:11 <jonpsy[m]> didn't you remove it?

16:11 <EshaanAgarwal[m]> jonpsy[m]: then i did when i checked the print statments

16:12 <EshaanAgarwal[m]> but then i had a question: aren't the weights initialized at the time we set the network up ?

16:13 <jonpsy[m]> yes; but read the comment. He's assuming there can be a pre-trained model here

16:13 <EshaanAgarwal[m]> also how does checking the number of elements in network with env sample size help in detecting that ?

16:13 <jonpsy[m]> > <@eshaanagarwal:matrix.org> `const size_t envSampleSize = environment.InitialSample().Encode().n_elem;... (full message at <https://libera.ems.host/_matrix/media/r0/download/libera.chat/235d00f5518ae35a32178f4388363ced1b6fcd46>)

16:14 <EshaanAgarwal[m]> jonpsy[m]: i saw it from SAC implementation in mlpack

16:14 <jonpsy[m]> link?

16:15 <EshaanAgarwal[m]> jonpsy[m]: https://github.com/mlpack/mlpack/blob/master/src/mlpack/methods/reinforcement_learning/sac_impl.hpp#L68

16:20 <jonpsy[m]> Looks shady

16:21 <jonpsy[m]> We have a meet anyway rn; Eshaan Agarwal can you share the link pls?

16:22 <zoq[m]> This is just to make sure Reset() works, if not the number of parameters is 0.
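
The intent described here (run the reset when no parameters exist yet, but leave a loaded and possibly pretrained model alone) can be illustrated with a minimal sketch. It does not reproduce the mlpack code linked above, which is truncated in this log. `ToyNetwork` and `ResetIfEmpty` are hypothetical names that do not come from mlpack, and testing for an empty parameter vector is an assumption chosen for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a network; this is not mlpack's FFN class.
struct ToyNetwork
{
  std::vector<double> parameters;  // Empty until Reset() or a manual load.

  // Allocate one zero-initialized weight per input element, discarding
  // whatever was stored before.
  void Reset(const std::size_t inputSize)
  {
    parameters.assign(inputSize, 0.0);
  }
};

// Reset only when the network holds no parameters yet.
// Returns true if a reset actually happened.
inline bool ResetIfEmpty(ToyNetwork& network, const std::size_t envSampleSize)
{
  if (!network.parameters.empty())
    return false;  // Loaded (possibly pretrained) weights stay untouched.

  network.Reset(envSampleSize);
  assert(network.parameters.size() == envSampleSize);
  return true;
}
```

With a guard of this shape, a freshly constructed network gets its parameters allocated, while one whose weights were set manually beforehand keeps them.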

16:22 <EshaanAgarwal[m]> jonpsy[m]: https://meet.google.com/pnp-rtjw-unz

16:22 <zoq[m]> I can’t join the meeting right now, so please start without me.

16:23 <zoq[m]> I thought we are able to load the parameters already?

16:23 <zoq[m]> Didn’t you say you did that already, kinda confused.

16:24 <EshaanAgarwal[m]> zoq[m]: We are able to. Just when I was checking the weights and how they changed with episodes, I noticed that this piece of code reset the network during initialisation itself. Then I uncommented it and wondered what it's doing

16:25 <EshaanAgarwal[m]> EshaanAgarwal[m]: Commented*

16:25 <zoq[m]> Did you disable exploration?

16:26 <EshaanAgarwal[m]> zoq[m]: What do you mean by that ?

16:27 <zoq[m]> If we have exploration enabled we reset the weights

16:27 <zoq[m]> But looks like it’s not enabled

16:27 <zoq[m]> If you remove it, does it give the correct results?

16:27 <EshaanAgarwal[m]> zoq[m]: I am not sure if I have done anything related to that in implementation 👀

16:28 <EshaanAgarwal[m]> zoq[m]: Yes for manually provided weights everything got smooth. Then out of curiosity I tried to do it without manually set weights

16:28 <jonpsy[m]> Regardless, what of pytorch forward result?

16:29 <EshaanAgarwal[m]> jonpsy[m]: I don't know how to save that in a file. I looked up on the internet

16:29 <jonpsy[m]> Can store an SS; anyway what did you find?

16:30 <EshaanAgarwal[m]> jonpsy[m]: Loss values were a bit different ! I think actor loss is also coming different. There wasn't much difference in critic losses. Even the updated weights looked almost same

16:30 <zoq[m]> So that sounds like forward pass is okay.

16:31 <zoq[m]> Backward pass as well?

16:31 <zoq[m]> You checked the errors?

16:31 <zoq[m]> Can you push the debugging for the forward and backward pass?

16:31 <EshaanAgarwal[m]> What errors ?

16:32 <zoq[m]> Of the backward pass.

16:32 <jonpsy[m]> he meant backward gradients

16:32 <zoq[m]> Right

16:32 <EshaanAgarwal[m]> zoq[m]: Push where ?

16:32 <jonpsy[m]> Diary?

16:32 <zoq[m]> To the PR

16:32 <zoq[m]> I like to run it myself

16:32 <jonpsy[m]> or PR or chat anywhere we can see

16:33 <zoq[m]> But did you check the backward check?

16:33 <EshaanAgarwal[m]> zoq[m]: Ok sure ! I have made some changes in cart pole environment too.

16:34 <EshaanAgarwal[m]> zoq[m]: Almost going there when I noticed that weights issue and then tried to resolve that

16:35 <EshaanAgarwal[m]> jonpsy: if you are free would you like to join the meet ?

16:35 <zoq[m]> Might not make it.

16:35 <jonpsy[m]> nw, Eshaan Agarwal will write the meeting summary

17:24 * EshaanAgarwal[m] uploaded an image: (19KiB) < https://libera.ems.host/_matrix/media/r0/download/matrix.org/KBPpwrKgowZvPgkWnAbmKePI/Screenshot%20from%202022-09-18%2022-54-34.png >

17:24 * EshaanAgarwal[m] uploaded an image: (39KiB) < https://libera.ems.host/_matrix/media/r0/download/matrix.org/jAtezZnyHEQZojEHiVstUdFs/Screenshot%20from%202022-09-18%2022-54-24.png >

17:25 <EshaanAgarwal[m]> EshaanAgarwal[m]: jonpsy: please look now ! actually in pytorch i was saving only one of them in variable. i changed that now

17:25 <EshaanAgarwal[m]> they look similar
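
Judging by eye whether two dumps (losses, weights, or forward outputs from the mlpack and PyTorch runs) "look similar" can be replaced by a numeric check. The following is a rough sketch, assuming both dumps have already been read into equally ordered `std::vector<double>` buffers. `MaxAbsDiff`, `AllClose`, and the default tolerances are illustrative choices, not something taken from the PR or from either library.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Largest element-wise absolute difference between two equally sized dumps.
inline double MaxAbsDiff(const std::vector<double>& a,
                         const std::vector<double>& b)
{
  assert(a.size() == b.size());
  double worst = 0.0;
  for (std::size_t i = 0; i < a.size(); ++i)
    worst = std::max(worst, std::abs(a[i] - b[i]));
  return worst;
}

// True if every pair satisfies |a - b| <= absTol + relTol * |b|.
// Dumps of different length are never considered close.
inline bool AllClose(const std::vector<double>& a,
                     const std::vector<double>& b,
                     const double absTol = 1e-6,
                     const double relTol = 1e-5)
{
  if (a.size() != b.size())
    return false;

  for (std::size_t i = 0; i < a.size(); ++i)
  {
    if (std::abs(a[i] - b[i]) > absTol + relTol * std::abs(b[i]))
      return false;
  }
  return true;
}
```

Reporting `MaxAbsDiff` next to the pass/fail result of `AllClose` shows how far apart the two implementations are, which helps separate float rounding noise from a real mismatch in the actor or critic loss.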

17:36 <EshaanAgarwal[m]> zoq @marcusedel:matrix.org: jonpsy: can you please reopen the pull request. Apparently it got closed due to inactivity.

20:38 <rcurtin[m]> jonpsy: sounds good, just let me know if I can help explain the Jenkins setup or anything. it has years and years of cruft and insider knowledge from things we've set up 😃