#mlpack on 2022-10-30 — irc logs at libera.irclog.whitequark.org

2021-07-27 15:44 rcurtin_irc changed the topic of #mlpack to: mlpack: a scalable machine learning library (https://www.mlpack.org/) -- channel logs: https://libera.irclog.whitequark.org/mlpack -- NOTE: messages sent here might not be seen by bridged users on matrix, gitter, or slack

01:59 SlackIntegration has quit [Ping timeout: 268 seconds]

02:00 SlackIntegration has joined #mlpack

04:32 krushia has quit [*.net *.split]

04:32 krushia has joined #mlpack

04:48 <jonpsy[m]> Eshaan Agarwal, zoq when is the submission date?

08:14 rcurtin[m] has quit [Ping timeout: 268 seconds]

08:14 rcurtin[m] has joined #mlpack

09:58 <EshaanAgarwal[m]> <jonpsy[m]> "Eshaan Agarwal, zoq when is the..." <- Oct 31 to Nov 7.

09:59 <EshaanAgarwal[m]> jonpsy: zoq: you guys get a chance to look at it ?

09:59 <EshaanAgarwal[m]> Also if it could be possible can we get on PPO code pushed and merged ?

13:22 <zoq[m]> I did some debugging, and I think the reward is not correct.

13:41 <EshaanAgarwal[m]> <zoq[m]> "I did some debugging, and I..." <- Reward as in the reward given by the environment? 👀

13:44 <zoq[m]> Yes

13:46 <EshaanAgarwal[m]> zoq[m]: Can you please elaborate more on this ?

13:46 <zoq[m]> Leave a comment later on the PR.

13:48 <EshaanAgarwal[m]> zoq[m]: Sure ! Can I help in cleaning and getting read PPO code. If it is possible for you to push it ?

13:59 <zoq[m]> But please take another look at the policy implementation, ideally you can step through the code with gdb.

13:59 <zoq[m]> Looks like the reward is fine.

14:00 <zoq[m]> So what we have to do is to step through each line, to find out what’s wrong.

14:02 <EshaanAgarwal[m]> zoq[m]: I will take a look again.

14:03 <EshaanAgarwal[m]> zoq[m]: If I am not wrong you are talking about this in regard to HER right ?

14:04 <EshaanAgarwal[m]> * Sorry but If I am not wrong you are talking about this in regard to HER right ?

14:07 <zoq[m]> EshaanAgarwal[m]: Correct

16:31 <EshaanAgarwal[m]> jonpsy: zoq fieryblade https://meet.google.com/pnp-rtjw-unz

16:46 <EshaanAgarwal[m]> <EshaanAgarwal[m]> "jonpsy: zoq fieryblade https://..." <- can we reschedule this to tomorrow at same time 10:00 PM IST ? Meanwhile, i will take another look at HER with gdb. I also intend to update the progress repository so that we also get started with wrapping up the documentation and report.

16:51 <jonpsy[m]> not sure i can make it monday

16:57 <jonpsy[m]> Eshaan Agarwal: have you written this code entirely from scratch or have taken help from a resource?

17:18 <EshaanAgarwal[m]> <jonpsy[m]> "not sure i can make it monday" <- When can we do it then ?

17:20 <EshaanAgarwal[m]> <jonpsy[m]> "Eshaan Agarwal: have you..." <- I have used the psuedocode in paper and implementation done in Intel Coach Labs as reference. I have mentioned that in PR. But since the mlpack library had a different structure, I implemented in accordance it.

17:28 <jonpsy[m]> Okay I had one question

17:28 * jonpsy[m] sent a code block: https://libera.ems.host/_matrix/media/v3/download/libera.chat/60c5442c4a26e3842a44eba6ab503a143954f22c

17:29 <jonpsy[m]> `modifiedState(action.action) = 1 - modifiedState(action.action)` ?

17:29 <jonpsy[m]> What exactly are we trying to achieve here.

17:30 <EshaanAgarwal[m]> > <@jonpsy:matrix.org> `modifiedState(action.action) = 1 - modifiedState(action.action)` ?

17:30 <EshaanAgarwal[m]> >

17:30 <EshaanAgarwal[m]> flipped the bit in n length binary vector ! here agent will provide index as the action ! that index will be filpped and then produced vector will be next state

17:30 <EshaanAgarwal[m]> > What exactly are we trying to achieve here.

17:31 <EshaanAgarwal[m]> EshaanAgarwal[m]: that will give 1 for 0 at that index and 0 when its 1 at that index

17:35 <jonpsy[m]> `modifiedState(action.action) = 1 - modifiedState(action.action)`

17:35 <jonpsy[m]> so we're calling constructor?

17:36 <EshaanAgarwal[m]> jonpsy[m]: Modified state is a col vec right ?

17:36 <EshaanAgarwal[m]> In which we stored our present state

17:38 <jonpsy[m]> is it?

17:39 <jonpsy[m]> <EshaanAgarwal[m]> "> <@jonpsy:matrix.org> `..." <- If `modifiedState(action.action)` is 0, what happens? It becomes -1?

17:42 <EshaanAgarwal[m]> jonpsy[m]: it will become 1 right ? 1-0 is 1 which will get stored at that place

17:42 <jonpsy[m]> assuming the above code is correct, we could've just done `nextState.Data() = 1 - state.Data()` ? Why create unnecessary copies

17:43 <jonpsy[m]> * - state.Data()(action.action)` ?

17:44 <EshaanAgarwal[m]> jonpsy[m]: we have to flip that index right ? nextState is empty as of now

17:46 <akhunti1[m]> Hi Team , any suggestion what should i use to deploy Mlpack C++ based model ? like flask and Django use for python based Machine learning model .

17:47 <EshaanAgarwal[m]> EshaanAgarwal[m]: i think what you mean to say is this `nextState.Data()= state.Data(); nextState.Data()(action.action) = 1 - state.Data()(action.action);`

17:47 <EshaanAgarwal[m]> `

17:48 <jonpsy[m]> or you could just `std::move` the last part

17:49 <jonpsy[m]> but yes, avoid copies

17:49 <EshaanAgarwal[m]> jonpsy[m]: sure i will make that change.