#mlpack on 2022-11-04 — irc logs at libera.irclog.whitequark.org

2021-07-27 15:44 rcurtin_irc changed the topic of #mlpack to: mlpack: a scalable machine learning library (https://www.mlpack.org/) -- channel logs: https://libera.irclog.whitequark.org/mlpack -- NOTE: messages sent here might not be seen by bridged users on matrix, gitter, or slack

00:08 Unix_Man has joined #mlpack

00:36 Unix_Man has quit [Quit: WeeChat 3.6]

06:06 <jonpsy[m]> Eshaan Agarwal: where are we currently?

06:25 <EshaanAgarwal[m]> <jonpsy[m]> "Eshaan Agarwal: where are we..." <- Almost wrote the maze generation code. I will test it after classes.

08:08 <EshaanAgarwal[m]> When the constructor of Q Learning is called, it further calls the InitialSample() function of environment but since the environment object isn't initialised yet and hence it's private variables aren't initialised yet I get error ( using loop which needs to access my variables matrix's index)

08:08 <EshaanAgarwal[m]> How can I deal with this ?

08:10 <EshaanAgarwal[m]> This problem is coming because we have a random maze which makes starting points random every time. I select one of the starting points from the particular maze for the epsiode using InitialSample()

08:10 <EshaanAgarwal[m]> What could be a possible walk around ?

08:10 <EshaanAgarwal[m]> In previous environments, initial sample never depended on anything so that worked.

08:12 <EshaanAgarwal[m]> For context I am talking about this line - https://github.com/mlpack/mlpack/blob/master/src/mlpack/methods/reinforcement_learning/q_learning_impl.hpp#L54

08:13 <EshaanAgarwal[m]> `if(learningNetwork.Parameters().n_elem != environment.InitialSample().Encode().n_elem) learningNetwork.Reset(environment.InitialSample().Encode().n_elem);`

08:15 <EshaanAgarwal[m]> s/n_elem/n\_elem/, s/n_elem/n\_elem/, s/n_elem/n\_elem/

08:15 <EshaanAgarwal[m]> `targetNetwork.Reset(environment.InitialSample().Encode().n_elem);`

08:18 <EshaanAgarwal[m]> > <@eshaanagarwal:matrix.org> When the constructor of Q Learning is called, it further calls the InitialSample() function of environment but since the environment object isn't initialised yet and hence it's private variables aren't initialised yet I get error ( using loop which needs to access my variables matrix's index)

08:18 <EshaanAgarwal[m]> >

08:18 <EshaanAgarwal[m]> jonpsy: zoq:

08:18 <EshaanAgarwal[m]> > How can I deal with this ?

08:18 <EshaanAgarwal[m]> * jonpsy: zoq

10:56 <EshaanAgarwal[m]> > <@eshaanagarwal:matrix.org> When the constructor of Q Learning is called, it further calls the InitialSample() function of environment but since the environment object isn't initialised yet and hence it's private variables aren't initialised yet I get error ( using loop which needs to access my variables... (full message at <https://libera.ems.host/_matrix/media/v3/download/libera.chat/4d19da05edd10b3dcb51ed0d3b8b08e6f0328147>)

11:28 <jonpsy[m]> you need nod push it ig, since we're dealing with large scale tests. It'll only slow down our test-suite

11:28 <EshaanAgarwal[m]> i am just pushing the maze generation code

11:29 <jonpsy[m]> ok, can u show me some samples

11:29 <EshaanAgarwal[m]> jonpsy[m]: sample of mazes ?

11:29 <jonpsy[m]> + lets get HER & others running.

11:29 <jonpsy[m]> EshaanAgarwal[m]: whats the N, M you've tested it with

11:30 <EshaanAgarwal[m]> jonpsy[m]: i mean i checked with 100 * 100 ! otherwise it works for any n and m.

11:30 <EshaanAgarwal[m]> 10*10

11:30 <jonpsy[m]> for the start point to goal

11:30 <EshaanAgarwal[m]> not HER though ! HER i checked with 10*10

11:30 <jonpsy[m]> the num steps is root(M* N)

11:30 <jonpsy[m]> right?

11:31 <EshaanAgarwal[m]> jonpsy[m]: start point is random from any 0th cell and using the dfs approach i have fixed the goal for a generated matrices

11:31 <EshaanAgarwal[m]> jonpsy[m]: numeber of steps for what ?

11:32 <EshaanAgarwal[m]> if you can jump on call ,i might as well explain what i did.

11:33 <EshaanAgarwal[m]> i can show a sample of 10 * 10 generated maze if you want

11:34 <jonpsy[m]> can't

11:34 <jonpsy[m]> anyway, from what we discussed. The idea was to get a random start point, move X number of steps randomly. Then mark it as goal

11:34 <EshaanAgarwal[m]> jonpsy[m]: i have exactly the same

11:35 <EshaanAgarwal[m]> > <@jonpsy:matrix.org> anyway, from what we discussed. The idea was to get a random start point, move X number of steps randomly. Then mark it as goal

11:35 <EshaanAgarwal[m]> i have exactly the same

11:35 <EshaanAgarwal[m]> *

11:35 <jonpsy[m]> so how's X being defined here

11:35 <EshaanAgarwal[m]> jonpsy[m]: for now i have kept X as 0.5*n*m

11:35 <EshaanAgarwal[m]> but we can change it ofcourse

11:36 <EshaanAgarwal[m]> > <@jonpsy:matrix.org> so how's X being defined here

11:36 <EshaanAgarwal[m]> * for now i have kept X as 0.5 * n * m

11:37 <jonpsy[m]> <EshaanAgarwal[m]> "numeber of steps for what ?" <- .

11:37 <jonpsy[m]> <jonpsy[m]> "the num steps is root(M* N)" <- .

11:37 <EshaanAgarwal[m]> jonpsy[m]: ok i will change it to that

11:37 <EshaanAgarwal[m]> apart from that all things work.

11:37 <jonpsy[m]> Cool, with that done. Can you try generating 500 x 500 matrix

11:38 <EshaanAgarwal[m]> jonpsy[m]: ok should i test the agent on it ?

11:38 <EshaanAgarwal[m]> also what about exploration steps ?

11:38 <jonpsy[m]> lets start with ebing able to create a matrix

11:38 <jonpsy[m]> 500x500, then 1k x 1k

11:38 <jonpsy[m]> and time it

11:38 <EshaanAgarwal[m]> jonpsy[m]: time the generation of matrix ?

11:38 <jonpsy[m]> yh

11:38 <jonpsy[m]> it wouldnt matter much, but im just interested to know

11:39 <EshaanAgarwal[m]> ok how do we time it ?

11:50 <EshaanAgarwal[m]> <EshaanAgarwal[m]> "ok how do we time it ?" <- jonpsy:

12:06 <EshaanAgarwal[m]> <jonpsy[m]> "." <- i think steps which we use to make the matric for that sqrt(n*m) will be quite less ! sqrt(n*m) is even less then n + m. Some factor is necessary. for now i am keep it 3 * sqrt(n *m)

12:06 <EshaanAgarwal[m]> This is more appropriate.

12:09 <EshaanAgarwal[m]> also i am able to generate the matrices.

15:52 <jonpsy[m]> ok

15:57 <zoq[m]> <EshaanAgarwal[m]> "jonpsy:..." <- You can use armadillos tic/toc

15:58 <EshaanAgarwal[m]> zoq[m]: Ok i will check this out !

15:58 <EshaanAgarwal[m]> jonpsy: what else do I need to work on ?

15:58 <jonpsy[m]> 200x200 matrix, HEr run

15:58 <jonpsy[m]> vs others

16:01 <EshaanAgarwal[m]> <EshaanAgarwal[m]> "i think steps which we use to..." <- To be fair, I was going throught various runs and experimenting with right `exploration steps` and `max steps` !... (full message at <https://libera.ems.host/_matrix/media/v3/download/libera.chat/ed8627aadcec9e7361137453a66f064248db840d>)

16:02 <EshaanAgarwal[m]> So we should set that manually and not have formula for it

16:03 <EshaanAgarwal[m]> jonpsy[m]: What should the exploration steps and max steps that I set ?

16:03 <EshaanAgarwal[m]> I was currently running 50*50 and it has taken almost 1 hr to complete 80-100 iterations

16:04 <jonpsy[m]> oh

16:04 <EshaanAgarwal[m]> > <@eshaanagarwal:matrix.org> What should the exploration steps and max steps that I set ?

16:04 <EshaanAgarwal[m]> > I was currently running 50*50 and it has taken almost 1 hr to complete 80-100 iterations

16:04 <EshaanAgarwal[m]> Even when the exploration steps are less to what should be for a good performance I guess.

16:05 <EshaanAgarwal[m]> I think 200*200 is punching way over the belt. Like in the HER paper they said that even using HER they weren't able to solve but flipping environment with length more then 15

16:06 <EshaanAgarwal[m]> s/but/bit/

16:07 <EshaanAgarwal[m]> Anything more than 50*50 seems a bit over in my opinion as per all the runs that I was trying.

16:08 <zoq[m]> The purpose of the test is to make sure HER works, so if it works on a reasonable number of different settings it’s fine.

16:09 <zoq[m]> That said we can test it on a larger size outside of the test suite.

16:09 <EshaanAgarwal[m]> zoq[m]: I think as per what we discussed for testing purposes. 4*4 or 10*10 was ok. We were doing this to guage the limits of our algo

16:10 <zoq[m]> gives us some more insight into the correctness of the implementation

16:10 <zoq[m]> Yes, makes sense.

16:10 <EshaanAgarwal[m]> EshaanAgarwal[m]: Infact in the test I implemented it works on 10*10. And as per the PR test they all have converged

16:11 <zoq[m]> Good

16:11 <EshaanAgarwal[m]> And I verified that independently in my system too

16:11 <zoq[m]> And they converge for each run or 1 out of 5?

16:11 <EshaanAgarwal[m]> zoq[m]: I will say 4/5 for the threshold I have set.

16:12 <EshaanAgarwal[m]> But we check for only one run

16:12 <EshaanAgarwal[m]> I set the threshold just to make sure that it's feasible and also not a one time thing

16:12 <EshaanAgarwal[m]> EshaanAgarwal[m]: Meaning it needs to converge in only one out of the five.

16:12 <zoq[m]> Yes was just curious how stable it is.

16:13 <EshaanAgarwal[m]> zoq[m]: Yup it does.

16:14 <EshaanAgarwal[m]> zoq[m]: Ok so how do you suggest that I should proceed !

16:14 <EshaanAgarwal[m]> I have pushed the code with 10*10 in the PR. I am currently checking 50*50 but frankly it might need more exploration steps to perform and it's quite time consuming.

16:15 <EshaanAgarwal[m]> EshaanAgarwal[m]: 10*10 as the test. For test I think that is more than sufficient and serves the purpose well

16:16 <zoq[m]> If you can let it run on the side, I would start on the documentation for HER. Which includes a description of the method, an example I guess we can use the maze.

16:17 <EshaanAgarwal[m]> zoq[m]: Where do I have to write the description ? I think I have provided in line documentation with the code.

16:17 <EshaanAgarwal[m]> * Where do I have to write the description ? I think I have provided in line documentation with the code. I will go over it today again and push all the changes

16:18 <zoq[m]> https://github.com/mlpack/mlpack/tree/master/doc/tutorials

16:18 <zoq[m]> You can add it in the form of a tutorial.

16:19 <zoq[m]> You can check the existing tutorials for some inspiration.

16:19 <EshaanAgarwal[m]> zoq[m]: Sure ! I will do this by tomorrow.

16:20 <EshaanAgarwal[m]> zoq[m]: Anything else apart from that ?

16:21 <zoq[m]> EshaanAgarwal[m]: I’ll go through the code later today, so if you can incorporate the feedback as well, that would be great.

18:54 <akhunti1[m]> Hi rcurtin I am getting this error [ ImportError: libopenblaso.so.0: cannot open shared object file: No such file or directory ]

18:56 <akhunti1[m]> Is there something i need to change in FindArmadillo.cmake file , so that i can avoid this dependence , or i need to add this file

19:17 <rcurtin[m]> armadillo is dependent on OpenBLAS; I can't say exactly what your issue is, but it is an unavoidable dependency

19:19 <akhunti1[m]> Thanks for your help , Then I will try to install openblas .

20:08 <akhunti1[m]> Hi rcurtin I installed openblas from GitHub :

20:08 * akhunti1[m] uploaded an image: (2KiB) < https://libera.ems.host/_matrix/media/v3/download/matrix.org/YvPAsPgzUqtuklAvmMujBKBp/image.png >

20:09 <akhunti1[m]> but i am getting error [ ImportError: libopenblaso.so.0:] for this file .

20:12 <akhunti1[m]> so , any idea where i can get this file [ libopenblaso.so.0:] because when i installed it is generating libopenblas.so.0 file .

20:27 <akhunti1[m]> or can i edit the file name of the file as to aso .