#mlpack on 2019-04-02 — irc logs at libera.irclog.whitequark.org

2018-11-12 22:39 ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/"

01:56 robb9 has joined #mlpack

01:57 < robb9> hey, i'm trying to format data for an LSTM and I am wondering if I did it correctly

01:58 < robb9> I made each "data point" a timestep (cube slice), each slice has 1 row that has 2 columns (features)

01:59 < robb9> also, in this case, (I am using an LSTM) how many inputs should the first layer have? 2 because I have 2 features?

02:30 robb9 has quit [Quit: Page closed]

02:58 seewishnew has joined #mlpack

03:08 seewishnew has quit [Ping timeout: 250 seconds]

03:28 ahmedmaher has joined #mlpack

03:49 Thong_ has joined #mlpack

03:49 Thong_ has quit [Client Quit]

03:51 Thong_ has joined #mlpack

03:51 pd09041999 has quit [Quit: Leaving]

03:51 pd09041999 has joined #mlpack

03:52 Thong_ has quit [Client Quit]

04:53 seewishnew has joined #mlpack

05:10 seewishnew has quit [Remote host closed the connection]

05:11 seewishnew has joined #mlpack

05:15 seewishnew has quit [Ping timeout: 250 seconds]

05:18 seewishnew has joined #mlpack

05:56 seewishnew has quit [Remote host closed the connection]

05:56 seewishnew has joined #mlpack

06:02 seewishnew has quit [Remote host closed the connection]

06:03 seewishnew has joined #mlpack

06:07 seewishnew has quit [Ping timeout: 258 seconds]

07:05 ahmedmaher has quit [Read error: Connection reset by peer]

07:06 ahmedmaher has joined #mlpack

07:12 Mulx10 has joined #mlpack

07:13 < Mulx10> robb9: I guess the cube structure is (n_features, n_data, n_timestep).

07:15 < Mulx10> robb9 : If the input has two features then the input size should be 2; if the input has two timestamps then rho should be 2

07:16 < Mulx10> robb9 : there is a typo, it is time steps not timestamps.
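
The layout Mulx10 describes (rows = features, columns = data points, slices = time steps) can be sketched without mlpack itself. The following is a minimal illustration, not mlpack code: `CubeShape`, `FlatIndex`, and `MakeCube` are hypothetical names, and the flat ordering assumes the slice-by-slice, column-major storage that Armadillo uses for `arma::cube`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Dimensions of an RNN input cube in the layout described above:
// rows = features, columns = data points, slices = time steps.
// (Hypothetical helper type for this sketch, not part of mlpack.)
struct CubeShape
{
  std::size_t nFeatures;  // rows: would match the first layer's input size
  std::size_t nPoints;    // columns: one per data point (sequence)
  std::size_t nSteps;     // slices: one per time step (rho in the chat)
};

// Offset of element (feature, point, step) in a flat buffer that is stored
// column-major within each slice, one slice after another.
std::size_t FlatIndex(const CubeShape& s,
                      std::size_t feature,
                      std::size_t point,
                      std::size_t step)
{
  assert(feature < s.nFeatures && point < s.nPoints && step < s.nSteps);
  return feature + point * s.nFeatures + step * s.nFeatures * s.nPoints;
}

// Allocate a zero-filled buffer large enough to hold the whole cube.
std::vector<double> MakeCube(const CubeShape& s)
{
  return std::vector<double>(s.nFeatures * s.nPoints * s.nSteps, 0.0);
}
```

Under this layout, robb9's two features would occupy the 2 rows of each slice rather than 2 columns of a single row, so `CubeShape{2, nPoints, nSteps}` corresponds to a first-layer input size of 2 and rho equal to `nSteps`.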

07:16 pd09041999 has quit [Ping timeout: 245 seconds]

07:22 Mulx10 has quit [Ping timeout: 256 seconds]

07:29 pd09041999 has joined #mlpack

07:48 heisenbug_ has joined #mlpack

07:48 < heisenbug_> Hey, I am interested in working on mlpack; can anyone let me know how to get started?

07:55 heisenbug_ has quit [Ping timeout: 256 seconds]

08:27 pd09041999 has quit [Ping timeout: 250 seconds]

08:42 pd09041999 has joined #mlpack

09:32 pd09041999 has quit [Read error: Connection reset by peer]

09:56 pd09041999 has joined #mlpack

10:25 seewishnew has joined #mlpack

10:34 seewishnew has quit [Remote host closed the connection]

10:35 seewishnew has joined #mlpack

10:42 seewishnew has quit [Ping timeout: 250 seconds]

10:51 pd09041999 has quit [Quit: Leaving]

11:09 rf_sust2018 has joined #mlpack

11:15 seewishnew has joined #mlpack

11:18 seewishnew has quit [Remote host closed the connection]

11:18 seewishnew has joined #mlpack

11:23 seewishnew has quit [Ping timeout: 250 seconds]

11:26 seewishnew has joined #mlpack

11:30 rf_sust2018 has quit [Quit: Leaving.]

11:31 ahmedmaher has quit [Ping timeout: 258 seconds]

11:35 ahmedmaher has joined #mlpack

12:06 i8hantanu has joined #mlpack

12:13 seewishnew has quit [Remote host closed the connection]

12:13 seewishnew has joined #mlpack

12:15 robb9 has joined #mlpack

12:15 < robb9> Mulx10: thank you! I finally understand

12:16 < robb9> also, does RNN extend a base class? I want to make it an instance variable but I can't unless I specify rho, which I don't want to do yet

12:18 seewishnew has quit [Ping timeout: 264 seconds]

12:22 seewishnew has joined #mlpack

12:25 seewishnew has quit [Remote host closed the connection]

12:25 seewishnew has joined #mlpack

12:27 bhavya01 has joined #mlpack

12:30 seewishnew has quit [Ping timeout: 264 seconds]

12:53 seewishnew has joined #mlpack

13:00 bhavya01 has quit [Quit: Ex-Chat]

13:00 bhavya01 has joined #mlpack

13:08 robb9 has quit [Quit: Page closed]

13:15 sreenik has joined #mlpack

13:35 bhavya01 has quit [Ping timeout: 268 seconds]

13:36 rf_sust2018 has joined #mlpack

13:37 seewishnew has quit [Remote host closed the connection]

13:38 seewishnew has joined #mlpack

13:41 seewishnew has quit [Remote host closed the connection]

13:42 seewishnew has joined #mlpack

13:51 seewishnew has quit [Remote host closed the connection]

13:52 seewishnew has joined #mlpack

13:56 seewishnew has quit [Ping timeout: 250 seconds]

13:59 bhavya01 has joined #mlpack

14:00 paul____ has joined #mlpack

14:02 rf_sust2018 has quit [Quit: Leaving.]

14:04 < paul____> Hi! In the context of GSoC, may I ask if there have been any strong proposals for the mlpack-TensorFlow-PyTorch translator project yet? I am very interested in the project; however, if some rockstars have already applied to it, I might apply for a different one to have a realistic chance.

14:08 < rcurtin> paul____: I have no idea what the quality of the proposals is, but there has not been much discussion about that project

14:08 < rcurtin> there's also been basically no discussion of the 'mlpack on resource constrained devices' project

14:10 seewishnew has joined #mlpack

14:13 < paul____> Thanks for the answer rcurtin! I would definitely be more interested in the 'mlpack translator project' than the 'mlpack on resource constrained devices' project. I suppose I will put together a bit of a pitch / preliminary proposal for that project then and will eventually suggest it in this channel. Again, thanks for the quick response!

14:15 i8hantanu has quit [Quit: Connection closed for inactivity]

14:22 paul____ has quit [Quit: Page closed]

14:25 < rcurtin> sounds good

14:28 pd09041999 has joined #mlpack

14:39 rf_sust2018 has joined #mlpack

14:45 robb9 has joined #mlpack

14:45 < robb9> I'm getting some weird behavior with mlpack::data::Load

14:45 < robb9> I have 256 lines in a csv file, with each line having 4 features

14:46 < robb9> however, upon loading it, the n_cols is equal to 2570 somehow, not 256.

14:46 < robb9> just using stock mlpack::data::Load("data.csv", filedata, true);

14:48 < robb9> ah, I think I found the problem
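
robb9 never says what the problem turned out to be, so the following is only a way to sanity-check a file before loading it, not a diagnosis. Since mlpack treats columns as data points, a 256-line file with 4 values per line would be expected to load as 4 rows by 256 columns; a different `n_cols` suggests the file does not have the shape one assumes. This minimal sketch uses only the standard library; `CsvShape` and `InspectCsv` are hypothetical names, and the field count is a plain comma count that ignores quoting.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>

// Summary of a CSV stream's shape (hypothetical helper, not part of mlpack).
struct CsvShape
{
  std::size_t lines = 0;      // number of non-empty lines
  std::size_t minFields = 0;  // fewest comma-separated fields on any line
  std::size_t maxFields = 0;  // most comma-separated fields on any line
};

// Count non-empty lines and the range of fields per line.
// Quoted fields containing commas are not handled in this sketch.
CsvShape InspectCsv(std::istream& in)
{
  CsvShape shape;
  std::string line;
  while (std::getline(in, line))
  {
    // Tolerate Windows line endings.
    if (!line.empty() && line.back() == '\r')
      line.pop_back();
    if (line.empty())
      continue;

    const std::size_t fields =
        1 + static_cast<std::size_t>(std::count(line.begin(), line.end(), ','));
    shape.minFields =
        (shape.lines == 0) ? fields : std::min(shape.minFields, fields);
    shape.maxFields = std::max(shape.maxFields, fields);
    ++shape.lines;
  }
  assert(shape.minFields <= shape.maxFields);
  return shape;
}
```

Passing a `std::ifstream` opened on `data.csv` (or a `std::istringstream` for a quick check) should report 256 lines with `minFields == maxFields == 4` if the file really has the shape robb9 describes; anything else points at the file rather than at the loader.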

14:48 robb9 has quit [Client Quit]

14:56 ahmedmaher has quit [Read error: Connection reset by peer]

14:56 astadnik has joined #mlpack

14:56 ahmedmaher has joined #mlpack

14:58 < astadnik> Hello guys)

15:06 seewishnew has quit [Remote host closed the connection]

15:06 seewishnew has joined #mlpack

15:11 seewishnew has quit [Ping timeout: 264 seconds]

15:28 leilee has joined #mlpack

15:38 big has joined #mlpack

15:38 saksham189 has joined #mlpack

15:38 < big> hey, I have an error regarding using Predict() with my recurrent neural net

15:39 < big> it's a Mat::submat() out of bounds error

15:39 < big> I have rho set to my amount of datapoints per timestep

15:40 < big> and the amount of rows in my dataset is equal to the amount of inputs in the first layer

15:40 < big> what else could be the problem?

15:40 < big> the error specifically happens when I use Predict(). I am constructing my datapoint correctly and all members are populated

15:42 bhavya01 has quit [Remote host closed the connection]

16:13 pd09041999 has quit [Ping timeout: 250 seconds]

16:16 leilee has quit [Ping timeout: 256 seconds]

16:25 < big> I don't get a matrix multiplication error or anything, just a submat out of bounds

16:26 pd09041999 has joined #mlpack

16:30 pd09041999 has quit [Excess Flood]

16:31 pd09041999 has joined #mlpack

16:37 pd09041999 has quit [Max SendQ exceeded]

16:38 pd09041999 has joined #mlpack

16:44 saksham189 has quit [Ping timeout: 256 seconds]

16:45 < big> I think it's because I'm adding the layers wrong

17:00 pd09041999 has quit [Ping timeout: 250 seconds]

17:05 saksham189 has joined #mlpack

17:06 seewishnew has joined #mlpack

17:09 seewishnew has quit [Remote host closed the connection]

17:09 seewishnew has joined #mlpack

17:09 sreenik has quit [Ping timeout: 256 seconds]

17:10 ahmedalamir has joined #mlpack

17:12 ahmedmaher has quit [Ping timeout: 268 seconds]

17:13 pd09041999 has joined #mlpack

17:13 pd09041999 has quit [Max SendQ exceeded]

17:26 rf_sust2018 has quit [Quit: Leaving.]

17:27 saksham189 has quit [Ping timeout: 256 seconds]

17:38 saksham189 has joined #mlpack

17:45 sreenik has joined #mlpack

17:45 < sreenik> Before I knew about ensmallen I used to make use of mlpack's default SGD optimizer with AdamUpdate. Now with ens::Adam(...) however the accuracy has come down to 10% from 99% with similar parameters. Am I missing anything obvious?

17:55 seewishnew has quit [Ping timeout: 250 seconds]

18:03 seewishnew has joined #mlpack

18:10 < rcurtin> sreenik: the code didn't change, so that's quite surprising to me; are you sure nothing else in your code changed?

18:13 seewishnew has quit [Remote host closed the connection]

18:13 seewishnew has joined #mlpack

18:16 < sreenik> I don't think so. I will investigate a bit more. If the optimizer code hasn't changed, then I hope something is wrong with my code. I will open an issue in case I can't find out

18:18 seewishnew has quit [Ping timeout: 268 seconds]

18:26 seewishnew has joined #mlpack

18:27 < big> It looks like there's a very specific way to add layers to an RNN...

18:27 < big> Is it true that I have to make a separate object, say, "recurrent", add layers to that, and then add that to the model?

18:33 < sreenik> The parser is ready (ahh finally!). I am trying it out with different models. No errors so far just this thing regarding the accuracy. Also, I am not very familiar with using RNNs with mlpack, so I have left rnn features out (incorporating them later won't be much of a task though). Should I file a PR or upload it to my own repo and share the link after documenting? I am not too sure where this parser should reside.

18:34 < sreenik> Also, it needs extensive testing and maybe some structuring before it can be merged.

18:41 < big> is there a debug mode that I can make armadillo use? I need verbosity to see where in Predict() I'm getting the out of bounds error

18:43 seewishnew has quit [Remote host closed the connection]

18:43 seewishnew has joined #mlpack

18:48 seewishnew has quit [Ping timeout: 268 seconds]

18:51 < sreenik> rcurtin: Sorry it was my mistake. It's alright now. Ensmallen is working exactly as it is supposed to. The dataset had been corrupted somehow :)

18:55 vivekp has quit [Ping timeout: 244 seconds]

19:31 Suryo has joined #mlpack

19:32 < Suryo> zoq, rcurtin: I had submitted my proposal last week (I've been busy with my studies since) and was wondering if you guys got a chance to look at it. I just wanted to know what to add to it, or if there's anything I need to remove from it...

19:33 Suryo has quit [Client Quit]

19:34 < zoq> big: model.Add<Recurrent<> >(Add<>(), ...); works as well

19:34 < zoq> big: About debugging what about using gdb to step through the code?

19:35 < zoq> Suryo: hm, I think I have missed the application, let's see.

19:35 < big> Well I have been getting a Mat::submat() out of bounds error when I try to Train or Predict using my model

19:35 < big> I've tried using gdb but I haven't gotten anything useful

19:37 < big> I have 2 timesteps (rho=2, slices of cube is 2), 4 features (4 rows in cube) and 16 columns per step (16 datapoints per step)

19:38 < zoq> big: hm, it should be possible to get the line that caused the error using gdb, but I agree that might be time consuming

19:39 < zoq> big: The shape sounds reasonable to me
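
The rule of thumb used in this conversation (cube rows equal the first layer's input size, cube slices equal rho) can be written down as a small check. This is a sketch of that rule only, under the assumption that it is the right one to apply; it does not reproduce mlpack's internal validation, and the cause of big's out-of-bounds error is not identified in this log. `CubeDims` and `CheckRnnInput` are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Dimensions of the input cube (hypothetical helper type for this sketch).
struct CubeDims
{
  std::size_t rows;    // features
  std::size_t cols;    // data points per time step
  std::size_t slices;  // time steps
};

// Returns an empty string when the dimensions agree with the rule of thumb
// from the chat, otherwise a description of the first mismatch found.
std::string CheckRnnInput(const CubeDims& d,
                          std::size_t inputSize,
                          std::size_t rho)
{
  if (d.rows != inputSize)
  {
    return "rows (" + std::to_string(d.rows) +
           ") != first-layer input size (" + std::to_string(inputSize) + ")";
  }
  if (d.slices != rho)
  {
    return "slices (" + std::to_string(d.slices) +
           ") != rho (" + std::to_string(rho) + ")";
  }
  if (d.cols == 0)
    return "cube has no data points (0 columns)";
  return "";
}
```

For the numbers big gives (4 rows, 16 columns, 2 slices, rho = 2, first layer taking 4 inputs), `CheckRnnInput(CubeDims{4, 16, 2}, 4, 2)` returns an empty string, which matches zoq's reading that the shape itself looks fine.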

19:39 < big> all of my data is populated but when I try to Train or Predict it gives the same error

19:39 < zoq> right at the first iteration?

19:40 < big> I'm not sure, how would I go about checking which iteration it's on? i'm assuming more gdb

19:41 < big> I'll see if I can tell and I'll let you know

19:41 < zoq> You could get some more output if you define ENS_PRINT_INFO and ENS_PRINT_WARN

19:42 < zoq> https://github.com/mlpack/ensmallen/blob/8d9682ca54483fb0601db8b48b17839e206149eb/include/ensmallen_bits/log.hpp#L73-L83

19:42 < big> Alright, I'll try that. Making armadillo verbose didn't give any helpful output

19:45 < big> Would it be helpful if I told you that it printed absolutely nothing with both enabled?

19:46 < big> I'm going to try reinstalling armadillo and mlpack

19:46 < zoq> Kinda, I guess it returns before the first iteration.

19:47 < zoq> Not sure, perhaps you can create a simple model including the data that can be used to reproduce the issue? In case you don't get anything reasonable from the gdb output.

19:48 < big> Would having only the following layers maybe cause a problem somehow? Linear(4,10), GRU(10,10), Linear(10,3)

19:48 < big> Yeah I've taken everything out and have the simplest possible model right now for testing purposes

19:49 < zoq> the structure looks alright to me, probably something strange with the data encoding

19:49 < zoq> if you like you can post the link here and I can take a look at it once I have a chance, probably not before tomorrow

19:50 < big> Alright, if this reinstall doesn't work then I'll do that. Thanks

19:52 saksham189 has quit [Quit: Page closed]

19:53 < rcurtin> Suryo: I have a bit of time set out tonight to look at proposals, I'll be sure to look at yours also

19:54 < zoq> Currently looking into the application.

20:09 ahmedmaheralamir has joined #mlpack

20:09 ahmedalamir has quit [Read error: Connection reset by peer]

20:21 < big> zoq: https://gitlab.com/hexrays/my-error

20:32 ahmedmaher has joined #mlpack

20:35 ahmedalamir has joined #mlpack

20:36 ahmedmaheralamir has quit [Ping timeout: 246 seconds]

20:37 ahmedmaher has quit [Ping timeout: 268 seconds]

20:41 ahmedmaher has joined #mlpack

20:41 ahmedalamir has quit [Ping timeout: 246 seconds]

21:05 big has quit [Ping timeout: 256 seconds]

21:06 big has joined #mlpack

21:16 sreenik has quit [Quit: Page closed]

21:20 < big> I'll do some additional gdb work but I can't seem to figure out what the problem is

21:39 anidh1997 has joined #mlpack

21:47 < big> zoq: Also it's not my installation, I just tried on a different machine with the same error

22:09 ahmedmaher has quit [Ping timeout: 245 seconds]

22:14 big has quit [Ping timeout: 256 seconds]

22:21 anidh1997 has quit [Quit: Page closed]

22:22 Suryo has joined #mlpack

22:23 < Suryo> zoq: thanks for taking a look at my application and commenting. I think that a lot of your comments are actually addressed in different parts of the application, but I'll review it to make sure that all the points you've raised are clear in the application.

22:26 Suryo has quit [Client Quit]

22:44 ahmedmaher has joined #mlpack

23:24 big has joined #mlpack

23:24 < big> zoq: tracing with GDB and I may have found the culprit. for some reason, it thinks the batchSize is 256 instead of rho which is what I gave it

23:24 < big> mlpack::ann::RNN<...>, ...>::Predict (this=0x7fffffffd3c0, predictors=..., results=..., batchSize=256)

23:25 < big> is the default value 256 or something?

23:27 ahmedalamir has joined #mlpack

23:27 ahmedmaher has quit [Read error: Connection reset by peer]

23:35 < big> Never mind, that's not the error

23:35 big has quit [Quit: Page closed]

23:41 ahmedmaheralamir has joined #mlpack

23:44 ahmedalamir has quit [Ping timeout: 255 seconds]