verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
< zoq>
partobs-mdp: Sorry for the slow reponse, the left side is the expected total reward in the long run, which can be can be estimated from the predicted rewards. Maybe this: http://rllab.readthedocs.io/en/latest/user/implement_algo_basic.html is helpful here? Also, just to be clear you like to implement the REINFORCE model not the fully differential model? I think the implementation of the differentiable model
< zoq>
is easier, but it's up to you.
vivekp has quit [Ping timeout: 268 seconds]
sheogorath27 has joined #mlpack
vss has joined #mlpack
vss has quit [Ping timeout: 260 seconds]
partobs-mdp has joined #mlpack
andrzejku has joined #mlpack
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 268 seconds]
vivekp has joined #mlpack
andrzejku has quit [Quit: My iMac has gone to sleep. ZZZzzz…]
vivekp has quit [Ping timeout: 276 seconds]
andrzejku has joined #mlpack
MikeLDN has joined #mlpack
< MikeLDN>
rcurtin: (regarding the Range Search LNK error in VC) Release 2.0.3 is building OK. VC has /FORCE:MULTIPLE linker flag but it is not working. If I dig some solution I'll write it here...
< partobs-mdp>
zoq: Implemented changes from your review. Can you outline how to crate a PR with all optimizer changes that were made during our previous work? (I understand that I should create a new branch, but what should I do next? I'm slightly confused)
jenkins-mlpack has quit [Remote host closed the connection]
jenkins-mlpack has joined #mlpack
partobs-mdp has quit [Ping timeout: 246 seconds]
partobs-mdp has joined #mlpack
MikeLDN has joined #mlpack
vivekp has quit [Ping timeout: 260 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 255 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 246 seconds]
kris1 has joined #mlpack
vivekp has joined #mlpack
< kris1>
What layer to use for 0/1 loss in the OutputLayer of ffn
< zoq>
kris1: cross entropy or negative log likelihood loss should work, Konstantin implemented the cross entropy layer here: https://github.com/mlpack/mlpack/pull/1005
< kris1>
I was using the nll but when target(i) = 1; then size_t currentTarget = target(i) - 1; the current Target wraps around so i don;t think nll would work
vivekp has quit [Ping timeout: 268 seconds]
< zoq>
kris1: ah, right
< zoq>
kris1: however the cross entropy layer should work
vivekp has joined #mlpack
MikeLDN has quit [Ping timeout: 260 seconds]
< kris1>
Ok i will try that.
< kris1>
One more thing if we do something like a = arma::mat(b.mempt(), b.n_rows, b.n_cols, false, false). Then reassign b = newObject. that would mean that b.n_rows and b.n_cols would also change. Would a change accordingly or not.
< zoq>
kris1: a does not change no, also it's unsafe to do that since c uses the memory from b.
< kris1>
okay…..so how should i share the predictors and responses variable of gan with the generator and discriminator network. Note that both predictors and response change when training disrciminator and generator.
< zoq>
kris1: As long as you don't change the memory pointer or resize the matrix you can use memptr, you could also use a pointer that points to the shared parameter.
partobs-mdp has quit [Ping timeout: 260 seconds]
< kris1>
mikhail i have added the gan pr. I working on the training right now since it does not work correctly. But the other functions do work. If you have the time please look at tit
vivekp has quit [Ping timeout: 260 seconds]
vivekp has joined #mlpack
vivekp has quit [Ping timeout: 260 seconds]
< lozhnikov>
kris1: I want to fix the ssRBM implementation first. Nevertheless, I can take a quick look.
< kris1>
yes i will get to the ssRBM tonight. I was working on the Gan from the afternoon. I think cross entropy would fix most of the errors i am getting.
< kris1>
Let’s see.
< kris1>
I will fix the problems you mentioned in the ssRBM PR tonight.
< kris1>
I am again getting into memory errors ie the program is getting killed.
< kris1>
You could also look at the PR if you want i have used the cross entropy form konstantin
< zoq>
kris1: Can you be a little bit more specific e.g. line number? Just glanced over the code, couldn't see anything right away.
< zoq>
I guess, GanTest (PR) is the test that results in a memory error?
< kris1>
Yes. So well the program just get’s killed without giving line number. But the with valgrind the last line being executed was at gan_impl.hpp:171
< zoq>
kris1: hm, maybe you can step through the program line by line (gdb) to narrow down the error, if not I'll take a closer look at the issue tomorrow.
< kris1>
sorry i did that with lldb and execution stops at 171. I checked the variables there they seemed fine to me.
< kris1>
But i will have another look just in case
< zoq>
okay sounds good, I'll set it on my to do list for tomorrow and get back to you once I know what happened here