verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
witness has quit [Quit: Connection closed for inactivity]
vivekp has joined #mlpack
govg has joined #mlpack
witness has joined #mlpack
isaiah_ has quit [Quit: Page closed]
vivekp has quit [Ping timeout: 245 seconds]
vivekp has joined #mlpack
McDhee has joined #mlpack
McDhee has quit [Client Quit]
McDhee has joined #mlpack
horribilis has joined #mlpack
McDhee has quit [Quit: Page closed]
horribilis has quit [Quit: Leaving.]
daivik has joined #mlpack
sumedhghaisas has quit [Ping timeout: 240 seconds]
sumedhghaisas has joined #mlpack
sumedhghaisas has quit [Read error: Connection reset by peer]
sumedhghaisas has joined #mlpack
< luffy1996> zoq: I would like to know what is aggregated_policy class in reinforcement learning
< luffy1996> It would be great if you can give a paper that describes the method
viraj has joined #mlpack
< luffy1996> Is this the reinforce algorithm defined in reinforcement learning?
sumedhghaisas has quit [Read error: Connection reset by peer]
sumedhghaisas has joined #mlpack
sumedhghaisas has quit [Read error: Connection reset by peer]
sumedhghaisas has joined #mlpack
witness has quit [Quit: Connection closed for inactivity]
viraj has quit [Quit: Page closed]
viraj has joined #mlpack
sumedhghaisas2 has joined #mlpack
sumedhghaisas2 has quit [Read error: Connection reset by peer]
sumedhghaisas2 has joined #mlpack
sumedhghaisas has quit [Ping timeout: 276 seconds]
viraj has quit [Ping timeout: 260 seconds]
sumedhghaisas has joined #mlpack
sumedhghaisas2 has quit [Ping timeout: 245 seconds]
daivik has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]
sumedhghaisas has quit [Ping timeout: 240 seconds]
sumedhghaisas has joined #mlpack
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
sumedhghaisas2 has joined #mlpack
sumedhghaisas has quit [Ping timeout: 256 seconds]
sumedhghaisas has joined #mlpack
sumedhghaisas2 has quit [Ping timeout: 256 seconds]
sumedhghaisas2 has joined #mlpack
sumedhghaisas has quit [Read error: Connection reset by peer]
viraj has joined #mlpack
sumedhghaisas2 has quit [Read error: Connection reset by peer]
viraj has quit [Client Quit]
viraj has joined #mlpack
sumedhghaisas2 has joined #mlpack
< zoq> luffy1996: I'm not sure there is a reference for the Aggregated policy class: "An aggregated policy will randomly select a child policy under a given distribution at each time step."
< zoq> luffy1996: The reinforce normal layer is mend to be used for the recurrent visual attention model.
< luffy1996> zoq: this reinforce is different from the policy gradient method. right?
< zoq> luffy1996: yes
< luffy1996> zoq: I am having confusion here. What do you define a child policy ?
< zoq> If you use the AggregatedPolicy class, you have to pass a collection of policies, each policy in that collection is a child.
< luffy1996> I think I will have to take a look at the implementation again.
< luffy1996> zoq: I am planning to go ahead with policy gradient methods implementation in mlpack as gsoc propsoal. How do you feel about this?
< zoq> luffy1996: Sounds fine for me, do you have specifc methods in mind?
< viraj> Hi I am new to the mlpack. I have gone through the methods that are included in mlpack. I want to know if you are interested to implement the SVM (Support Vector Machine) Algorithm because I didn't saw anything like that in the current methods.
< luffy1996> I was planning to go with Reinforce (with,without baseline) and actor critic method implementation
< luffy1996> What are your say in this?
< zoq> luffy1996: Sounds good :)
< zoq> viraj: Hello, if you are going to write an SVM implementation, keep in mind that it should outperform libsvm, which might be pretty difficult, but if you're up for a challenge, then that might be a good one :)
< luffy1996> zoq: I have an idea how important is a gsoc proposal because one has to complete the project in summers. How do you see my plans. Is it apt or should I add one or two algorithms more :)
< luffy1996> zoq: Moreover I would like to know your views on its implementation. How do you see that policy gradient might help the public . I would like to here your views :)
< zoq> Actually, that depends on the details, e.g. Shangtong already worked on A3C, so it is interesting to see in what way you like to extend the existing codebase. Each idea has its advantages, e.g. it might not return the best results, but it is able to achieve good results in fewer iterations.
< viraj> zoq: I don't know about libsvm before so I searched about libsvm and found that it is very developed library. It would be very difficult to outperform their implementations.
csoni has joined #mlpack
< zoq> viraj: yeah, definitely challenging.
< viraj> I would like to know that is it a good idea to implement Genetic Algorithms in mlpack for gsoc.
csoni has quit [Quit: Leaving]
viraj has quit [Quit: Page closed]
droid has joined #mlpack
droid has quit [Client Quit]
vivekp has quit [Ping timeout: 252 seconds]
vivekp has joined #mlpack
< luffy1996> zoq: I have been through the codes so I know that A3C has been implementated. What I wanted was to go for PPO in actor critic .
< luffy1996> How do you see this ? :)
vivekp has quit [Ping timeout: 256 seconds]
vivekp has joined #mlpack
< zoq> luffy1996: Ah I see, that would be a really neat addition.
< zoq> viraj: Absolutely, there is an open PR for NEAT, that needs some additional work.
< luffy1996> So I am going with this as my proposal. I hope this will meet will all the requirements that mlpack would need :)
aman_ has joined #mlpack
robertohueso has joined #mlpack
daivik has joined #mlpack
robertohueso has quit [Quit: Lost terminal]
aman_ has quit [Ping timeout: 260 seconds]
ImQ009 has joined #mlpack
vivekp has quit [Ping timeout: 248 seconds]
vivekp has joined #mlpack
daivik has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]
vivekp has quit [Ping timeout: 240 seconds]
vivekp has joined #mlpack
moksh has joined #mlpack
< moksh> Hey, is it possible to run only some specific test cases? For example, I wrote code for a new variant of the adam optimizer and some test cases in the adam_test.cpp file. Is there a way to run only that test file?
< zoq> yes, bin/mlpack_test -t SuiteName/TestCase e.g. bin/mlpack_test -t AdamTest/AdamSphereFunctionTest
robertohueso has joined #mlpack
moksh has quit [Ping timeout: 260 seconds]
moksh_ has joined #mlpack
< moksh_> @zoq if I make any changes then do I have to build the entire library or it is possible to build just that adam module and its test case?
daivik has joined #mlpack
< zoq> moksh_: if you run make again it should also rebuild changes.
< moksh_> @zoq Okay. Thank you! :)
sumedhghaisas2 has quit [Read error: Connection reset by peer]
sumedhghaisas has joined #mlpack
sumedhghaisas2 has joined #mlpack
moksh_ has quit [Quit: Page closed]
sumedhghaisas has quit [Ping timeout: 260 seconds]
sumedhghaisas2 has quit [Ping timeout: 245 seconds]
daivik has quit [Remote host closed the connection]
ImQ009_ has joined #mlpack
ImQ009 has quit [Ping timeout: 256 seconds]
daivik has joined #mlpack
robertohueso has quit [Quit: leaving]
anirudhm has quit [Read error: Connection reset by peer]
anirudhm has joined #mlpack
daivik has quit [Quit: http://www.kiwiirc.com/ - A hand crafted IRC client]
ImQ009_ has quit [Read error: Connection reset by peer]
govg has quit [Ping timeout: 268 seconds]
Marquise has joined #mlpack
Marquise has quit [Ping timeout: 260 seconds]