verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
rcurtin_ has joined #mlpack
K4k has joined #mlpack
lozhnikov has joined #mlpack
govg has quit [Quit: leaving]
s1998 has joined #mlpack
gtank has quit [Read error: Connection reset by peer]
lozhnikov has quit [Read error: Connection reset by peer]
gtank has joined #mlpack
lozhnikov has joined #mlpack
s1998 has quit [Quit: Page closed]
s1998 has joined #mlpack
< s1998>
I had a PR wrt token 817 on DTree.The destructor is deleting the left and right children (the data ). And copy constructor is copying only the pointers (not the data). Is that Okay ?
< s1998>
The child node (data) needs to be copied or the pointer to the child node?
< rcurtin_>
s1998: I haven't had a chance to look at your PR lately, I'll try and find time to do it today
yashu-seth has joined #mlpack
< yashu-seth>
I was exploring the mlpack_ann module, but I could not a find a documentation for it. Can someone please help me with some resources to understand the module?
< s1998>
rcurtin: okay
yashu-seth has quit [Ping timeout: 260 seconds]
raphael29_ has joined #mlpack
yashu-seth has joined #mlpack
s1998 has quit [Ping timeout: 260 seconds]
yashu-seth has quit [Ping timeout: 260 seconds]
govg has joined #mlpack
mikeling has quit [Quit: Connection closed for inactivity]
travis-ci has joined #mlpack
< travis-ci>
mlpack/mlpack#1728 (master - c5c0b5a : Ryan Curtin): The build was fixed.
raphael29_ has quit [Quit: Konversation terminated!]
< rcurtin>
zoq: I'd like to merge #825 (the automatic bindings PR), did you have any comments to make? I can wait for a bit longer if you like :)
< zoq>
rcurtin: I'm almost done with the PR, maybe you can wait until tomorrow?
< rcurtin>
sure, no hurry!
< rcurtin>
I have the next part almost done... just today I did a short five-line refactoring of emst_main.cpp, added "add_python_binding(emst)" to emst/CMakeLists.txt, and got a working Python binding :)
< rcurtin>
so things are definitely coming together
< zoq>
Sounds great, I'm sure a lot of people are more than happy to use python instead of c++. So really excited to see this thing working.
< rcurtin>
yeah
< rcurtin>
the main missing piece, for now, is the serialization of models... I've done it with some handwritten bindings, but I'm not doing it automatically yet
< rcurtin>
like I can easily write a binding that serializes the model to a binary blob or whatever using boost serialization, but that means if I do, e.g.,
< rcurtin>
model = mlpack.hoeffding_tree(X, y, other_options=...)
< rcurtin>
then it will have to deserialize that entire model string, which can take a while
< rcurtin>
so in a handwritten binding I managed to have 'model' just be a memory pointer to an initialized model object and it seemed to work fine
< rcurtin>
I just need to figure out how to do that automatically... but I definitely think it's possible :)
< zoq>
hm, if you say the deserialize of the entire model takes some time, does that mean it's basically unusable?
< rcurtin>
well, sort of unusable, but not entirely
< zoq>
Would be super easy, that way
< rcurtin>
in the example of Hoeffding trees, the overhead for serialization/deserialization for trees of any reasonable size is hundreds of ms to a few seconds
< rcurtin>
so if I'm doing something like looping over points in a dataset and predicting the label of a single point, it takes forever
< rcurtin>
for some models which are very small, like the regression models (LARS/linear regression/etc.), the serialization overhead would be virtually zero
< zoq>
ah yeah, perhaps a RAMDisk is the solution :)
< rcurtin>
yeah, especially with RAM being so cheap now
< rcurtin>
I've considered mounting my home desktop's / on a ramdisk... there's more than enough RAM to do it