verne.freenode.net changed the topic of #mlpack to: http://www.mlpack.org/ -- We don't respond instantly... but we will respond. Give it a few minutes. Or hours. -- Channel logs: http://www.mlpack.org/irc/
marcosirc has quit [Quit: WeeChat 1.4]
nilay has joined #mlpack
nilay has quit [Quit: Page closed]
mentekid has joined #mlpack
nilay has joined #mlpack
dms has joined #mlpack
< dms>
hello guyz can you tell me how to know about various folders in mlpack-2.0.2 ?
< dms>
What they do ? And how can they be used ? Should I know about all ? Or is it not necessary to do that ?
< mentekid>
dms: do you mean inside src/mlpack?
< dms>
yeah
< mentekid>
well, I'm not sure, but I would say
< mentekid>
core/ has the main functionality, that's where stuff like loading data/ what headers are included is
< mentekid>
it also has stuff that is common to many algorithms, like distance computations, kernels, tree traversal etc
< dms>
okay ..and what about folders outside src ?
< dms>
do we need to care them ?
< mentekid>
methods/ implements the machine learning methods mlpack offers - for example in methods/pca there is code for Principal Component Analysis. In each folder there's source code for an execuatble (I think!) that ends up in build/bin/ after compilation
< mentekid>
depends what you need to do - CMake is files for the cmake program (which configures how mlpack is going to be built)
< mentekid>
doc has the documentation, so for example if you can't access mlpack.org/docs.html for some reason you can find it there
< mentekid>
but the code is in src/mlpack
< dms>
ohkay ..so tell me how can I implement any algorithm with any command?
< mentekid>
you mean a new algorithm?
< dms>
what should i type in my terminal to implement some learning algorithm like linear regression
< mentekid>
for mlpack? Or just use mlpack to implement your own stuff?
< mentekid>
ah
< dms>
use mlpack to implement my sutff
< mentekid>
There's two ways. You can use the binaries in build/bin, for example it would be build/bin/mlpack_linear_regression or something for LR
< mentekid>
(I'm not sure how the binary is named, I haven't compiled it because I don't need it)
< dms>
how to load data in terminal ?
< mentekid>
so you type bin/<binaryname> -h which will give you the help on how to use it. Each binary has ways to load training/testing data
< dms>
okk and ?
< mentekid>
give me a moment
< dms>
sure
< mentekid>
so, yeah, you look into the help output to find what switches you need (switches are the -<letter> or --<phrase-connected-by-dashes> commands)
< mentekid>
so for example you might need to specify the output file, you find what the switch is for that, let's say -o, and you type bin<binaryname> -o myoutput.csv
< mentekid>
you add as many switches and their arguments (if any) as you want, and in any order (there are some combinations that don't make sense, mlpack will produce an error to let you know)
< mentekid>
for example, I have the iris_train.csv and iris_test.csv and I want, for every point in iris_test.csv to find the 5 nearest neighbors and store their IDs in iris_5nn.csv
< marcosirc>
I have been working on the benchmarking system. I added a new "view" as you suggested.
nilay has quit [Ping timeout: 250 seconds]
< marcosirc>
I was wondering why timing is not considered as the metric. This would make the code simpler, specially for the views.
< marcosirc>
*as a metric.
< zoq>
marcosirc: hm, at the time we implemented the timing part we didn't think about adding other metrics, but now that you brought it up, it sounds like a good idea. It should be straightforward to merge the timing and metric results.