ChanServ changed the topic of #mlpack to: "mlpack: a fast, flexible machine learning library :: We don't always respond instantly, but we will respond; please be patient :: Logs at http://www.mlpack.org/irc/
< yashwants19[m]>
I guess scheduled cron workflow is working fine.
ayush has joined #mlpack
< RishabhGarg108Gi>
Hii, I am new to C++ design patterns. Can someone clarify that the "visitors" in `methods/ann/visitor` corresponds to "visitor design patterrn" or is it something else ?
< rcurtin>
RishabhGarg108Gi: yep, that's correct; we are using the boost::visitor implementation
< rcurtin>
taqd: glad you got it worked out! thanks for opening the PR even though that turned out to not be a bug
< rcurtin>
could we improve our documentation or error messages to make it easier for you to have found out that the labels needed to be transposed?
< RishabhGarg108Gi>
Okay. Thanks @rcurtin !
ayush has quit [Ping timeout: 264 seconds]
< rcurtin>
so inside of my company I made bindings in our database product to mlpack, and I just got some really nice feedback yesterday in our public "#appreciation" channel:
< rcurtin>
> Big appreciation to @Ryan Curtin for a very comprehensive set of ML algorithms and a very easy interface to use in Delve through his bindings to mlpack
< rcurtin>
but really that appreciation should go to everyone here who makes all those algorithms possible :)
< zoq>
Ohh that is great to hear.
< rcurtin>
at the current moment, mlpack has a lot of exposure inside the company since it is the only machine learning toolkit accessible from our database language
< jeffin143[m]>
rcurtin: not sure if I have asked you aldeardy ?
< jeffin143[m]>
So you are making a db which has some intelligence ??
< rcurtin>
jeffin143[m]: sort of... the quick "pitch" is this... most data scientists currently extract their data from the database into CSVs or something, then load it into their machine learning environment (could be Python, C++, R, whatever) and do their machine learning
< rcurtin>
but many times, just the cost of getting the data out of the database could be a lot; it could be a complex join of some sort that is really expensive
< rcurtin>
so there are some cases in which, it turns out, you could learn the machine learning algorithm directly on the underlying database tables *without* performing that expensive join, and this can give orders-of-magnitude speedups
< jeffin143[m]>
Woohhhh that was nice
< rcurtin>
so the idea is that we will build a database that does this, and we chose to make a new query language (not SQL, more like Datalog if you have ever heard of it---I hadn't :))
< rcurtin>
the hope is that in this query language, it's very easy to express machine learning algorithms
< rcurtin>
but for now, we only have mlpack bindings :)
< abernauer[m]>
rcurtin: Is it going to be a DSL? I like the idea though seems like a great way to avoid that common workflow setback.
< rcurtin>
abernauer[m]: not a DSL, actually a completely new query language
ayush has joined #mlpack
< abernauer[m]>
Well sounds awesome and congrats on the appreciative feedback. Sounds like it addresses a real need for data scientists that's not currently being addressed.
< rcurtin>
let's hope :) it's a very hard product to build since it's so interdisciplinary
< rcurtin>
so I have to work with lots of database and compiler people, and we all speak completely different languages
< rcurtin>
not only that but even our implementation language is foreign to many of us... we are writing the product entirely in Julia, but very few of us are "Julia native"
< abernauer[m]>
<rcurtin "not only that but even our imple"> Yeah that sounds like one of the more difficult challenges. Personally, whenever I learn or work in a language try to write code that's idiomatic to that language.
< rcurtin>
yeah, so interestingly, we get to define what "idiomatic" means, and that is one of the big discussion points in the company
< rcurtin>
if someone writes a query, what is the "right" way to have written that query that we want to encourage people to use?
< abernauer[m]>
Yeah I would think that get's complicated or opinionated as you're all coming from different backgrounds as mentioned. Sounds like a fun challenge though :')
< rcurtin>
opinionated for sure :-D
< jeffin143[m]>
> so the idea is that we will build a database that does this, and we chose to make a new query language (not SQL, more like Datalog if you have ever heard of it---I hadn't :))
< jeffin143[m]>
@rcurtin:matrix.org since I some what work in cloud space so I have seen datadog and have worked with it
< rcurtin>
oh no I mean 'datalog' not 'datadog' :)
< AnushKiniGitter[>
Hey guys, is there any way to restart the pipeline checks on a PR without making a commit?
< rcurtin>
I think you can close and open the PR and that will re-trigger them
< AnushKiniGitter[>
Thanks! Will try it out.
ImQ009 has quit [Quit: Leaving]
qur70 has quit [Ping timeout: 240 seconds]
taqd has joined #mlpack
< taqd>
Hi again, in mlpack_perceptron (and elsewhere) the code references "labels(i,j)", "predictedLabels(i,j)" and "instanceWeights(i,j)" and for the life of me I can't find where they are defined. Any suggestions?