#mlpack on 2022-12-20 — irc logs at libera.irclog.whitequark.org

2021-07-27 15:44 rcurtin_irc changed the topic of #mlpack to: mlpack: a scalable machine learning library (https://www.mlpack.org/) -- channel logs: https://libera.irclog.whitequark.org/mlpack -- NOTE: messages sent here might not be seen by bridged users on matrix, gitter, or slack

03:07 jjb[m] has joined #mlpack

03:08 <jjb[m]> ryan nice! I saw a few items on the “Should require only C++” that I’ll aim to tackle.

19:23 <zoq[m]> Some really good numbers.

19:24 <rcurtin[m]> I still have some minor bugs in my OpenCL XORWOW implementation, but I have it within ~4x of the runtime of cuRAND. I'll probably spend a couple more hours with it, but `randu()` performance is not the most important thing in the world so probably not much more time than that... for now 😃

19:24 <rcurtin[m]> The Philox generator you wrote will be what I use for randn() 👍️

19:25 <zoq[m]> Looking through https://gitlab.com/conradsnicta/bandicoot-code/-/merge_requests/27/diffs now

19:25 <rcurtin[m]> It's... a lot 😃

19:27 <zoq[m]> So far the implementation is straightforward, so it's easy to review.

19:28 <rcurtin[m]> Those array operations are the easiest kernels to write and tune; they're very boilerplate. There might be some extra performance that one could squeeze out of each operation, but that's a task for another time...
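
For readers unfamiliar with the XORWOW generator mentioned at 19:24, here is a minimal CPU-side sketch in C++ of Marsaglia's xorwow recurrence, with a conversion to a uniform float in [0, 1) of the kind a randu()-style routine needs. This is not the OpenCL kernel from the merge request; the names `XorwowState`, `xorwow_seed`, `xorwow_next`, and `xorwow_uniform` are hypothetical, and the LCG-based seeding is an assumption made only so the example is self-contained.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical illustration only; not the bandicoot implementation.
struct XorwowState {
    std::uint32_t x[5];     // xorshift state words
    std::uint32_t counter;  // Weyl sequence counter
};

// Assumed seeding scheme: fill the state from a simple 32-bit LCG.
// Real libraries typically use a more careful seeding procedure.
inline XorwowState xorwow_seed(std::uint32_t seed) {
    XorwowState s{};
    std::uint32_t v = seed;
    for (int i = 0; i < 5; ++i) {
        v = v * 1664525u + 1013904223u;
        s.x[i] = v;
    }
    s.x[0] |= 1u;  // the xorshift words must not all be zero
    assert(s.x[0] != 0u);
    s.counter = 0u;
    return s;
}

// One step of the xorwow recurrence (Marsaglia, "Xorshift RNGs", 2003).
inline std::uint32_t xorwow_next(XorwowState& s) {
    std::uint32_t t = s.x[4];
    const std::uint32_t s0 = s.x[0];
    s.x[4] = s.x[3];
    s.x[3] = s.x[2];
    s.x[2] = s.x[1];
    s.x[1] = s0;
    t ^= t >> 2;
    t ^= t << 1;
    t ^= s0 ^ (s0 << 4);
    s.x[0] = t;
    s.counter += 362437u;
    return t + s.counter;
}

// Map the top 24 bits to a float in [0, 1); 24 bits fit a float mantissa exactly.
inline float xorwow_uniform(XorwowState& s) {
    return static_cast<float>(xorwow_next(s) >> 8) * (1.0f / 16777216.0f);
}
```

On a GPU, each thread would typically hold its own state like this one; how the merge request actually lays out and seeds per-thread state is not shown in this log.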