#mlpack on 2023-05-12 — irc logs at libera.irclog.whitequark.org

2021-07-27 15:44 rcurtin_irc changed the topic of #mlpack to: mlpack: a scalable machine learning library (https://www.mlpack.org/) -- channel logs: https://libera.irclog.whitequark.org/mlpack -- NOTE: messages sent here might not be seen by bridged users on matrix, gitter, or slack

05:22 robobub has quit [Ping timeout: 240 seconds]

05:23 robobub has joined #mlpack

09:54 heisenbuugGopiMT has joined #mlpack

11:58 <akhunti1[m]> Hi Team,

11:58 <akhunti1[m]> I have created a Random Forest model using MLpack cli. When I used this model for inference using MLpack cli, I got a prediction score of 0.256234749665911. However, when I loaded this model using the Mlpack C++ load function and performed inference, I got a prediction score of 0.5053640966677341. Could you please let me know why there is a difference in the prediction score?

11:58 <akhunti1[m]> I also tried with other computer i got the same out put mentioned above .

12:10 <akhunti1[m]> I just wanted to inform you that I have not observed this type of difference for all the records. Out of the 6,000 records I tested, I found only one record that had this type of difference in probability.

12:12 <akhunti1[m]> Could you please guide me on any possible solutions or workarounds for this issue?

12:30 <akhunti1[m]> * I just wanted to inform you that I have not observed this type of difference for all the records. Out of the 6,000 records I tested, I found only one hundred records that had this type of difference in probability.

12:31 <akhunti1[m]> s/only/100/, s/one/records/, s/record//

12:33 <rcurtin[m]> the first difference, the very small one, didn't concern me much---that's probably a tiny floating point difference or something. this second one seems a bit more serious but I have a feeling it is a similar cause. can you tell us how you are loading the data and model in each case? and what format the data and model are saved in?

12:36 <akhunti1[m]> This is for cli :... (full message at <https://libera.ems.host/_matrix/media/v3/download/libera.chat/ba1c90401f3c5026947432a27d749b376ff6b003>)

12:38 <akhunti1[m]> And this command is for c++ :

12:38 <akhunti1[m]> curl --http0.9 -X POST -H 'Content-Type: application/json' -d'{"data": { "ndarray": [4,5,0,0,0,0,0,0,0,0,0]}}' http://localhost:9000/api/v1.0/predictions

12:40 <rcurtin[m]> okay, I see... can you show what one of the points that has the difference in probability is, both when calling from C++ and in test.np.without_label.csv?

12:51 <akhunti1[m]> Hi rcurtin this is the data :... (full message at <https://libera.ems.host/_matrix/media/v3/download/libera.chat/85e038fd1ca92b85f732e67a146263b7db3a08c0>)

12:51 <akhunti1[m]> which having different probability

12:52 <akhunti1[m]> Did i answer your question correctly ?

12:54 <akhunti1[m]> yes both in c++ and test.np.without_label.csv file

12:56 <rcurtin[m]> okay, thanks... those points are simple enough that there should not be any floating point errors involved

12:56 <rcurtin[m]> are you sure that your JSON parsing code is correct? I think that you should carefully check that the point you are reading in your C++ model is exactly the same as what you just pasted above