Why is the reported accuracy of the Classification learner app very low (51%), while in the scatter plots, no incorrect model predictions are reported?
5 views (last 30 days)
Show older comments
I have loaded a training dataset to the learner app. The dataset contains 22 predictors and I used the default 5 fold cross validation (other values have no influence). All classifiers score very low accuracies (51% or lower) apart from the logistic reggression model, that one scores 100%. It doesnt matter which features I chose, no real big changes occur, only less accuracy is acquired. When I check these results in the scatter plots, no incorrect model predictions are reported as shown in the image below. You can see that the dataset contains 4433 NaN's which are hidden. The confusion matrix shows the 4433 hidden observations as False positives, which is why the accuracy is so low. Apparently the training dataset contains 4433 observations which only contains NaN values. Since they were supposed to be hidden, I assumed they will not be used in the calculation of the accuracy, but apparently they are. Is that correct?
0 Comments
See Also
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!