Short script that calculates root mean square error from data vector or matrix and the corresponding estimates.
Checks for NaNs in data and estimates and deletes them and then simply does:
r = sqrt( sum( (data(:)-estimate(:)).^2) / numel(data) );
yes, the two sums could be avoided by simply writing
r=sqrt(sum((data(:)-estimate(:)).^2)/numel(data))
The computation time is about the same but readability might be enhanced by using the colon operator.
Best regards,
Wolfgang
10 Oct 2008
Felix Hebeler
@Gary: no, you need two sums if you process matrices, the first sums across all columns, the second then sums across the resulting vector. If you process vectors, the second sum calculates the sum of a scalar. Faster than checking for dimensions first.
09 Oct 2008
Gary Merkoske
you have one too many SUM() in the eqn, although it appears to be harmless. Am I correct? RMS Error is then;
r=sqrt(sum((data-estimate).^2)/numel(data))
11 Sep 2008
Felix Hebeler
Thanks for the feedback Wolfgang, I completely forgot that nansum needs the statistical toolbox, and of course you are right that it becomes incorrect with nans. I should have divided by numel(~isnan(data)), but deleting all NaNs in this case _is_ better! Your version actually would extract all NaNs and discard the values, so I used
I = ~isnan(data) & ~isnan(estimate); instead, which works a treat!
Durga, it's great you advertise your script on my page ;-) I see no point in input argument checking for this oneliner though - in my case I would have to reshape my matrices to use your script, not sure if that is better...
Anyway, once your script takes care of NaNs as suggested by Wolfgang, it is surely great as it calculates more than one goodness of fit.
10 Sep 2008
Wolfgang Schwanghart
Hi Felix,
the formula becomes incorrect as soon as you have nans in your arrays. You should remove nans first in both arrays
I = isnan(data) | isnan(estimate);
data = data(I);
estimate = estimate(I);
and then apply the formula. That even allows you to use sum instead of nansum, thereby avoiding dependence on the statistical toolbox.
09 Sep 2008
Durga Shrestha
This code is without input argument checking.
To compute more types of goodness of fit (including RMSE, coefficient of determination, mean absolute relative error etc.) please have a look
http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=7968&objectType=file
Updates
11 Sep 2008
include NaN checking
11 Sep 2008
- delete NaNs and use sum instead of nansum, eliminating the need for the statistical toolbox
13 Oct 2008
By popular demand: using sum(data(:)) instead of sum(sum(data)). Thanks!
27 Nov 2008
Updated description and code for better readability and