Code covered by the BSD License  

Highlights from
Information loss of the Mahalanobis distance in high dimensions: Matlab implementation

Be the first to rate this file! 8 Downloads (last 30 days) File Size: 6.85 KB File ID: #30522

Information loss of the Mahalanobis distance in high dimensions: Matlab implementation

by Dimitrios Ververidis

 

24 Feb 2011

Information loss estimation to set a lower limit on classification rate.

| Watch this File

File Information
Description

The Mahalanobis distance between a pattern measurement vector of dimensionality D and the center of the class it belongs to is distributed as a chi^2 with D degrees of freedom, when an infinite training set is used. However, the distribution of Mahalanobis distance becomes either Fisher or Beta depending on whether cross-validation or re-substitution is used for parameter estimation in finite training sets. The total variation between chi^2 and Fisher as well as between chi^2 and Beta allows us to measure the information loss in high dimensions. The information loss is exploited then to set a lower limit for the correct classification rate achieved by the Bayes classifier that is used in subset feature selection.

Installation:
-------------

The 5 functions should be in the current path of Matlab.

Usage:
------

LowCCRLimit = LowCCRLimitInfLoss(D, CCR, NDc, CClasses, ErrorEstMethod)

% D: Dimensionality of the vector (2,3,4,5,...)
% CCR: The Correct Classification rate in [1/CClasses,1] (e.g. 0.8)
% NDc: The number of training samples per class (>D+1)
% CClasses: The number of classes in your problem (2,3,4,...)
% ErrorEstMethod: "Resub" for resubstitution
% "Cross" for cross-validation

Example:
--------
LowCCRLimitInfLoss(5, 0.75, 100, 5, 'Cross')

ans = 0.7288

References:
-----------

[1] Dimitrios Ververidis and Constantine Kotropoulos, "Information loss of the Mahalanobis distance in high dimensions: Application to feature selection,"

IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 12, pp. 2275-2281, 2009.

[2] Jeffrey, Knuth, "On the Lambert W Function", Advances in Computational Mathematics, volume 5, 1996, pp. 329-359.

Special thanks to Dr. Pascal Getreuer for implementing the lambertw2 function from Jeffrey Knuth publication.

MATLAB release MATLAB 7.5 (R2007b)
Tags for This File  
Everyone's Tags
Tags I've Applied
Add New Tags Please login to tag files.
Comments and Ratings (1)
06 Aug 2011 Mohammad

Dear Dimitrios
In fact i do not have access to your paper, is it possible to send me a copy of it so that I can cite it.
I sent you several emails but you did not reply
Kind Regards
Mohammad

Please login to add a comment or rating.
Tag Activity for this File
Tag Applied By Date/Time
pattern recognition Dimitrios Ververidis 24 Feb 2011 09:26:09
gaussian methods Dimitrios Ververidis 24 Feb 2011 09:26:09
feature selection Dimitrios Ververidis 24 Feb 2011 09:26:09
information loss Dimitrios Ververidis 24 Feb 2011 09:26:09
high dimensionality problem Dimitrios Ververidis 24 Feb 2011 09:26:09

Contact us at files@mathworks.com