View License

Download apps, toolboxes, and other File Exchange content using Add-On Explorer in MATLAB.

» Watch video

Highlights from
A road to classification in high dimensional space: the regularized optimal affine discriminant

Join the 15-year community celebration.

Play games and win prizes!

» Learn more

5.0 | 1 rating Rate this file 5 Downloads (last 30 days) File Size: 27.5 KB File ID: #40047 Version: 1.0

A road to classification in high dimensional space: the regularized optimal affine discriminant



A powerful method for binary classification in high dimensional space

| Watch this File

File Information

For high-dimensional classification, it is well known that
naively performing the Fisher discriminant rule leads to poor results
due to diverging spectra and noise accumulation.
Therefore, researchers proposed independence rules to circumvent
the diverging spectra, and sparse independence rules to mitigate the issue of noise
accumulation. However, in biological applications, there are often a group of correlated genes
responsible for clinical outcomes, and the use of the covariance information
can significantly reduce misclassification rates. In theory the extent of such error rate reductions is unveiled by comparing the misclassification rates of
the Fisher discriminant rule and the independence rule.
To materialize the gain based on finite samples,
a Regularized Optimal Affine Discriminant (ROAD) is proposed. ROAD
selects an increasing number of features as the regularization relaxes.
Further benefits can be achieved when a screening method
is employed to narrow the feature pool before hitting the ROAD.
An efficient Constrained Coordinate Descent algorithm (CCD)
is also developed to solve the associated optimization problems. Sampling properties of oracle type are established.
Simulation studies and real data analysis
support our theoretical results and demonstrate the advantages
of the new classification procedure under a variety of correlation structures. A delicate result on continuous piecewise linear solution path for the ROAD optimization problem at the population level justifies the linear interpolation of the CCD algorithm.

Paper available at

Required Products Bioinformatics Toolbox
MATLAB release MATLAB 7.14 (R2012a)
Tags for This File   Please login to tag files.
Please login to add a comment or rating.
Comments and Ratings (1)
14 Sep 2014 Junxiang Wang  

Contact us