Code covered by the BSD License  

CVAP: Cluster Validity Analysis Platform (cluster analysis and validation tool)

Rating: 4.4 (11 ratings) | 89 Downloads (last 30 days) | File Size: 80.5 KB | File ID: #14620


by

 

12 Apr 2007 (Updated )

A GUI-based tool supplying over 17 validity indices and 5 clustering algorithms


File Information
Description

Cluster validation is an important and necessary step in cluster analysis. CVAP is a GUI-based visual cluster validation tool that provides a convenient environment for evaluating the validity of clustering solutions, estimating the number of clusters, and comparing the performance of candidate clustering algorithms.

CVAP includes 4 external validity indices, 14 internal validity indices and 5 clustering algorithms (K-means, PAM, hierarchical clustering, etc.). Other clustering algorithms are supported by loading a solution file with class labels, or by adding new code. Two similarity metrics are supported: Euclidean distance and the Pearson correlation coefficient.
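As a rough illustration of the two similarity metrics named above, here is a minimal Python sketch (not CVAP's MATLAB code; the function names are hypothetical). It assumes plain lists of numbers of equal length.

```python
import math

def euclidean_distance(x, y):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def pearson_correlation(x, y):
    """Pearson correlation coefficient between two equal-length vectors."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

u = [1.0, 2.0, 3.0]
v = [2.0, 4.0, 6.0]
print(euclidean_distance(u, v))   # the vectors are far apart in space
print(pearson_correlation(u, v))  # yet they have the same shape (perfectly correlated)
```

The example shows why the choice matters: two profiles can be distant under the Euclidean metric while being perfectly correlated.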

Useful information and references are in the help file "Readme.txt"; please read it first.

Required Products: Statistics Toolbox
MATLAB release: MATLAB 7.2 (R2006a)
Comments and Ratings (21)
16 May 2014 arturo Garcia

Hello, I am working on a clustering project and I need to validate the data grouping.
Thanks

18 Dec 2013 NR P

Please review valid_internal_deviation.m: I believe it has errors in the correction factors for the Calinski-Harabasz and Krzanowski-Lai indices, and the Davies-Bouldin index estimate appears to be derived from incomplete pairwise estimates.
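For reference when checking the correction factors, the textbook Calinski-Harabasz index is (B / (k - 1)) / (W / (n - k)), where B is the between-cluster sum of squares, W the within-cluster sum of squares, k the number of clusters and n the number of points. The Python sketch below follows that definition; it is not taken from valid_internal_deviation.m, and the function name is hypothetical.

```python
def calinski_harabasz(points, labels):
    """Textbook Calinski-Harabasz index for points grouped by labels."""
    n = len(points)
    dim = len(points[0])
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    k = len(clusters)
    overall = [sum(p[d] for p in points) / n for d in range(dim)]
    between = 0.0  # B: size-weighted squared distance of centroids to the overall mean
    within = 0.0   # W: squared distance of points to their own centroid
    for members in clusters.values():
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        between += len(members) * sum((c - o) ** 2 for c, o in zip(centroid, overall))
        within += sum(
            sum((p[d] - centroid[d]) ** 2 for d in range(dim)) for p in members
        )
    # Correction factors: (k - 1) for B and (n - k) for W.
    return (between / (k - 1)) / (within / (n - k))

points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
labels = [1, 1, 2, 2]
print(calinski_harabasz(points, labels))
```

For this toy data B = 100, W = 1, k = 2 and n = 4, so the index is (100 / 1) / (1 / 2) = 200, which makes it easy to compare against another implementation by hand.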

18 Dec 2013 NR P  
03 Sep 2013 Chenghao

When applying the k-means algorithm with the Euclidean distance function, the kmeans call in valid_clusteringAlgs.m (at line 29 of CVAP version 3.7) uses the distance function R = 'sqEuclidean';. However, when evaluating with the silhouette measure in valid_internal.m (at line 28), the code uses R = 'euclidean';. I think the same distance function should be used.
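To see how much the choice of metric can change the silhouette value for one and the same partition, consider this small Python sketch. It is a plain reimplementation of the standard silhouette definition, not CVAP's or MATLAB's code, and all names in it are hypothetical.

```python
import math

def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def sq_euclidean(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y))

def mean_silhouette(points, labels, dist):
    """Mean of s(i) = (b - a) / max(a, b) over all points."""
    scores = []
    for i, (p, lab) in enumerate(zip(points, labels)):
        # a: mean distance to the other members of the point's own cluster
        own = [dist(p, q) for j, (q, l) in enumerate(zip(points, labels))
               if l == lab and j != i]
        if not own:
            scores.append(0.0)  # convention for singleton clusters
            continue
        a = sum(own) / len(own)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(dist(p, q) for q, l in zip(points, labels) if l == other)
            / labels.count(other)
            for other in set(labels) if other != lab
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

points = [(0.0,), (1.0,), (4.0,), (5.0,)]
labels = [1, 1, 2, 2]
s_plain = mean_silhouette(points, labels, euclidean)
s_squared = mean_silhouette(points, labels, sq_euclidean)
print(s_plain, s_squared)
```

On this toy data the squared metric yields a noticeably higher mean silhouette than the plain one, so values computed with different metrics are not directly comparable.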

11 Jun 2013 Nejc Ilc

Many thanks to the author for his valuable and useful contribution. However, I think there is a bug in the implementation of Dunn's index.

I have compared results from CVAP with Julian Ramos' implementation (http://www.mathworks.com/matlabcentral/fileexchange/27859) and also with the R package clValid. Output values from clValid and Ramos' code are identical, whereas the CVAP results do not agree with them. I think there are errors in computing the diameter of clusters and the shortest distance between clusters (function 'valid_sumsqures'). Please consider revising this part of the code.
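For comparison purposes, one common definition of Dunn's index is the smallest distance between points in different clusters divided by the largest cluster diameter (the largest distance between two points in the same cluster). Several variants exist, so the Python sketch below is only one reading of the index; it is not CVAP's, Ramos' or clValid's code, and the names are hypothetical.

```python
import math
from itertools import combinations

def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def dunn_index(points, labels):
    """Minimum inter-cluster distance divided by maximum cluster diameter."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    # Diameter: largest pairwise distance inside any single cluster.
    max_diameter = max(
        (euclidean(p, q)
         for members in clusters.values()
         for p, q in combinations(members, 2)),
        default=0.0,
    )
    # Separation: smallest distance between points of two different clusters.
    min_separation = min(
        euclidean(p, q)
        for a, b in combinations(clusters.values(), 2)
        for p in a
        for q in b
    )
    # Raises ZeroDivisionError if every cluster is a single point.
    return min_separation / max_diameter

points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
labels = [1, 1, 2, 2]
print(dunn_index(points, labels))
```

Here the diameter of each cluster is 1 and the closest points of the two clusters are 10 apart, so the index is 10, a value that can be verified by hand against any other implementation.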

25 Feb 2013 Mitchell Yuwono  
20 Aug 2012 Lucas

As this was made for an older version of matlab, the pamc.dll is no longer valid.
To solve errors with the pamc function, download the latest mex files from here:

http://wis.kuleuven.be/stat/robust/LIBRA

32-bit win => pamc.mexw32
64-bit win => pamc.mexw64

Alternatively, 32-bit windows users can also try renaming the pamc.dll to pamc.mexw32 although this has not been tested.

20 Aug 2012 Lucas

The link below should be :

http://wis.kuleuven.be/stat/robust/LIBRA

26 Jan 2012 LI

I found a problem in CVAP: when I use pam.m, it needs to call pamc.m, but pamc.m does not exist in this CVAP release.

02 Jun 2011 Joseph

What is the best place to add a validity index? I saw the place for a new clustering algorithm, but don't see anything similar for indices.

02 May 2011 can

I think this tool is good.

11 Apr 2011 Karin

I am having trouble uploading a data file. Mine is a .txt, but I still get loads of errors. Why?

??? Attempted to access data(:,1); index out of bounds because size(data)=[0,0].
Error in ==> valid_data_load at 18
handles.truelabels = data(:,1);
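The message says the loaded matrix is empty (size [0,0]), so the failure happens before the first column is read as the true labels. The Python sketch below illustrates the kind of check that makes such a problem visible earlier. It assumes whitespace-separated numeric rows with the class label in column 1, which is only inferred from the trace above; the format CVAP actually expects is described in its Readme.txt, and the function name is hypothetical.

```python
def load_labeled_data(lines):
    """Parse whitespace-separated numeric rows; column 1 is the class label."""
    rows = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        rows.append([float(f) for f in fields])
    if not rows:
        # Fail with a clear message instead of an index-out-of-bounds error.
        raise ValueError("no numeric rows were parsed; check the file format")
    true_labels = [int(row[0]) for row in rows]
    features = [row[1:] for row in rows]
    return true_labels, features

sample = ["1 0.5 1.2", "", "2 0.7 1.1"]
print(load_labeled_data(sample))
```

A non-numeric header line or an unexpected delimiter would make `float` raise in this sketch, which points directly at the offending content of the file.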

02 Nov 2010 zhen gao

How do I use it? When I type "mainCVAP",
it shows "Undefined function or variable 'mainCVAP'". Why?

02 Oct 2010 agnes cawas

I'm looking for a method to validate the clusters produced by k-means, and I really want to learn about validating k-means clustering. Do you have any ideas?
Then I read this article, and I want to know how to calculate the error rate of a k-means clustering result...
Does this use SSE (sum of squared errors)?
Please reply...
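On the SSE question: the sum of squared errors is the total squared distance of every point to the centroid of its cluster. It measures compactness and needs no true labels, so it is not an error rate; an error rate requires known class labels to compare against. A minimal Python sketch of SSE follows (not CVAP code; the function name is hypothetical).

```python
def sum_squared_error(points, labels):
    """Total squared distance of each point to its cluster centroid."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    sse = 0.0
    for members in clusters.values():
        dim = len(members[0])
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        sse += sum(
            sum((p[d] - centroid[d]) ** 2 for d in range(dim)) for p in members
        )
    return sse

points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
labels = [1, 1, 2, 2]
print(sum_squared_error(points, labels))
```

Each of the four points lies 0.5 from its centroid, contributing 0.25, so the SSE of this partition is 1.0.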

18 Mar 2010 Cara Tobin

Hello, this model is very user-friendly with the demo files. However, if I have 8000 parameter sets (each with 3 parameters), how does one prepare the input file? I had a matrix of 8000 rows with 3 columns (no classes) and this didn't seem to work. Any suggestions? THANK YOU!! Cara
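One possible cause, assuming the loader treats the first column as class labels (as the line handles.truelabels = data(:,1) quoted in another comment on this page suggests), is that a label-free matrix has its first parameter misread as labels. The Python sketch below prepends a placeholder label column before writing the rows out. Whether CVAP accepts placeholder labels is not confirmed here, so check Readme.txt; the function names and the placeholder value are hypothetical.

```python
def prepend_placeholder_labels(rows, placeholder=1):
    """Return new rows with a constant label inserted as the first column."""
    return [[placeholder] + list(row) for row in rows]

def format_rows(rows):
    """Render rows as whitespace-separated text, one row per line."""
    return "\n".join(" ".join(str(v) for v in row) for row in rows)

# Two of the 8000 parameter sets, three parameters each (made-up values).
parameters = [[0.5, 1.2, 3.0], [0.7, 1.1, 2.9]]
text = format_rows(prepend_placeholder_labels(parameters))
print(text)
```

With a constant placeholder, any external validity index computed against those labels would be meaningless; only the internal indices would be informative for such data.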

20 Mar 2009 Gilles Criton

Very useful. I have been looking for such a tool for a long time. Have you developed a validity index for subspace clustering?

Thank you very much, great job !

07 Oct 2008 Dennis Lorch

Very good tool. Does anyone know how to include alternative distances in this GUI?

02 Jun 2008 Guillermo Quintas

I found this work really good and very useful. I have added the option of using matrices from the workspace. I think that it really improves the usefulness of the GUI.

24 Jan 2008 Leon Kegel

Very useful toolbox.
You have really done a great job on CVAP!

30 Apr 2007 yurniar rica

I need MATLAB code for hierarchical clustering of images...

29 Apr 2007 ling zhao

I think your toolbox is just fine. I will be using it.

Updates
23 Apr 2007

new release (version 3.4)

23 Apr 2007

new release (version 3.42)

28 Jul 2008

new release CVAP3.5

01 Jul 2009

new release CVAP3.6

09 Jul 2009

new release (version 3.6)

25 Jul 2009

new release (version 3.7)
