Dear Sebastien,
I compiled the mex-files successfully. "sample_mvgm" and "ndellipse" ran without problems, but something went wrong with "yael_kmeans": MATLAB reported that "MATLAB has encountered an internal problem and needs to close".
Please give me a hand.
Compiler: Microsoft Software Development Kit (SDK) 7.1
MatlabRelease: matlab 7.14.0.739 (R2012a)
3
22 Oct 2013
Hi Tsan,
I didn’t spend a lot of time trying to understand your data, but I did manage to cluster them in less than 1 second, using clusterData. I noticed that your column 2 isn’t fully filled out. I think that’s why you’re seeing the long delay when you include column 2. If you were to exclude the pairs with missing values, it would process a lot faster. (In fact, I’m not sure how I treated missing values. Maybe as NaNs.)
Let me know if the clustering you get with
[clusters,clusterInds,clusterBounds] = clusterData(Binningbydensity(1:3216,:));
works for you. (Those are the rows without missing column-two values.)
Cheers,
Brett
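Excluding rows with missing values before clustering, as suggested above, could be sketched as follows. This is only an illustration: the small `data` matrix is made up, and it assumes missing entries are stored as NaN, which the thread does not confirm.

```matlab
% Hypothetical two-column data; NaN marks a missing entry (an assumption).
data = [1 2; 3 NaN; 5 6; NaN 8];

% Keep only the rows in which every column has a value.
validRows = ~any(isnan(data), 2);
cleanData = data(validRows, :);

% cleanData could then be passed on in the same form as the call above:
% [clusters, clusterInds, clusterBounds] = clusterData(cleanData);
```

This selects complete rows by content rather than by a fixed row range, so it still works if the missing entries are not all at the end of the column.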
20 Oct 2013
Hi Brett,
If I use your suggested method, would it just group data together based on density, without considering the distance between the data points? For example, let’s say the data range from 1 to 10. The number of observations at 1 is the same as at 10, while the counts in between are markedly different. Would your function then just put 1 and 10 in the same bin?
For my purposes, I just want to group adjacent bins of the same or similar density together.
I also included a web link for my data just to give you an idea of what kind of data I am dealing with. I provided 2 cols, each is a different random variable.
https://docs.google.com/spreadsheet/ccc?key=0Anv9v54gTjMedGtiRW5fanFRUFBOcW4xUTJ4NHFWbFE&usp=drive_web#gid=0
Another question: for the dataset in the 2nd column, it seems to run for a particularly long time. The data are just integers centered around 1, with dispersion as far as 7. Is there any workaround for that?
Thanks.
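One way to group only adjacent bins of similar density, so that distant bins with equal density stay separate, might look like the sketch below. It does not use clusterData; the `density` vector and the tolerance `tol` are made-up values for illustration.

```matlab
% Hypothetical per-bin densities; the first and last bins share the same
% density but are far apart.
density = [5 5 6 20 21 19 5 5];
tol = 2;   % assumed threshold for "similar" density

% Start a new group wherever neighbouring bins differ by more than tol.
breaks = abs(diff(density)) > tol;
groupId = cumsum([1, breaks]);
```

Here `groupId` is `[1 1 1 2 2 2 3 3]`: the low-density bins at the two ends land in different groups because only neighbouring bins are ever compared.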
19 Oct 2013
@tsan: Hi Tsan,
It's difficult to comment without seeing your data, but it sounds like you could just create and analyze a vector of densities. clusterData will spit out the indices for the groupings. (You may need to tweak the sensitivity.)
Cheers,
Brett
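Creating a vector of densities from raw integer observations could be sketched like this. The sample `x` and the bin `edges` are assumptions for illustration; `histc` is used because it is available in older MATLAB releases (newer releases also offer `histcounts`).

```matlab
% Hypothetical integer observations between 1 and 7.
x = [1 1 1 2 2 3 7 7 7 7];

% Integer bin edges; the last bin of histc counts values equal to edges(end).
edges = 1:8;
counts = histc(x, edges);

% Normalise the counts so the densities sum to 1.
density = counts / sum(counts);
```

The resulting `density` vector has one entry per bin and can then be analyzed or grouped in place of the raw observations.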
