Defining the 95% of data which are around the mean value

Question

Giorgos Papakonstantinou on 31 Jul 2013

0
Link

Direct link to this question

https://www.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value

For a given set of data, how can I define which of those correspond to the 95% of the data which are around the mean value?

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Jan on 1 Aug 2013

0
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value#answer_93314

Edited: Jan on 1 Aug 2013

Open in MATLAB Online

x = rand(1, 1000) - 0.5;
m = mean(x);
dist = abs(x - m);
[sortDist, sortIndex] = sort(dist);
index_95perc = sortIndex(1:floor(0.95 * numel(x)));
x_95percent = x(index_95perc);

1 Comment
Show -1 older commentsHide -1 older comments

Giorgos Papakonstantinou on 1 Aug 2013

Open in MATLAB Online

Thank you Jan. It was easier than I expected. Before your answer I was doing the folllowing:

vals=abs(slope);
[CdfY,CdfX] = ecdf(vals,'Function','cdf');  % compute empirical function
cr=CdfY<0.95;

where vals is my dataset.

Sign in to comment.

Answer 2

Image Analyst on 31 Jul 2013

0
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value#answer_93230

I'd sort the data using sort(). Then use cumsum() to get the cdf. Normalize the CDF then go from the 2.5% element to the 97.5% element using find() to find the elements (values) where the data starts and stops. It's pretty easy, but let me know if you can't figure it out.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Answer 3

Giorgos Papakonstantinou on 31 Jul 2013

0
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value#answer_93253

Thank you for your answer Image Analyst. The data contain also negative values. I am not sure but I think that poses a problem when I normalize the data after the cumsum.

1 Comment
Show -1 older commentsHide -1 older comments

Tom Lane on 1 Aug 2013

It sounds like Image Analyst is talking about the cumsum of a vector that assigns probability 1/N to each of N points. However, you could take the 0.025*N and 0.975*N values from the sorted vector directly, converting the index to an integer as you see fit.

Sign in to comment.

Defining the 95% of data which are around the mean value

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

1 Comment
Show -1 older commentsHide -1 older comments

More Answers (2)

0 Comments
Show -2 older commentsHide -2 older comments

1 Comment
Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Community Treasure Hunt

Defining the 95% of data which are around the mean value

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

1 Comment Show -1 older commentsHide -1 older comments

More Answers (2)

0 Comments Show -2 older commentsHide -2 older comments

1 Comment Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

1 Comment
Show -1 older commentsHide -1 older comments

0 Comments
Show -2 older commentsHide -2 older comments

1 Comment
Show -1 older commentsHide -1 older comments