<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
  <channel>
    <link>http://www.mathworks.com/matlabcentral/newsreader/view_thread/173870</link>
    <title>MATLAB Central Newsreader - Histogram and Normality test</title>
    <description>Feed for thread: Histogram and Normality test</description>
    <language>en-us</language>
    <copyright>&amp;copy;1994-2012 by MathWorks, Inc.</copyright>
    <webmaster>webmaster@mathworks.com</webmaster>
    <generator>MATLAB Central Newsreader</generator>
    <docs>http://blogs.law.harvard.edu/tech/rss</docs>
    <ttl>60</ttl>
    <image>
      <title>MathWorks</title>
      <url>http://www.mathworks.com/images/membrane_icon.gif</url>
    </image>
    <item>
      <pubDate>Wed, 06 Aug 2008 17:18:02 -0400</pubDate>
      <title>Histogram and Normality test</title>
      <link>http://www.mathworks.com/matlabcentral/newsreader/view_thread/173870#447666</link>
      <author>Mastaneh </author>
      <description>Dear all,&lt;br&gt;
&lt;br&gt;
I have a 2^18-length data, sampled at 48 kHz with a 16-bit &lt;br&gt;
ADC. The histogram is very close to the normal &lt;br&gt;
distribution, but the data always fails the normality &lt;br&gt;
hypothesis tests. &lt;br&gt;
When plotting the histogram with 1000 bins, there are &lt;br&gt;
various spikes in the figure. I know reducing the number of &lt;br&gt;
bins help get a smoother curve, but am I correct in &lt;br&gt;
assuming that these spikes are the reason the tests fail? I &lt;br&gt;
mean, the test needs to average the spike amplitudes to get &lt;br&gt;
the estimated distribution, so the result doesn't have the &lt;br&gt;
same moments as the original sample. &lt;br&gt;
&lt;br&gt;
Thanks for any explanation,&lt;br&gt;
Mastaneh </description>
    </item>
    <item>
      <pubDate>Wed, 06 Aug 2008 23:41:02 -0400</pubDate>
      <title>Re: Histogram and Normality test</title>
      <link>http://www.mathworks.com/matlabcentral/newsreader/view_thread/173870#447733</link>
      <author>Paul </author>
      <description>&quot;Mastaneh &quot; &amp;lt;mtorkama@iupui.edu&amp;gt; wrote in message&lt;br&gt;
&amp;lt;g7cmca$a2c$1@fred.mathworks.com&amp;gt;...&lt;br&gt;
&amp;gt; Dear all,&lt;br&gt;
&amp;gt; &lt;br&gt;
&amp;gt; I have a 2^18-length data, sampled at 48 kHz with a 16-bit &lt;br&gt;
&amp;gt; ADC. The histogram is very close to the normal &lt;br&gt;
&amp;gt; distribution, but the data always fails the normality &lt;br&gt;
&amp;gt; hypothesis tests. &lt;br&gt;
&amp;gt; When plotting the histogram with 1000 bins, there are &lt;br&gt;
&amp;gt; various spikes in the figure. I know reducing the number of &lt;br&gt;
&amp;gt; bins help get a smoother curve, but am I correct in &lt;br&gt;
&amp;gt; assuming that these spikes are the reason the tests fail? I &lt;br&gt;
&amp;gt; mean, the test needs to average the spike amplitudes to get &lt;br&gt;
&amp;gt; the estimated distribution, so the result doesn't have the &lt;br&gt;
&amp;gt; same moments as the original sample. &lt;br&gt;
&lt;br&gt;
&lt;br&gt;
I would investigate the source of the spikes in the original&lt;br&gt;
data and not in the histogram bins.  There are some scripts&lt;br&gt;
to remove outliers and this may be all you need. &lt;br&gt;
&amp;gt; &lt;br&gt;
&amp;gt; Thanks for any explanation,&lt;br&gt;
&amp;gt; Mastaneh </description>
    </item>
    <item>
      <pubDate>Tue, 05 Jul 2011 15:45:24 -0400</pubDate>
      <title>Re: Histogram and Normality test</title>
      <link>http://www.mathworks.com/matlabcentral/newsreader/view_thread/173870#844160</link>
      <author>Sajjad Taghvaee</author>
      <description>&quot;Mastaneh &quot; &amp;lt;mtorkama@iupui.edu&amp;gt; wrote in message &amp;lt;g7cmca$a2c$1@fred.mathworks.com&amp;gt;...&lt;br&gt;
&amp;gt; Dear all,&lt;br&gt;
&amp;gt; &lt;br&gt;
&amp;gt; I have a 2^18-length data, sampled at 48 kHz with a 16-bit &lt;br&gt;
&amp;gt; ADC. The histogram is very close to the normal &lt;br&gt;
&amp;gt; distribution, but the data always fails the normality &lt;br&gt;
&amp;gt; hypothesis tests. &lt;br&gt;
&amp;gt; When plotting the histogram with 1000 bins, there are &lt;br&gt;
&amp;gt; various spikes in the figure. I know reducing the number of &lt;br&gt;
&amp;gt; bins help get a smoother curve, but am I correct in &lt;br&gt;
&amp;gt; assuming that these spikes are the reason the tests fail? I &lt;br&gt;
&amp;gt; mean, the test needs to average the spike amplitudes to get &lt;br&gt;
&amp;gt; the estimated distribution, so the result doesn't have the &lt;br&gt;
&amp;gt; same moments as the original sample. &lt;br&gt;
&amp;gt; &lt;br&gt;
&amp;gt; Thanks for any explanation,&lt;br&gt;
&amp;gt; Mastaneh &lt;br&gt;
&amp;nbsp;&lt;br&gt;
Testing the normality of data through the histogram shape would not be enough. You need to run more tests. Here is some of them you can apply using Matlab Statistics Toolbox:&lt;br&gt;
&lt;br&gt;
QQ-Plot (qqplot(x))&lt;br&gt;
Skewness and Kurtosis&lt;br&gt;
Kolmogorov-Smirnov Test(kstest)&lt;br&gt;
Chi Square Test&lt;br&gt;
Lilliefors test for goodness of fit (lilitest)&lt;br&gt;
Jarque&#8211;Bera test(jbtest)&lt;br&gt;
&lt;br&gt;
Sajjad</description>
    </item>
  </channel>
</rss>

