In some scientific works, once the data have been gathered from a population of interest, it is often difficult to get a sense of what the data indicate when they are presented in an unorganized fashion.
Assembling the raw data into a meaningful form, such as a frequency distribution, makes the data easier to understand and interpret. It is in the context of frequency distributions that the importance of conveying in a succinct way numerical information contained in the data is encountered.
So, grouped data is data that has been organized into groups known as classes. The raw dataset can be organized by constructing a table showing the frequency distribution of the variable (whose values are given in the raw dataset). Such a frequency table is often referred to as grouped data.
Here, we developed a mcode to calculate the variance of a grouped data. One can input the returns or modified vectors n and xout containing the frequency counts and the bin locations of the hist mfunction, in a column form matrix.
Normalizes Y by (N1), where N is the sample size. This is an unbiased estimator of the variance of the population from which X is drawn. Y = VAR(X,1) normalizes by N and produces the second moment of the sample about its mean. VAR(X,0) is the same as VAR(X).
Variance calculation uses the formula,
V = Sum(F*(MC  M)^2)/D
where:
F = class frequency
MC = mark class
M = grouped mean
D = N  1 or N, whether normalizes by n1 or by n
N = sample size [=sum(F)]
Syntax: function y = gvar(x)
Inputs:
x  data matrix (Size of matrix must be nby2; absolut frequency=column 1, class mark=column 2)
n  [normalized by n1] = 0 (default), [normalized by n] = 1
Outputs:
y  variance of the values in x
