What are the standard statistical tests to see if data follows exponential or normal distributions?

Answer

It seems that you’re trying to decide whether to model your data using the normal or the exponential distribution. This seems somewhat strange to me, as these distributions are very different from each other.

The normal distribution is symmetric, whereas the exponential distribution is heavily skewed to the right and has no negative values. Typically a sample from the exponential distribution will contain many observations relatively close to 0 and a few observations that lie far to the right of 0. This difference is often easy to see graphically.

Here is an example where I’ve simulated n=100 observations each from a normal distribution with mean 2 and variance 4 and from an exponential distribution with mean 2 and variance 4:

[Figure: Normal vs exponential: simulated data]

The symmetry of the normal distribution and the skewness of the exponential can be seen using histograms, boxplots and scatterplots, as illustrated in the figure above.
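A simulation like the one described above can be sketched in a few lines of Python using only the standard library. The seed, the variable names and the `skewness` helper are illustrative assumptions, not part of the original answer; the point is only that the two samples share a mean and variance while differing sharply in skewness and in their smallest values.

```python
import random
import statistics

random.seed(1)  # arbitrary seed, used only to make the run repeatable
n = 100

# Normal distribution with mean 2 and variance 4 (standard deviation 2).
normal_sample = [random.gauss(2, 2) for _ in range(n)]
# Exponential distribution with rate 1/2, i.e. mean 2 and variance 4.
expo_sample = [random.expovariate(0.5) for _ in range(n)]

def skewness(xs):
    """Sample skewness: mean cubed deviation divided by the cubed (population) SD."""
    m = statistics.fmean(xs)
    s = statistics.pstdev(xs)
    return sum((x - m) ** 3 for x in xs) / (len(xs) * s ** 3)

for name, xs in [("normal", normal_sample), ("exponential", expo_sample)]:
    print(name,
          "min:", round(min(xs), 2),
          "median:", round(statistics.median(xs), 2),
          "mean:", round(statistics.fmean(xs), 2),
          "skewness:", round(skewness(xs), 2))
```

For the exponential sample the minimum cannot be negative and the skewness should be clearly positive, while for the normal sample the skewness should be close to 0.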

Another very useful tool is a Q-Q-plot. In the example below, the points should approximately follow the line if the sample comes from a normal distribution. As you can see, this is the case for the normal data, but not for the exponential data.

[Figure: Q-Q-plots for simulated data]
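The coordinates behind a normal Q-Q-plot are easy to compute by hand, which may help to see what the plot shows. The sketch below pairs the sorted observations with standard normal quantiles and summarises how closely the points follow a straight line by their correlation. The plotting positions (i - 0.5)/n are one common convention among several, and the function names and the seed are assumptions made for this illustration.

```python
import math
import random
import statistics

def normal_qq_points(xs):
    """Return (theoretical normal quantiles, sorted observations)."""
    n = len(xs)
    std_normal = statistics.NormalDist()  # mean 0, standard deviation 1
    # Plotting positions (i - 0.5) / n; other conventions exist.
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return theoretical, sorted(xs)

def qq_correlation(xs):
    """Pearson correlation between the two coordinates of the Q-Q points."""
    t, o = normal_qq_points(xs)
    mt, mo = statistics.fmean(t), statistics.fmean(o)
    cov = sum((a - mt) * (b - mo) for a, b in zip(t, o))
    var_t = sum((a - mt) ** 2 for a in t)
    var_o = sum((b - mo) ** 2 for b in o)
    return cov / math.sqrt(var_t * var_o)

random.seed(1)  # arbitrary seed
normal_sample = [random.gauss(2, 2) for _ in range(100)]
expo_sample = [random.expovariate(0.5) for _ in range(100)]

print("normal sample:     ", round(qq_correlation(normal_sample), 3))
print("exponential sample:", round(qq_correlation(expo_sample), 3))
```

A correlation very close to 1 means the points lie nearly on a line; the curved pattern of exponential data shows up as a visibly lower value. This is only a numerical summary of the plot, not a replacement for looking at it.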

If graphical examination for some reason isn’t enough for you, you can still use a test to determine whether your distribution is normal or exponential. Since the normal distributions form a location-scale family, you’ll want to use a test that is invariant under changes in scale and location (i.e. the result of the test should not change if you change your measurements from inches to centimetres or add 1 to all your observations).

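When the null hypothesis is that the distribution is normal and the alternative hypothesis is that it is exponential, the most powerful location and scale invariant test is given by the statistic
$$T_{E,N} = \frac{\bar{x} - x_{(1)}}{s}$$
where $\bar{x}$ is the sample mean, $x_{(1)}$ is the smallest observation in the sample and $s$ is the sample standard deviation. Normality is rejected in favour of exponentiality if $T_{E,N}$ is too small: in an exponential sample the smallest observation lies only about one standard deviation below the mean, whereas in a normal sample of moderate size it typically lies two or more standard deviations below it.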
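To make the statistic and its invariance concrete, here is a minimal sketch that computes (mean - minimum) / standard deviation and checks that the value does not change when the data are rescaled and shifted. The function name `t_en` and the data values are made up for the illustration, and the standard deviation is assumed to be the usual n - 1 version.

```python
import statistics

def t_en(xs):
    """(sample mean - smallest observation) / sample standard deviation."""
    # Assumption: s uses the n - 1 denominator, as statistics.stdev does.
    return (statistics.fmean(xs) - min(xs)) / statistics.stdev(xs)

data_inches = [0.3, 1.1, 0.2, 4.7, 0.9, 2.5, 0.1, 1.8]  # made-up values
# Same data converted to centimetres, with 1 added to every observation.
data_transformed = [2.54 * x + 1 for x in data_inches]

print(t_en(data_inches))
print(t_en(data_transformed))  # agrees with the line above up to rounding error
```

The shift cancels in the numerator and the scale factor cancels between numerator and denominator, which is why the two printed values agree.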

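The statistic is the same one used in the one-sided version of Grubbs’ test for outliers (the version for the smallest observation). You’ll find this implemented in most statistical software, but make sure that you use the right version, since several alternative test statistics are used for the outlier test, and check which tail of the distribution the reported p-value refers to.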
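If no suitable implementation is at hand, a p-value can be approximated by simulation. Because the statistic is location and scale invariant, its distribution under normality depends only on the sample size, so simulating standard normal samples is enough. The sketch below is an assumption-laden illustration rather than the method of any particular package: the function names, the number of simulations and the data are made up. Note the tail it uses: in an exponential sample the minimum sits only about one standard deviation below the mean, so the statistic is small compared with what normal data produce, and the code therefore counts simulated values at or below the observed one.

```python
import random
import statistics

def t_en(xs):
    """(sample mean - smallest observation) / sample standard deviation."""
    return (statistics.fmean(xs) - min(xs)) / statistics.stdev(xs)

def monte_carlo_p_value(xs, n_sim=2000, seed=0):
    """Approximate lower-tail p-value of t_en under the normal null hypothesis."""
    rng = random.Random(seed)  # local generator, so the global state is untouched
    observed = t_en(xs)
    n = len(xs)
    at_or_below = 0
    for _ in range(n_sim):
        # Invariance means any normal distribution gives the same null distribution.
        simulated = [rng.gauss(0, 1) for _ in range(n)]
        if t_en(simulated) <= observed:
            at_or_below += 1
    # Adding 1 to both counts keeps the estimate strictly positive.
    return (at_or_below + 1) / (n_sim + 1)

# Made-up, right-skewed data for illustration only.
data = [0.1, 0.2, 0.2, 0.4, 0.5, 0.7, 0.8, 1.0, 1.2, 1.5,
        1.7, 2.0, 2.4, 2.9, 3.5, 4.3, 5.6, 7.9]
print("statistic:", round(t_en(data), 3))
print("approximate p-value:", monte_carlo_p_value(data))
```

A small p-value here means the smallest observation is closer to the mean, in standard deviation units, than would be expected for normal data of the same size.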

Reference for $T_{E,N}$ being the most powerful test: Section 4.2.4 of Testing for Normality by H.C. Thode.

Attribution
Source : Link , Question Author : smo , Answer Author : MånsT
