Analysis of variance (ANOVA) is a useful tool which helps the user to identify sources of variability from one or more potential sources, sometimes referred to as "treatments" or "factors." This method is widely used in industry to help identify the source of potential problems in the production process, and identify whether variation in measured output values is due to variability between various manufacturing processes, or within them. By varying the factors in a predetermined pattern and analyzing the output, one can use statistical techniques to make an accurate assessment as to the cause of variation in a manufacturing process.
This methodology represents a basic level of the field known as design of experiments (DOE), which can be a powerful tool for statistical analysis. However, DOE methodology is essentially a field unto itself, which by and large exceeds the scope of this text and the Weibull++ software, so we will concern ourselves with the ANOVA in its most basic form.
One-Way ANOVA
The one-way ANOVA is a method of analysis that requires multiple experiments or readings to be taken from a source that can take on two or more different inputs or settings. The one-way ANOVA performs a comparison of the means of a number of replications of experiments performed where a single input factor is varied at different settings or levels. The object of this comparison is to determine the proportion of the variability of the data that is due to the different treatment levels or factors as opposed to variability due to random error. The model deals with specific treatment levels and is involved with testing the null hypothesis
where
represents the level mean. Basically, rejection of the null hypothesis indicates that variation in the output is due to variation between the treatment levels, and not due to random error. If the null hypothesis is rejected, there is a difference in the output of the different levels at a significance
, and it remains to be determined between which treatment levels the actual differences lie.
Inputs
Required inputs to perform a one-way ANOVA are the number of levels being compared, a, and the number of replications at each level,
. Typically, the user is asked to provide a value for the risk factor,
, which represents the Type 1 error probability the user is willing to live with.
Once this information has been obtained, the actual results of the experiments need to be entered. Ideally, an input grid based on the values of a and
would facilitate the input of this information. The experiment result data will be denoted as,
where,
i = 1, 2, … a is the number of levels being tested.
j = 1, 2, …
is the number of replicates at each level.
The resulting input grid would appear like the following:

Note that the number of replications at each level need not be equal, or that
need not be equal to
, and so forth.
The total number of data points for a given data set will be calculated as,
It should be noted that these N data points should be collected in a random order to insure that the effects of unknown "nuisance variables" will not affect the results of the experiment. A common example of this is a warm-up effect of production machinery, in which the output will vary as the machine accumulates more usage time from initial start-up.
Data Analysis
In ANOVA analysis, the output of each experiment, or observation, is thought to consist of variations of an overall mean value. These variations can have two sources: variation due to the factor or level and variation due to random error. The model used for the data in ANOVA analysis follows the form,
where,
= the overall mean,
= the level effect, and
= the random error component.
Since the purpose of this analysis is to determine if there is a significant difference in the effects of the factors or levels, the null hypothesis can also be written as
.
The sum of the responses over a level is denoted as,
while the level mean is denoted as,
The grand sum of all responses is denoted as,
while the overall mean of the data is,
The analysis is broken down into "sums of squares" that measure the variability due to the levels and due to the errors. The general form is,
where,
= the total sum of squares,
= the sum of squares due to the levels, and
= the sum of squares due to the errors.
The equation for the total sum of squares, which is a measure of the overall variability of the data, is,
(18)

The equation for the sum of squares for the levels, which measures the variability due to the levels or factors, is,
(19)

With
and
known,
can be calculated by subtracting
from
, or
=
-
. The
term measures the variability of the data due to random error.
There are degrees of freedom terms associated with each of the sums of squares. The degrees of freedom are given by,
Mean square values are calculated by dividing the sum of square terms for the level and error by their respective degrees of freedom values. These values represent the variance of the level and error components of the data. Mean squares values for levels and errors are,
Cochran's theory [25] states that the ratio,
is F-distributed with degrees of freedom a - 1 and N - a. Therefore, if,
where,
(20)
![]()
we reject the null hypothesis and conclude that some of the variability of the data is due to differences in the treatment levels.
Output
The general format for output for this type of analysis is an ANOVA table, which contains basic information about the analysis:

ANOVA Example
An insulation manufacturer is investigating variations in the thickness of one of its products. It is thought that the speed at which the manufacturing machinery is operated has an effect on the thickness of the insulation. An experiment was run where the machine was run at five different speeds, and a number of samples were taken at each speed. It is desired to know if the difference in thicknesses is due to the manufacturing speeds, at a 10% significance level. In order to avoid biasing the data, the samples were manufactured in a random order. The results of the experimentation appear in the following table:

First, the total number of data points is calculated as,
Further calculations can be aided by the construction of the following table:

The total sum of squares is calculated by,

The sum of squares for the factors or levels is calculated by,

The sum of squares for the error is then calculated by,
![]()
The mean square values are then calculated for the level and error components,

Finally, the value of
is calculated by,

This value must now be compared to a value of
. This value is calculated by,
![]()
Since
>
, we must reject the null hypothesis and conclude that there is a difference in the level treatments at a significance of
= 10% (or more).
Want to learn more? Attend ReliaSoft's "Master the Subject, Master the Tools" training seminars.
See Also:
Additional Reliability Analysis Tools
©1996-2000. ReliaSoft Corporation. ALL RIGHTS RESERVED.