Newsletter archives
Newsletter Signup: Text HTML 
 

How to Compare Data Sets - ANOVA

 
 
By Daniel Sloan

In 1920, Sir Ronald A. Fisher invented a statistical way to compare data sets. Fisher called his method the Analysis of Variance, which was later dubbed an ANOVA. This method eventually evolved into Six Sigma data set comparisons. An ANOVA is a guide for determining whether or not an event was most likely due to the random chance of natural variation. Or, conversely, the same method provides guidance in saying with a 95% level of confidence that a certain factor (X) or factors (X, Y, and/or Z) were the more likely reason for the event.

The F ratio is the probability information produced by an ANOVA. It was named for Fisher. The orthogonal array and the Results Project, DMAIC designed experiment's cube were also his inventions.

An ANOVA can be, and ought to be, used to evaluate differences between data sets. It can be used with any number of data sets, recorded from any process. The data sets need not be equal in size. Data sets suitable for an ANOVA can be as small as three or four numbers, to infinitely large sets of numbers.

How to Complete an Excel ANOVA
The difficulty of calculating ANOVAs by hand prevented most people from using this Six Sigma tool until the 1990's. Now, using software like Microsoft Excel, anyone and everyone can quickly determine whether differences in a set of counts or measurements were most likely due to chance variation. Or, can we say it should be more likely attributed to a "combination of factors." These variables are often labeled factor X, Y, or Z.

Here is how you could use an Excel ANOVA to determine who is a better bowler. You could and can use an ANOVA to compare any scores. Lengths of stay, days in AR, the number of phone calls, readmission rates, stock prices and any other measure are all fair game for an ANOVA. Below are six game scores for three bowlers. Which bowler is best? If there is a best bowler, is the difference between bowlers statistically significant?

Anova Compare Data Sets - Data

Step 1. Recreate the columns using Excel. Each bowler's name is the field title.

Step 2. Go to Tools and select Data Analysis as shown. If Data Analysis does not appear as the last choice on the list in your computer, you must click Add-Ins and click the Analysis ToolPak options.

Compare Data Sets - ANOVA Excel


Step 3. Click OK to the first choice, ANOVA: Single Factor.

ANOVA - Compare Data Sets


Step 4. Click and drag your mouse from Pat's name to the last score in Sheri's column. This automatically completes the Input Range for you:$F$1:$H$7. Click the box labeled "Labels in First Row." Click Output Range. Then either type in an empty cell location, or mouse click an empty cell, $I$8, as illustrated by the dotted cell below. Click OK.

ANOVA - Compare Data Sets


Step 5. Interpret the probability results by evaluating the F ratio. If the F ratio is larger than the F critical value, F crit, there is a statistically significant difference. If it is smaller than the F crit value, the score differences are best explained by chance.

Compare Data Sets - ANOVA


The F ratio 12.57 is larger than the F crit value 3.68. Mark is a better bowler. The difference between him and the other two bowlers is statistically significant. Excel automatically calculated the average, the variance - which is the standard deviation, s, squared - and the essential probability information instantly. You can use this technique to compare physicians, nurses, hospital lengths of stay, revenue, expense, supply cost, days in accounts receivable, or any other factor of interest.




The Graphic ANOVA
Excel takes care of the first three Six Sigma rules for completing an analysis. Unfortunately, it does not create a meaningful, analytic graph. As mentioned earlier, most Excel graphs are descriptive, rather than analytic.

As you advance in your Six Sigma learning you may want to learn to use a more advanced Six Sigma software program. One such program is called Stat-Ease Design-Expert. Stat-Ease calculates an ANOVA and graphically shows statistical differences between sets of data. It is all achieved with mouse clicks. You won't have to look at, or calculate an equation.

Graphical ANOVA - Comparing Data Sets.

Each I-Bar has a black square in its center. This square identifies the average score for each bowler. The top and bottom of each I-Bar extends two standard deviations, 2s, above and below the mean. Think of these as the Upper Control Limit (UCL) and Lower Control Limit (LCL) for each bowler's score. Each I-Bar covers 95% of an imaginary, on-its-side bell curve for each bowler.

This Six Sigma data array of fields and records would tell us a little about each observation. The more fields, the richer our understanding can be. For example, in the same amount of space the following table has twice as much data. Rich data, meaning each column/field has a crystal clear operational definition, can yield rich information. Many times we collect dozens of fields for each recorded observation. Since data collection is time consuming and expensive, design your collection plan with care before you begin.

Note the overlapping values in red circle markers. On occasion both Pat and Sheri could have bowled a better game than Mark. But, when the data are viewed using the mean value (x-bar), the standard deviation (s), the probability information provided by the F ratio, and a meaningful analytic graph, Mark is obviously a better bowler. In fact, as we mentioned before, he is a better bowler in a statistically significant way.

About The Author
Daniel Sloan has provided senior executive leadership, project management, seminar leadership, education, Six Sigma training, and consultant services to manufacturing companies, software corporations, computer network companies, health care corporations, aerospace, insurance, and governmental agencies in 38 of the United States, Australia, Uruguay, Mexico, and Brazil. Mr. Sloan has provided consultant services to a diverse group of other organizations including: The Washington State Department of Health Facilities, Services, and Licensing Division, City University, the University of Pittsburgh, and the University of Washington Business School.

Sloan Consulting has a 12-year track record of regional, national, and international consulting in productivity and Six Sigma breakthrough improvements. The firm's reputation rests on a long track record of producing substantial, bottom line financial results. Mr. Sloan can be reached via email at daniel@danielsloan.com.

 

Best Selling Products

  1. Six Sigma Black Belt (DMAIC) Training Slides - 2009 Version!
    The 2009 Six Sigma Black Belt course includes over 40 more slides than the 2008 version. Contents include: 1,220 PowerPo...
  2. Certified Lean Six Sigma Green Belt Assessment Exam
    This assessment exam is useful for students interested in assessing their knowledge of Lean Six Sigma on the Green Belt ...
  3. Certified Lean Six Sigma Black Belt Assessment Exam
    Interested in assessing your knowledge of Lean Six Sigma? Preparing for certifications? Testing your students and traine...
  4. Six Sigma Green Belt Training Slides - 2009 Version
    The 2009 Six Sigma Green Belt course is comprised of: 1122 slides (over 70 more slides than the 2008 version)Instructor...
  5. Certified Lean Six Sigma Black Belt E-book
    In 670 pages learn everything within the Lean Six Sigma DMAIC body of knowledge to successfully achieve Black Belt certi...
  6. The Quality Group's Lean Six Sigma Black Belt E-learning
    This D-M-A-I-C structured blended learning course combines interactive e-learning instruction with virtual online cla...
  7. Design For Six Sigma (DFSS) E-Book or Print
    Need an "encyclopedia" consisting of many of the tools you’ll study? Need a helpful refresher to apply the DFSS process?...

Premium Sponsor:

Sponsors:















Sponsor iSixSigma Military



 

About iSixSigma Military

The purpose of this iSixSigma Military channel is to document the transformation of the United States Armed Services through the use of Lean Six Sigma and related process improvement methodologies.

Ronald E. Rezek, special assistant to the acting secretary of the Army, has said the goal of the Army's Lean Six Sigma deployment is to "make the business side of the Army as efficient as the war-fighting side is effective." Leaders of the other armed services echo that sentiment and transformation objective.

This portal will serve as a central community for everyone associated with the business transformation of the U.S. military. It will provide communication updates on deployments, the opportunity for military leaders at all levels to learn new skills, advance their careers and contribute to the success of their organizations.