Course Content
Measure
Collect data to establish baselines, understand current performance, and quantify the problem. For example, measuring the average turnaround time for policy renewals.
0/4
Improve
Develop and implement solutions to address root causes. For example, streamlining workflows or introducing new digital tools to reduce manual errors.
0/4
Control
Put controls in place to sustain improvements, such as regular monitoring, updated procedures, or dashboards for ongoing performance tracking.
0/4
Six Sigma DMAIC (Define, Measure, Analyze, Improve, Control)

What is Hypothesis Testing

Hypothesis Testing allows Six Sigma team to draw conclusions about the population based on statistical analysis performed on a sample. Because the conclusions are based on samples and not the entire population, there is always some risk of error.

Using the information about the sample, statistically the p-value of the data collected can be calculated. p-value is the mean to tell if the assumption or conclusion drawn on the sample is correct or not.

 

Normal Probability Distribution

Normal Distribution, also called Gaussian distribution, is probably the most important distribution related to continuous data from statistical analysis standpoint.

Since many natural process outcomes are normally distributed, Six Sigma relies on this curve and the Empirical Rule (68-95-99.7%) to assess how well a process meets specifications and whether a sample size is good enough to be used for analysis.

In a perfect normal distribution, 68.26% of all data points fall within plus or minus 1 standard deviation from the mean; 95.46% of the data points fall within plus or minus 2 standard deviation from the mean; and 99.73% of the data points fall within plus or minus 3 standard deviation from the mean.

Testing whether data is normal is critical in statistical analysis because the results of tests conducted can be invalid if the accuracy of the data is not accounted for.

To determine if the sample data used is normal, advanced statistics are required. Microsoft Excel and other programs can be used to perform the calculation before the data is to conduct required analysis.

 

How to use Excel to check if sample data is normal

1. Create a Histogram using the sample data. Using the example of Time taken for Adults to complete the 100m Swim, the following Histogram is created.

Six Sigma Histogram in Excel

 

2. The Descriptive Statistical information of the data set is created as followed

Mean155.4117647
Standard Error14.14578996
Median134
Mode#N/A
Standard Deviation58.32458618
Sample Variance3401.757353
Kurtosis0.815962627
Skewness0.990489892
Range220
Minimum80
Maximum300
Sum2642
Count17

 

3. The Expected Distribution and the Observed Distribution are calculated as followed

Bin RangeCDF to leftBin OnlyExpected NumberObserved(Exp – Obs)2Divided by Exp
700.071539440.071539441.21617047401.479070621.216170474
800.0980112380.0264717980.4500205710.302477370.672141216
900.1310348420.0330236040.56140127310.192368840.342658366
1000.1710412820.040006440.68010948210.102329940.150460986
1100.2181064680.0470651850.80010814810.039956750.049939189
1200.271875730.0537692620.91407745610.007382680.00807665
1300.3315288050.0596530751.01410227433.943789783.888946783
1400.3957969930.0642681881.092559210.008567210.007841411
1500.463036380.0672393881.14306958810.020468910.017906965
1600.5313513560.0683149751.16135457901.348744461.161354579
1700.598753320.0674019651.14583340210.021267380.018560622
1800.6633326710.0645793511.09784895901.205272341.097848959
1900.7234194960.0600868251.02147603310.000461220.000451523
2000.777710680.0542911840.92295012321.160036441.256878794
2100.8253476160.0476369360.80982790410.036165430.044658162
2200.8659379630.0405903470.69003589900.476149540.690035899
2300.899524570.0335866080.57097232800.32600940.570972328
2400.9265127710.02698820.45879940610.292898080.63840118
2500.9475721760.0210594050.3580098900.128171080.35800989
2600.963530330.0159581540.27128861500.073597510.271288615
2700.9752734510.0117431210.19963305700.039853360.199633057
2800.9836651270.0083916770.14265850200.020351450.142658502
2900.9894885490.0058234210.09899816500.009800640.098998165
3000.9934129370.0039243880.06671459910.8710216413.05593757

 

4. Using the result above, the p-Value can be calculated using the Chi-Squared formula in Excel, and it is derived as 0.207984876. And it is good since the p-Value is greater than 0.05

Total x225.95982989
Degree of Freedom21
CHISQ.DIST0.207984876