Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.
Message added by Mayank Gupta,

Standard Deviation is another measure of the spread of data. It is derived from the distance of each point in the sample from the sample mean.

 

Overall Standard Deviation is calculated by considering all the data points (or population data). It is expected that both common cause and special cause variations will be present in overall standard deviation.

 

Within Standard Deviation is calculated by considering the rational sub-groups (or sample data). It is expected that only common cause variation will be present in within standard deviation.

 

An application-oriented question on the topic along with responses can be seen below. There is no best answer to this question, however do review the comments mentioned by Mr Mayank Gupta, Principal Consultant at Benchmark Six Sigma.

 

Applause for all the respondents - Ashutosh Bhardwaj, Keerthi Vasan, D. Nandakumar, Arvind Swarup.

Featured Replies

Q 604What is the difference between overall standard deviation and within standard deviation? Could there be a scenario where within standard deviation is greater than the overall standard deviation? Provide examples to support your answer.

 

Note for website visitors -

Standard deviation represents the typical (average) distance from the central location we expect to observe. Its units are exactly the same as our original measurement.  Smaller the standard deviation, the better it is. Type of standard deviations are:

 

1.       Overall Standard deviation

Standard deviation is computed for all data points of long period like entire month. In this case, it represents actual variation of the process that the customer experiences over the time.

 

2.       Within Standard Deviation

It is an estimate of variation within the subgroup. This represents inherent variation of the process over a short period of time. This shows potential variation of the process if shifts and drifts between subgroups were eliminated.

 

“Below reference picture will articulate more visualization in terms of short-term samples and long-term samples."

image.png.550c4f35351292feb46698738c47317c.png

 

 

Could there be a scenario where within standard deviation is greater than the overall standard deviation? 

Practically, Overall standard deviation should always be greater than or equal to the calculated within-standard deviation because standard deviation (overall) is affected by various causes for examples like “Equipment breakdown”, “Environment Effect”, “Unskilled operator deployment on machines’, “Difference between raw materials” and other’s special causes in long period of data. On the other side, standard deviation (within) is purely random.it is what we use to compare the inherent capability of different processes to meet a specified goal. 

Below graphical picture itself portrays the cause of higher overall standard deviation than within standard deviation with time extent.

 

image.png.1f0911b94d47c2a48e41bd1a2b3d79ff.png

 

Overall standard deviation is the standard deviation of all measurements and it is an estimate of overall process variation while within standard deviation is an estimate of variation within all sub groups. 

 

image.png.7f2dffe23edf400ef7ab4b1680c366de.png

 

There can be cases where within standard deviation is higher than overall standard deviation. It indicates greater  variation within sub groups and that the process is not stable. It can also mean that our process can have other sources of variations in addition to variation with sub groups.

 

Sl. No

Overall standard deviation

Within Standard deviation

1

Considers all the readings from the measurements

 

It’s an estimate of the variation within the Subgroups

2

Also known as Inherent process variation

 

Also known as within-subgroup variation

3

Captures all sources of systematic variation

Captures nature and inherent variation of the process over a shorter period

4

Represents actual variation of the process

 

Represents the potential variation of the process

4

Represents using indices Pp and Ppk

 

Represents using Cp and Cpk

5

Standard Deviation Calculation-Population

 

image.png

 

Standard deviation calculation - Sample

 

image.png

 

 

Within Standard deviation will be higher, if the process capability is not stable i.e., lesser than one and the sample collected as subgroups are out of spec. But the overall variation in the process is somewhat stable and process needs to be centered.

 

image.png

image.png

The Standard Deviation (overall) would take all data points of the entire month and calculate one number as an output. It is calculated by considering all data points in the dataset, without regard to any specific subgroups or categories. The Standard Deviation (within) will calculate the standard deviation of each subgroup (the samples collected on the same day) and compare them with each other. Also, "Within" standard deviation is the standard deviation calculated without including 'special cause' variation in the calculation. This provides a significantly lower value in this case.

Now, let's consider whether there could be a scenario where within standard deviation is greater than the overall standard deviation. Yes, this is possible, and it typically occurs when there is significant variation between the subgroups or categories in the data.
Example: Let's say you are monitoring the daily production of a manufacturing plant over a month, and you collect data on the number of defective products produced each day. You have two production lines (Line A and Line in your plant.

•    Overall Standard Deviation: If you calculate the overall standard deviation for the entire month's data (combining data from both lines), it will give you a measure of the total variability in the production process.
•    Within Standard Deviation: However, if you calculate the within standard deviation separately for Line A and Line B, you might find that Line B has a much higher within standard deviation than Line A. This means that Line B has more variability or inconsistency in its daily production compared to Line A.

In this scenario, the within standard deviation for Line B is greater than the overall standard deviation for the entire plant because Line B's production process is less stable and exhibits more variation within its daily output.

To be honest, I am a little disappointed that there are no winners for this question. This is such a thought provoking question. While the published answers are correct, there is no winner to this question. 

 

Some scenarios where overall standard deviation can be less than within standard deviation

 

1. The rational sub-grouping has not been done correctly

2. Working with the old concept of long term vs short term and hence inducing a difference in the time period for data collection

3. Data does not follow normal distribution

Create an account or sign in to comment

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.