Six Sigma gains it's edge over other Quality Management System as it uses data driven approach for problem solving. Statistics forms an integral part of Six Sigma methodology as many of it's tools refers to statistics for logical conclusions.
We essentially have two branches in Statistics - Descriptive and Inferential.
Descriptive Statistics helps work on collecting, analyzing and presenting information as mean, standard deviation, variation, percentage, proportion etc. While Descriptive Statistics helps with description of data, it will not manifest itself with any inferences.
Inferences about data is very important for decision making and it is Inferential Statistics which helps us with the same.
To answer above question on the approach for decision making using few samples, it is Inferential Statistics that helps us analyze sample data and predict the behavior of population.
Further, Inferential statistics helps us establish the relationship between independent variables (X, Cause) and the outcome (Y, Effect) and also help identify the critical X which needs to be focused to improve the Y.
Inferential Statistics is strongly associated with Hypothesis testing. Hypothesis testing is performed on Sample and whenever we do a Hypothesis testing, we ask below questions on whatever we saw in the sample
Is It True?
Is it Common Cause?
Is it Pure Chance?
Let us see how to perform a Hypothesis testing which is key for Inferential Statistics.
Step 1. Define the Business Problem in a data driven format i.e. Y=f(X)
Step 2. Select and appropriate or apt Hypothesis Test that we need to perform on the problem. We will see this in detail in next section. What drives the selection of test is basis the type of data defining both X and Y i.e. if the data type is discrete or continuous.
Step 3. Make the Statistical Hypothesis Statement ; H0 = Null Hypothesis = No Change, No Impact or Difference; HA=Alternate Hypothesis = New argument holds good basis the business case.
Step 4. Run the test on Sample data using tools like Minitab
Step 5. Calculate the "P" value - which will be an output from the tool
Step 6. Compare "P" value with "alpha" [Alpha is called as Type I error and acceptable level is generally kept at 5% or 0.05]
Step 7. Do Statistical conclusion i.e. if P is greater than alpha, your Null Hypothesis holds good else your alternate hypothesis will hold good.
Step 8. Do a Business Inference i.e. if Null Hypothesis holds good than the input sample is treated as non-critical x. Alternatively, if your alternate hypothesis holds good, we should treat the input as critical x.
W.r.t Step 2, on selecting the apt test, below inputs should serve as guiding pointers
Output Y is Discrete and Input X is Discrete in 2 categories, we need to use 2 proportion test
Output Y is Discrete and Input X is Discrete in multiple categories, we need to use Chi-square test
Output Y is Continuous and Input X is Discrete in 2 categories, we need to use 2-sample t-test
Output Y is Continuous and Input X is Discrete in more than 2 categories, we need to use ANOVA
Output Y is Continuous and Input X is Continuous we need to use Regression Analysis.
In summary, Inferential Statistics is used draw conclusions on the larger population by taking a sample from the same and also try to establish relationship between the input and output.