Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.
Message added by Mayank Gupta,

Distribution Identification is the activity of identifying the underlying distribution of a data set by generating probability plots and goodness-of-fits test values. It can also be used to determine if the transformation is effective to fit the data to a normal distribution.

 

An application-oriented question on the topic along with responses can be seen below. The best answer was provided by Kirpa Shanker Tiwari on 24th Mar 2023.

 

Congratulations to the winner!!

Distribution Identification

Featured Replies

Q 549. The statistical analysis tools we commonly use often assume data conforms to a Normal Distribution. Nevertheless, it is widely acknowledged that data might not adhere to a Normal Distribution. In these situations, it becomes essential to determine the distribution of the data before proceeding with any analysis. What methods can we use to identify the distribution of the data? Please explain one method with the help of an example.

 

Note for website visitors -

Solved by kirpa Shanker Tiwari

  • Solution

There are multiple method to check data distribution

1.       Histogram

Histogram shows data visualization of a variable using bars. It present frequency per value of dataset. If bell shape curve is there we can say data follows normal distribution.

 

In above figure first histogram looks normally distributed and second one looks uniformly distributed.

 

2.       Box plot

Box Plot is technique can be used to identify data type from samples. The Box Plot plots five variable: min, max, median, 1st & 3rd Quantiles. if at both sides of rectangle width is almost equal we can say data is normally distributed

In the figure first plot has outliers and belongs to non –normal data with outliers and second (bottom) graph can be considered normally distributed.

 

3.       QQ plot

QQ plots need more understanding, QQ means Quantile vs quantile, and here we are plotting theoretical vs actual quantiles for variables, this graph is better than box plot to visualize deviation from normal distribution. Uniform distribution has many observation in both extreme however normal plot follow a straight line.

 

4.       Kolmogorov Smirnov test

Kolmogorov Smirnov test calculate the difference between the empirical distribution and the theoretical distribution and defines the test statistic based on those differences. This test can evaluate all type of distribution.

 

5.       Lilliefors test

This test is based on the above test. Here it is accepted that the mean and variance of the population distribution are estimated rather than pre-specified by the user. if p value is > 0.05 we assume normal distribution else non normal.

 

6.       Shapiro Wilk test

This test is the wonderful test and highly used when testing for a normal distribution. It has been developed specifically for the normal distribution and it cannot be used for testing against other distributions. if p value is > 0.05 we assume normal distribution else non normal

7.       Anderson darling Method: This test also use widely along with Shapiro Wilk test and similarly used for detecting normal distribution. if p value is > 0.05 we assume normal distribution else non normal

 

Kirpa Shanker Tiwari has posted the best answer to the question. While these tests and graphs will help us identify the underlying distribution, there are other methods like goodness of fit tests that can also be used. Most of the statistical packages also have in built functionality for distribution identification.

Create an account or sign in to comment

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.