Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.
Message added by Mayank Gupta,

Normal Distribution - also known as the Gaussian distribution, is a continuous probability distribution that is symmetric about the mean, showing that data near the mean are more frequent in occurrence than data far from the mean. Normal distribution is graphically represented by a bell curve. It is also sometimes called as 'Natural Distribution' as it is found to be naturally occurring in many situations.

 

Logistic Distribution has a shape that is similar to that of the normal distribution however; the logistic distribution has longer tails. It is used as a growth curve to model binary responses and is commonly used in the fields of biostatistics and economics.

 

Student's T Distribution is a symmetric, bell-shaped distribution that is similar to the normal distribution, but with thicker tails. It is defined by its degrees of freedom, a value closely related to sample size. Therefore, a different t-distribution exists for every sample size. As the sample size increases, the probability density present in the tails decreases and the t-distribution approaches the normal distribution.

 

An application-oriented question on the topic along with responses can be seen below. The best answer was provided by Soji Sam on 2nd Sep 2022.

 

Applause for all the respondents - Hirak Raval, Rahul Arora, Piyush Jain, Kiran Kumar Gadhamsetty, Chandra Shekhar Chauhan, Soji Sam.

Featured Replies

Q 500. Student's T, Logistic, Binomial distributions all have a shape that resembles a bell curve or a Normal distribution. What is the criteria to decide NOT to work with a Normal distribution and instead work with one of these? Provide an example for each distribution. 

 

Note for website visitors - Two questions are asked every week on this platform. One on Tuesday and the other on Friday.

Solved by Soji Sam

Student's T Distribution- 

1. This distribution will have heavier tails  and it's midpoint value is most likely

2. Criteria- Degree of freedom are equal to or greater than 30. 

3. Application for ecometric data like exchange rates

 

Logistic Distribution:

1. This distribution will have slightly longer tails than normal distribution ,

2. Criteria- It is used when conditions and parameters are complex, It is used to describe growth

3. It is used to see growth of population over period of time, Analyse some critical chemical reactions etc.

 

Binomial Distribution:

1. Criteria-This distribution is used when for each trials there are 2 possible outcomes, trials are independent with same probability(trial to trial)

2.  It applies to describe no. of times an event occurs in a fixed number of trials

3.It is used in likelihood of success or failure in 10 flips of a coin 

 

Although Student’s T, Logistic & Binomial distributions have a shape that resembles a bell curve or a Normal distribution, however these distributions have their own set of criteria which differs from a Normal distribution. Let us try to understand each of these distributions:-

 
Student’s T Distribution :
 
The T distribution is used for performing inferential statistics on the population mean in scenarios where the population standard deviation is unknown & population being normally distributed. It is a series of distributions which depends on the sample size as every sample size has a different distribution. Compared to a Normal distribution, the T distribution has flatter in the middle & have more area in the tails. Also as the sample size becomes large it approaches a Normal distribution thus making it suitable for use in case of unknown population standard deviation, regardless of the sample size as opposed to Normal distribution where we have a known population standard deviation. 
 
1654020139_ScreenShot2022-08-31at7_36_09PM.png.a382218c668f35b618b65cd7896bd9c5.png
 
A typical application of this distribution is in instances where experimental studies are undertaken & we do not have a historical standard deviation about the population. Let us say a new branch of a mortgage loan company has opened & we have a few samples in order to validate the hypothesis that the lead time for mortgage settlement is achieving a certain hypothesized target value or not.
 
Logistic Distribution :
 
The Logistic distribution has wider tails than a normal distribution thus it provides better insight into the likelihood of extreme events. One of the most common application of this distribution is the Logistic Regression study where we predict the outcome of a discrete binary dependent variable on the basis of continuous independent variable(s) & because of the discrete nature of the output variable we cannot use normal distribution.
 
1798019509_ScreenShot2022-08-31at8_47_43PM.thumb.png.075d8486afb27baebcf5be3c2f712b36.png
 
An example can be whether a person will have a heart attack basis his blood pressure & blood sugar levels, so here we will predict the probability of having a heart attack & observe whether it is above a certain probability threshold (typically it is p=0.5), then heart attack will occur & if it is below that probability threshold then heart attack will not occur.
 
Binomial Distributions :
 
As opposed to Normal distribution which models continuous data, the Binomial distribution is used to model binary data such as a coin toss or a roll of a dice where each event is a discrete event & all the events are independent of each other. A binomial distribution curve consists of estimating the likelihood of occurrence of each independent event, thus we calculate the PDF(probability density function) as opposed to CDF(cumulative density function) which is the case in Normal distribution.
 
An example can be estimating the probability of getting a flu basis a known long term probability of having a flu. Binomial distribution will show the likelihood of a specific no. of times of getting a flu.
 
1250744182_ScreenShot2022-08-31at9_16_53PM.png.7e424e14ae38b468025258251df94ee4.png
 
 
 
 

Student’s t- distribution is occasionally used over the normal distribution. numerous introductory statistics and data wisdom courses give a explanation for the use of t- distributions along the lines of it being useful in situations where either the sample size is small and/ or the population’s standard divagation is unknown. While correct, the explanation remains abstracted through such an explanation and learners may achieve a more important understanding through a clear and simple visualization.

 

 The Normal Distribution

 

 The normal distribution, also occasionally appertained to as a bell wind, is one of the most constantly used distributions and frequently the starting point for learning about distributions in general due to its relative simplicity. Given a mean( μ) and standard divagation( σ), a normal distribution can be modeled with the following probability viscosity function

 

1.jpg.3e8ba5d1f645a0ad6f4190cae9df01f2.jpg

 

The probability viscosity function for a normal distribution

2.jpg.29aa01d2771b1e1bc0be51f846c03070.jpg

 

 

 Student’s T- Distribution

 The t- distribution is analogous to the normal distribution in numerous ways but doesn't assume knowledge of the population mean and standard divagation the way the normal distribution does. The probability viscosity function of the t- distribution is as follows, where Γ represents the gamma function and ν represents the degrees of freedom

 

3.jpg.70a6c4ceb4be5afde8660e11ea4b2064.jpg

 

 The probability viscosity function for Student’s t- distribution

 Importantly, the degrees of freedom, calculated as one lower than the sample size in utmost situations, has a large impact on the shape of the distribution at lower values. Let’s fantasize a t- distribution with a single degree of freedom.

4.jpg.96fe215060a7c13f6baccf5d688e07a1.jpg

 

 A Visual Comparison

 Now that we ’ve seen both the standard normal distribution and a t- distribution with a single degree of freedom, let’s plot them together to see how they compare.

 

5.jpg.c48531d2fcc24c81c6680de812b9cb54.jpg

 

Major Differences

 With only a single degree of freedom, the t- distribution is important flatter and has fatter tails than the standard normal distribution. The power of the t- distribution comes from its capability to acclimate for lower sample sizes(and thus less degrees of freedom) by effectively having a more conservative estimate of probability viscosity.

 At advanced degrees of freedom, the t- distribution approximates the normal distribution, making it useful at both small and large sample sizes. The vitality below shows a comparison between the t- distribution and the normal distribution at degrees of freedom ranging from 1 to 50.

SO, not only does Student’s t- distribution not bear information regarding the population mean and standard deviation (which are infrequently known in real world trials), but it also has increased inflexibility at colorful sample sizes. These parcels make it much more seductive to use over the normal distribution in utmost cases.

 

Logistic vs Normal Distribution

The logistic and normal distributions have a relatively analogous shape. still, the logistic distribution has heavier tails, which frequently increases the robustness of analyses grounded on it compared with using the normal distribution.

 

 The distribution has operations in trustability and survival analysis. The accretive distribution function has been used for modelling growth functions and as a forbearance distribution in the analysis of double data, leading to the extensively used logit model.

The logistic distribution has been used for growth models and is used in a certain type of regression known as the logistic regression. It has also operations in modeling life data. The shape of the logistic distribution and the normal distribution are veritably analogous. There are some who argue that the logistic distribution is not a good fit for modeling continuance data because the left- hand limit of the distribution extends to negative perpetuity. This could possibly affect in modeling negative times- to- failure. still, handed that the distribution in question has a fairly high mean and a fairly small position parameter, the issue of negative failure times shouldn't present itself as a problem.

 

IMG_20220901_172006.thumb.jpg.6ce59fe8c1ef9574e9a63064f316425f.jpg

 

Difference between Binomial vs Normal Distribution

 

1) The main difference between the binomial and normal distributions is that the binomial distribution is a separate distribution whereas the normal distribution is a nonstop distribution.

This means that a binomial arbitrary variable can only take integer values similar as 1, 2, 3,etc. whereas the normal variable can take any real number value similar as1.2 or2.314,etc.
2) The alternate difference between them is that a binomial arbitrary variable has a finite range whereas the normal distribution has an horizonless range.
A binomial arbitrary variable can only take finitely numerous values 1, 2,., n. On the other hand, a normal arbitrary variable can take any value between minus perpetuity to plus perpetuity, and thus its range is unbounded.

 

3) The binomial distribution is limited in its operations. It's only used in situations where a trial can have only two possible issues – success or failure. For illustration, when tossing a coin numerous times we use the binomial distribution to calculate chances (since tossing a coin has only two issues – heads or tails).

On the other hand, the normal distribution finds numerous operations in real- life situations similar as modelling the height or weight distribution of a population. The normal distribution can in fact be used to calculate chances for binomial distribution using the system of the normal approximation to binomial.

 

IMG_20220901_172134.thumb.jpg.d54ad39c6f8a368f9b562cc55e0ed925.jpg

Student's T is a continuous probability distribution that has longer tails when compared to a normal distribution. The probability of finding larger values is higher in a T distribution when compared to a normal distribution. Whenever the population standard deviation is unknown or whenever the sample size is less than 30 or in case of both scenarios, usage of Student's T distribution is recommended.

 

Logistic distribution is another continuous probability distribution that has not so longer tails as that of a Student's T distribution but has slightly longer tails than a normal distribution. This distribution is highly used to model growth of a certain species, spread of an epidemic, etc.

 

Binomial distribution is a discrete probability distribution as against normal distribution which is a continuous probability distribution. There are only 2 possible results & binomial distribution gives the probability of success.

 

Here, I have tried to differentiate about Normal distribution and others distribution types which could be used instead of Normal distribution. 

 

image.thumb.png.8c63d667083018d1fc8338f5dd6fc453.png

 

 

 

  • Solution

What is probability distribution -

A probability distribution is a statistical function that specifies all potential values and probabilities for a random variable within a certain range.

 

2 Types of Probability Distribution:

Distributions of Discrete Probability : A discrete distribution indicates the likelihood of each value of a discrete random variable occurring. These distributions frequently include statistical assessments of "counts" or "the number of times" an event happens. For example, discrete events, such as tossing dice or flipping coins, have a finite number of outcomes.

 

Distributions of Continuous Probability: A continuous distribution explains the probability of the various values of a continuous random variable. A continuous random variable can have an unlimited and uncountable number of values (known as the range). Time, for example, is infinite: you might count from 0 seconds to a million seconds...a quadrillion second...and so on infinitely.

 

 

Normal distribution -

The normal distribution is perhaps the most typical probability distribution and it is continuous distribution. The data tends to cluster around a centre value with no bias to the left or right, and it approaches a "Normal Distribution" in the following way:

 

image.png.32c9693227a0c57e597d74af766dd235.png

The normal distribution has

  • 50% of values are less than the mean, whereas 50% are more than the mean
  • There is a symmetry in the centre
  • Mean, median and mode are equal

 

Few examples are:

Heights of men - image.png.261de36c697829445a623b6b9f200663.png

 

Rolling 2 dices -  image.png.8468b7f18ccf2b20e2509c51d08467f4.png

 

 

Performance - image.png.682bd0d5af2e19412f12b73ba0826736.png

 

 

There are other distributions which have a shape that resembles a bell curve or a Normal distribution. Let's look at few of them -

 

Student's T Distribution -

The Student's T Distribution is a family of distributions that resemble the normal distribution curve but are somewhat shorter and thicker. The Student's T Distribution (and the accompanying t scores) are used in hypothesis testing to determine whether the null hypothesis should be accepted or rejected. It gives the center a lower probability and the tails a larger probability than the normal distribution.

 

When to use Student's T distribution:

  • When there are few samples, the student's t distribution is utilized instead of the normal distribution. This distribution resembles the normal distribution more as the sample size increases. Indeed, for sample sizes greater than 20 (i.e. more degrees of freedom), the distribution closely resembles the normal distribution.
  • The standard deviation of the population is unknown.
  • The distribution of the population is skewed.

image.png.0f80badd589f064234211e6b8509d36f.png

 

 

Example: Measure the average test score from a sample of just 20 students.

The Student's T-distribution should be used to determine the confidence interval around the mean. Your confidence interval will be artificially precise if you utilize the normal distribution (z-distribution)

image.png.b8a2d9b59d34a7b44c4d7bca5c2212f7.png.

 

Logistic Distribution -

The logistic distribution is also a distribution of continuous probability. The shape is similar to the normal distribution, but the tails are heavier (higher kurtosis).The fundamental distinction between the normal and logistic distributions is in the tails and the behaviour of the failure rate function. The tails of the logistic distribution are somewhat longer than those of the normal distribution.

The distribution's shape is determined by two parameters:

  • The location parameter indicates where the x-axis is centred.
  • The scale parameter indicates the spread.

Because the logistic distribution is symmetric, the mean, median, and mode are all the same.

 

When to use Logistic distribution:

The logistic distribution is primarily utilized since the cumulative distribution formula is reasonably straightforward to deal with. The formula very closely approximates the normal distribution. Looking up numbers in the z-table and rounding up or down to the closest z-score is normally how you find cumulative probabilities for the normal distribution. Because the cumulative distribution function is so complex to deal with, exact values are generally discovered using statistical software.

Although there are numerous other functions that can approach the normal, their mathematical formulations are typically exceedingly complex. In comparison, the logistic distribution has a considerably simpler CDF formula.

 

 

Example: 

  • The logistic distribution has been utilised in growth models and in a kind of regression called as logistic regression. 
  • Also it uses to calculate the relative skill level of chess players.

 

 

 

Binomial Distribution -

The binomial distribution is a discrete probability distribution that produces just two outcomes in an experiment: success or failure.

 

In a binomial distribution, two parameters, n and p, are used. The variable 'n' indicates the number of times the experiment is repeated, and the variable 'p' indicates the likelihood of any given outcome.

 

Below mentioned are the few properties of binomial distribution-

  • There are 2 possible outcomes: success or failure, true or false, yes or no.
  • There are a certain number 'n' of independent trials.
  • For each trial, the likelihood of success or failure remains constant.
  • Only the number of successes from n separate trials is computed.

 

How it is be different from normal distribution:

The primary distinction between the binomial and normal distributions is that the binomial distribution is discrete, whereas the normal distribution is continuous. The binomial distribution has a finite number of occurrences, whereas the normal distribution has an infinite number of events. If the sample size for the binomial distribution is sufficiently big, the binomial distribution's distribution curve is comparable to the normal distribution curve.

 

image.png.5d60c642e3faeb6cea89078621830095.png

 

Example: 

  • Flipping a coin - there are just two conceivable results if we flip a coin: either heads or tails
  • To find the number of male and female employees in an organization

All published answers are correct and to the point. Soji Sam's answer scores an extra point for the differentiation between continuous and discrete probability distributions. His answer has been selected as the best answer. 

 

Kudos to all participants for writing such amazing answers!

Create an account or sign in to comment

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.