Benchmark Six Sigma Expert View by Venugopal R
Using median as a measure of central tendency helps to avoid effect of outliers. For those who need a clarity on fundamental behavior of mean and median, the following simple example will help.
Consider as set of nine data points representing the minimum time in days between failures for nine similar equipment.
70, 248, 2400, 240, 2, 1460, 230, 180, 440
The mean for the above data is 586 whereas the median is 240.
Now consider the data set below, which is same as above except that the maximum value has further increased from 2400 to 4800
70, 248, 4800, 240, 2, 1460, 230, 180, 440
The mean has shot up to 852, whereas the median remains unaffected at 240.
In the above situation, the median is a more realistic representation as a measure of central tendency of the data.
Few examples where the median may be a better choice:
1. Income data in an organization: It is quite possible that there could be a few high paid individuals, by which the mean could be severely biased, hence median is preferable.
2. Age of employees in a society: A few very senior citizens among a majority of people being in the lower middle age band, could give a non-normal distribution.
3. Customer satisfaction surveys using a Likert scale of 1 to 10: A very few customers voting on the upper or lower extreme could distort the reality – hence usage of median helps.
4. Life expectancy based on a specialized treatment: For instance if most patients had a post treatment life span in the range of 10 to 15, one odd patient living for 45 years could provide an unrealistic expectancy, unless we use median as a measure of performance.
5. The comparative tests performed on non-normal distributions, knows as non-parametric tests are based on usage of median. Examples of such tests are 1-Sample sign, Wilcoxon Signed rank, Mann Whitney, Kruskal Wallis, Moods Median.