John Wilder Tukey an American statistician and mathematician developed the Rootogram. He is also known for the Fast Fourier Transform algorithm, the Tukey Lambda distribution, Tukey test of additivity, Tukey range test and the Teichmeller-Tukey lemma.
Oxford Reference Definition
Oxford Reference defines a rootogram as “a diagram suggested by Tukey in 1971, for comparing an observed bar chart or histogram (with equal-width categories) with a theoretical probability distribution. The comparison is made easier by ‘hanging’ the observed results from the theoretical curve so that the discrepancies are seen by comparison with the horizontal axis rather than a sloping curve. As in the rootogram, the vertical axis is scaled to the square root of the frequencies so as to draw attention to discrepancies in the tails of the distribution.”
R Package Documentation Definition
The R Package documentation describes the rootogram function to “graphically compare (square roots) of empirical frequencies with fitted frequencies from a probability model.” “Rootograms graphically compare frequencies of empirical distributions and fitted probability models. For the observed distribution, the histogram is drawn on a square root scale (hence the name) and superimposed with a line for the fitted frequencies. The histogram can be “standing” on the x-axis (as usual), or “hanging” from the fitted curve, or a “suspended” histogram of deviations can be drawn. “
Paper of Use of Rootogram for Count Data
A rootogram is a visual tool that was initially used by Tukey to assess the goodness of fit of univariate distributions. Christian Kleiber of the Universitat Basel and Achim Zeileis of the Universitat Innsbruck in their paper “Visualizing Count Data Regressions Using Rootograms” have used rootograms to look at issues such as overdispersion, excess zeros in regression models for count data. Count data regression plots are done in the form of bar plots of the expected and observed frequencies. Rootograms are used to see the fit of both continuous data and count data.
Rootograms compare the observed frequencies using bars (histograms) and the expected frequencies using a curve on a square root scale. Taking the square root scale transforms the date to adjust to the scale differences across the intervals. This makes the deviations across the interval for smaller observed/expected frequencies to be more visible in the plot.
For example, the deviations of 9 as comped to 3600 would only be 1:400, however, the square of the numbers 3 and 60 is 1:20. This is a visual magnification of 20 times.
There are three types of rootograms. The standing rootograms show the bars and a curve. In this, the deviations are not aligned. The standing rootogram is the least used as it just plots the bars and the curve representing the model, however, the fit is not shown. The hanging rootograms align all the deviations along the horizontal axis. The bars are hanging from the curve representing the expected frequencies whereas the suspended rootogram shows mainly the deviations as against the observed frequencies. The hanging and suspended use the horizontal reference line which shows the deviations between the observed and expected frequencies.
Example of Rootogram for Poisson Distribution and Negative Binomial Distribution
Analysis of above Rootograms
Rootograms are used to detect patterns such as runs of positive or negative deviations. The top row of the figure above shows only small deviation when fitting a Poisson model to Poisson data. The expected frequencies and observed frequencies show minimum deviation. In the bottom row of the figure above shows large deviations when fitting a negative binomial distribution. The expected frequencies do not track the observed frequencies.
References
Kleiber, C, Zeileis, A. (2016). Visualizing Count Data Regressions Using Rootograms. American Statistician, Volume 70, Issue 3, Pages 296 to 303
Oxford Reference (https://www.oxfordreference.com/view/10.1093/oi/authority.20110803095919378
R Package Documentation
https://rdrr.io/rforge/topmodels/man/rootogram.html