Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.
Message added by Mayank Gupta,

R-Squared is also known as the Coefficient of Determination and is an output from regression analysis. It represents the percentage of response variable variation that is explained by its relationship with one or more predictor variables. In general, the higher the R-squared value, the better the model fits your data. It is always between 0 and 100%.

 

R-Squared Adjusted is a modified version of R-Squared value. In addition to explaining the percentage of response variable variation that is explained by its relationship with one or more predictor variables, it also takes into account the number of predictor variables. It increases if an additional predictor improves the model more than what is expected by chance. R-Squared Adjusted can be used to compare multiple regression models with different number of predictor variables for the same response variable

 

An application-oriented question on the topic along with responses can be seen below. The best answer was provided by Keerthi Vasan on 14th Nov 2023.

 

Applause for all the respondents - Adil Khan, Anurag Nayak, Keerthi Vasan.

Featured Replies

Q 616R-squared (R-sq) or R-squared adjusted (R-sq(adj)) is the most sought after number after Regression or Multiple Linear Regression. While potentially R-sq can range from 0% to 100%, could there be a situation where it is 0%? Provide examples to support your answer.

 

Note for website visitors -

Solved by Keerthi vasan

Regression analysis is used to describe relationship between a set of independent variables and the dependent variables.

 

R squared (R2) is generally defined as statistic that indicate % of Variance in dependent variable that the independent Variable could explain collectively. R Square evaluates the scatter of the data points around the fitted regression.  It is mostly referred as Coefficient of determination.

 

Example

Let us say there are 6 ingredients (X) used to finally make a tasty dish (Y). R square will help to determine which ingredient has more impact on the final taste of the dish(Y).

We need to see which ingredient has more impact on final taste of dish.

Salt R2 15%

Spices R2 35%

Pepper R2 10%

Tomato R2 20%

Onion R2 20%

Color Powder R2 is 0%

 

Looking at R2 of all the ingredients (X) it Indicating spices have a higher impact on the taste of the dish (Y) & Color Powder has no impact on the final taste of the dish.

 

Lets see one more example to see what it means when R square is 0. (Advertisement Vs Sales of Umbrellas)

 

image.png.0636a92e2b7e5d143d3d2e9fbd5a8a16.png

 

Interesting thing to note here that when R square is zero the line becomes horizontally indicating you cannot in any way predict out put Y with respect to input X. As they have no correlation.

 

 

Lets see what it means when input (X) has some impact on the Output (Y)   (Rainfall Vs Sales of Umbrellas)

 

image.png.f27dbbf6aafeaa54a356c059ea93934b.png

 

 

R-squared may seems like an easy to understand statistic that indicates how well a regression model fits a data set. However, it does not tell us the complete story. To get the full picture, you must consider R square values in combination with the residual plots refer above image.

image.png.2db23c8452fd1c716fc68b52c7a60ad4.png

R-Square-it determines how well the data fits in the regression model.

it determines the % of variation between the dependent variable with independent variable.

R-Square adjusted-It is same as Rsquare,but it is used when the predictors are more.

When new/more predictors are there then R-squared adjusted gives the data of relation between all the dependent variables with independent variable.

It depend upon dependent variable .

If the dependent variable is high then R-Squared adj  will be high.

R-squared predicted-It is a cross validation.

It is inversely proportional to dependent variable.

If the dependent variable will increase then R-squared predicted will decrease.

  • Solution

Yes, although it's rare but it is possible for the value to be zero- it means the independent variables don't explain any variability in the dependent variable. Some reasons are below:


1. Selection of incorrect controllable variables
2. Non linearity relationship between variables


Examples


1. Selection of incorrect independent variables

 

     Modelling company top line as dependent variable and employee birthday as independent variable. Since both the variables are completely unrelated, the results would be skewed.


2. Non linearity

 

     Creating a model using delivery time as output variable and inventory level as input variable (this is because of impact of other parameters like traffic, breakdown etc.). This can be solved by using polynomial or non linear regression model.

Keerthi Vasan has given the best answer to the question.

Create an account or sign in to comment

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.