What is the interpretation of adjusted R-squared and can you provide examples?

What is the interpretation of adjusted R-squared and can you provide examples?

Adjusted R-squared is a statistical measure that assesses the goodness of fit of a regression model. It takes into account the number of independent variables in the model and adjusts the R-squared value accordingly. This helps to avoid overestimation of the model’s predictive power by penalizing the addition of unnecessary variables.

For example, let’s say a researcher is trying to predict a person’s height using their age and weight as independent variables. The R-squared value for this model may be 0.85, indicating a good fit. However, when the researcher adds a third independent variable, such as shoe size, the R-squared value may increase to 0.90. This increase may be misleading as shoe size is not a relevant predictor of height. Adjusted R-squared takes into account the addition of this unnecessary variable and adjusts the value to a more accurate representation of the model’s predictive power.

In summary, adjusted R-squared provides a more reliable interpretation of a regression model’s goodness of fit by penalizing the inclusion of irrelevant variables. It is a useful tool for evaluating the effectiveness of a model in predicting outcomes.

Interpret Adjusted R-Squared (With Examples)


When we fit we often calculate the R-squared value of the model.

The R-squared value is the proportion of the variance in the response variable that can be explained by the predictor variables in the model.

The value for R-squared can range from 0 to 1 where:

  • A value of 0 indicates that the response variable cannot be explained by the predictor variables at all.
  • A value of 1 indicates that the response variable can be perfectly explained by the predictor variables.

Although this metric is commonly used to assess how well a regression model fits a dataset, it has one serious drawback:

The drawback of R-squared:

 

R-squared will always increase when a new predictor variable is added to the regression model.

Even if a new predictor variable is almost completely unrelated to the response variable, the R-squared value of the model will increase, if only by a small amount.

For this reason, it’s possible that a regression model with a large number of predictor variables has a high R-squared value, even if the model doesn’t fit the data well.

Fortunately there is an alternative to R-squared known as adjusted R-squared.

The adjusted R-squared is a modified version of R-squared that adjusts for the number of predictors in a regression model.

It is calculated as:

Adjusted R2 = 1 – [(1-R2)*(n-1)/(n-k-1)]

where:

  • R2: The R2 of the model
  • n: The number of observations
  • k: The number of predictor variables

Because R-squared always increases as you add more predictors to a model, the adjusted R-squared can tell you how useful a model is, adjusted for the number of predictors in a model.

The advantage of Adjusted R-squared:

 

Adjusted R-squared tells us how well a set of predictor variables is able to explain the variation in the response variable, adjusted for the number of predictors in a model.

 

Because of the way it’s calculated, adjusted R-squared can be used to compare the fit of regression models with different numbers of predictor variables.

To gain a better understanding of adjusted R-squared, check out the following example.

Example: Understanding Adjusted R-Squared in Regression Models

Suppose a professor collects data on students in his class and fits the following regression model to understand how hours spent studying and current grade in the class affect the score a student receives on the final exam.

Exam Score = β0 + β1(hours spent studying) + β2(current grade)

Suppose this regression model has the following metrics:

  • R-squared: 0.955
  • Adjusted R-squared: 0.946

Now suppose the professor decides to collect data on another variable for each student: shoe size.

Although this variable should be completely unrelated to the final exam score, he decides to fit the following regression model:

Exam Score = β0 + β1(hours spent studying) + β2(current grade) + β3(shoe size)

Suppose this regression model has the following metrics:

  • R-squared: 0.965
  • Adjusted R-squared: 0.902

If we only looked at the R-squared values for each of these two regression models, we would conclude that the second model is better to use because it has a higher R-squared value!

However, if we look at the adjusted R-squared values then we come to a different conclusion: The first model is better to use because it has a higher adjusted R-squared value.

The second model only has a higher R-squared value because it has more predictor variables than the first model.

However, the predictor variable that we added (shoe size) was a poor predictor of final exam score, so the adjusted R-squared value penalized the model for adding this predictor variable.

This example illustrates why adjusted R-squared is a better metric to use when comparing the fit of regression models with different numbers of predictor variables.

Additional Resources

The following tutorials explain how to calculated adjusted R-squared values using different statistical software:

Cite this article

stats writer (2024). What is the interpretation of adjusted R-squared and can you provide examples?. PSYCHOLOGICAL SCALES. Retrieved from https://scales.arabpsychology.com/stats/what-is-the-interpretation-of-adjusted-r-squared-and-can-you-provide-examples/

stats writer. "What is the interpretation of adjusted R-squared and can you provide examples?." PSYCHOLOGICAL SCALES, 29 Jun. 2024, https://scales.arabpsychology.com/stats/what-is-the-interpretation-of-adjusted-r-squared-and-can-you-provide-examples/.

stats writer. "What is the interpretation of adjusted R-squared and can you provide examples?." PSYCHOLOGICAL SCALES, 2024. https://scales.arabpsychology.com/stats/what-is-the-interpretation-of-adjusted-r-squared-and-can-you-provide-examples/.

stats writer (2024) 'What is the interpretation of adjusted R-squared and can you provide examples?', PSYCHOLOGICAL SCALES. Available at: https://scales.arabpsychology.com/stats/what-is-the-interpretation-of-adjusted-r-squared-and-can-you-provide-examples/.

[1] stats writer, "What is the interpretation of adjusted R-squared and can you provide examples?," PSYCHOLOGICAL SCALES, vol. X, no. Y, ص Z-Z, June, 2024.

stats writer. What is the interpretation of adjusted R-squared and can you provide examples?. PSYCHOLOGICAL SCALES. 2024;vol(issue):pages.

Download Post (.PDF)
Slide Up
x
PDF
Scroll to Top