How to do a Polynomial Regression in Google Sheets?


Regression analysis is used to quantify the relationship between one or more predictor variables and a response variable.

The most common type of regression analysis is simple linear regression, which is used when a predictor variable and a response variable have a linear relationship.

Linear relationship between two variables

However, sometimes the relationship between a predictor variable and a response variable is nonlinear. 

Quadratic relationship example

Cubic relationship example

In these cases it makes sense to use polynomial regression, which can account for the nonlinear relationship between the variables.

This tutorial provides a step-by-step example of how to perform polynomial regression in Google Sheets

Step 1: Create the Data

First, let’s create a fake dataset with the following values:

Step 2: Create a Scatterplot

Next, we’ll create a scatterplot to visualize the data.

First, highlight cells A2:B11 as follows:

Next, click the Insert tab and then click Chart from the dropdown menu:

Step 3: Find the Polynomial Regression Equation

Next, double click anywhere on the scatterplot to bring up the Chart Editor window on the right:

Next, click Series. Then, scroll down and check the box next to Trendline and change the Type to Polynomial. For Label, choose Use Equation and then check the box next to Show R2.

This will cause the following formula to be displayed above the scatterplot:

Polynomial regression in Google Sheets

We can see that the fitted polynomial regression equation is:

y = 9.45 + 2.1x – 0.0188x2

The R-squared for this model is 0.718.

Recall that tells us the percentage of variation in the response variable that can be explained by the predictor variables. The higher the value, the better the model.

Next, change the Polynomial degree to 3 in the Chart Editor:

Changing the polynomial degree in Google Sheets

This will cause the following formula to be displayed above the scatterplot:

Cubic regression in Google Sheets

This causes the fitted polynomial regression equation to change to:

y = 37.2 – 14.2x + 2.64x2 – 0.126x3

The R-squared for this model is 0.976.

Notice that the R-squared for this model is significantly higher than the polynomial regression model with a degree of 2. This suggests that this regression model is significantly better at capturing the trend in the underlying data.

If you change the degree of the polynomial to 4, the R-squared increases just barely to 0.981. This suggests that a polynomial regression model with a degree of 3 is sufficient to capture the trend for this data.

We can use the fitted regression equation to find the expected value for the response variable based on a given value for the predictor variable. For example, if x = 4 then the expected value for y would be:

y = 37.2 – 14.2(4) + 2.64(4)2 – 0.126(4)3 = 14.576


x