Course Content

# Linear Regression with Python

4. Choosing The Best Model

Linear Regression with Python

## Quadratic Regression

## The problem with Linear Regression

Before defining the Polynomial Regression, let's look at the case that the Linear Regression we have learned before doesn't handle well.

Here you can see that our simple linear regression model is doing awful. That is because it tries to fit a straight line to the data points. Yet we can notice that fitting a parabola would be a much better choice for our points.

## Quadratic Regression Equation

To build a straight-line model, we used an equation of a line (**y=ax+b**). So to build a parabolic model, we need the equation of a parabola. That is the quadratic equation: **y=ax²+bx+c**. Changing the **a**, **b**, and **c** to **β** would give us the **Quadratic Regression Equation**:

The model this equation describes is called **Quadratic Regression**. Like before, we only need to find the best parameters for our data points.

## Normal Equation and X̃

As always, the Normal Equation handles finding the best parameters. But we need to define the **X̃** properly.

We already know how to build the **X̃** matrix for Multiple Linear Regression. It turns out the **X̃** matrix for Polynomial Regression is constructed similarly. We can think of **x²** as a second feature. This way, we need to add a corresponding new column to the **X̃**. It will hold the same values as the previous column but squared.

The video below shows how to build the **X̃**.

Everything was clear?