Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Predict House Prices | Simple Linear Regression
Linear Regression with Python
course content

Course Content

Linear Regression with Python

Linear Regression with Python

1. Simple Linear Regression
2. Multiple Linear Regression
3. Polynomial Regression
4. Choosing The Best Model

Predict House Prices

Let's build a real-world example regression model. We have a file, houses_simple.csv, that holds information about housing prices with its area as a feature.

Let's assign variables and visualize our dataset!

In the example with a person's height, it was much easier to imagine a line fitting the data well.
But now our data has much more variance since the target highly depends on many other things like age, location, interior, etc.
Anyway, the task is to build the line that best fits the data we have; it will show the trend. The OLS class should be used for that. Soon we will learn how to add more features, it will make the prediction better!

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Everything was clear?

Section 1. Chapter 5
toggle bottom row

Predict House Prices

Let's build a real-world example regression model. We have a file, houses_simple.csv, that holds information about housing prices with its area as a feature.

Let's assign variables and visualize our dataset!

In the example with a person's height, it was much easier to imagine a line fitting the data well.
But now our data has much more variance since the target highly depends on many other things like age, location, interior, etc.
Anyway, the task is to build the line that best fits the data we have; it will show the trend. The OLS class should be used for that. Soon we will learn how to add more features, it will make the prediction better!

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Everything was clear?

Section 1. Chapter 5
toggle bottom row

Predict House Prices

Let's build a real-world example regression model. We have a file, houses_simple.csv, that holds information about housing prices with its area as a feature.

Let's assign variables and visualize our dataset!

In the example with a person's height, it was much easier to imagine a line fitting the data well.
But now our data has much more variance since the target highly depends on many other things like age, location, interior, etc.
Anyway, the task is to build the line that best fits the data we have; it will show the trend. The OLS class should be used for that. Soon we will learn how to add more features, it will make the prediction better!

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Everything was clear?

Let's build a real-world example regression model. We have a file, houses_simple.csv, that holds information about housing prices with its area as a feature.

Let's assign variables and visualize our dataset!

In the example with a person's height, it was much easier to imagine a line fitting the data well.
But now our data has much more variance since the target highly depends on many other things like age, location, interior, etc.
Anyway, the task is to build the line that best fits the data we have; it will show the trend. The OLS class should be used for that. Soon we will learn how to add more features, it will make the prediction better!

Task

  1. Assign the 'price' column of df to y.
  2. Create the X_tilde matrix using the add_constant() function from statsmodels(imported as sm).
  3. Initialize the OLS object and train it.
  4. Preprocess X_new array the same way as X.
  5. Predict the target for X_new_tilde matrix.

Once you've completed this task, click the button below the code to check your solution.

Section 1. Chapter 5
Switch to desktop for real-world practiceContinue from where you are using one of the options below
We're sorry to hear that something went wrong. What happened?
some-alt