Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Aprende Simple Linear Regression | Section
Applying Statistical Methods
Sección 1. Capítulo 8
single

single

bookSimple Linear Regression

Desliza para mostrar el menú

Simple linear regression is a foundational statistical method that helps you understand the relationship between two continuous variables: one independent (predictor) and one dependent (response). The goal is to model the dependent variable as a linear function of the independent variable. The general form of the model is:

y=β0+β1x+εy = β₀ + β₁x + ε

where yy is the dependent variable, xx is the independent variable, β0β₀ is the intercept, β1β₁ is the slope, and εε is the error term.

To fit a linear regression model in Python using scikit-learn, follow these key steps:

  1. Prepare your data: organize your predictor (xx) and response (yy) variables, typically in arrays or pandas Series;
  2. Import and instantiate the model: use LinearRegression from scikit-learn;
  3. Fit the model: call the fit() method with your data to estimate the best-fitting line;
  4. Interpret the results: extract the slope (coef_) and intercept (intercept_) to understand the relationship.

The slope (β1β₁) tells you how much y is expected to change for a one-unit increase in x. The intercept (β0β₀) represents the expected value of y when x is zero. By examining these parameters, you can assess both the direction and strength of the relationship.

Visualizing the regression line alongside the data points can help you interpret the model's fit and the nature of the relationship between the variables.

1234567891011121314151617181920212223242526272829
import numpy as np import matplotlib.pyplot as plt from sklearn.linear_model import LinearRegression # Sample data: hours studied vs exam score hours_studied = np.array([1, 2, 3, 4, 5, 6]).reshape(-1, 1) exam_score = np.array([50, 55, 65, 70, 75, 80]) # Create and fit the linear regression model model = LinearRegression() model.fit(hours_studied, exam_score) # Extract slope and intercept slope = model.coef_[0] intercept = model.intercept_ print("Slope:", slope) print("Intercept:", intercept) # Predict values and plot the regression line predicted_scores = model.predict(hours_studied) plt.scatter(hours_studied, exam_score, color='blue', label='Actual Data') plt.plot(hours_studied, predicted_scores, color='red', label='Regression Line') plt.xlabel('Hours Studied') plt.ylabel('Exam Score') plt.title('Simple Linear Regression: Hours Studied vs Exam Score') plt.legend() plt.show()
copy
Tarea

Desliza para comenzar a programar

Fit a simple linear regression model to a dataset in the global scope.

  • Initialize a LinearRegression model from sklearn.linear_model.
  • Fit the model using ad_spend as the predictor and sales_revenue as the response. Ensure you reshape ad_spend to a 2D array utilizing .reshape(-1, 1) since scikit-learn requires 2D arrays for features.
  • Extract the slope utilizing model.coef_[0] and assign it to the slope variable.
  • Extract the intercept utilizing model.intercept_ and assign it to the intercept variable.

Solución

Switch to desktopCambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones
¿Todo estuvo claro?

¿Cómo podemos mejorarlo?

¡Gracias por tus comentarios!

Sección 1. Capítulo 8
single

single

Pregunte a AI

expand

Pregunte a AI

ChatGPT

Pregunte lo que quiera o pruebe una de las preguntas sugeridas para comenzar nuestra charla

some-alt