Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Visualizing Correlations with Scatter Plots | Basic Statistical Analysis
Data Analysis with R

bookVisualizing Correlations with Scatter Plots

While correlation heatmaps offer a summary of linear relationships between variables, they can't be used to identify non-linear relationships. In these situations, scatter plots are more useful.

Why Use Scatter Plots?

Scatter plots are a simple yet powerful way to visualize the relationship between two numeric variables. They make it possible to detect both linear and non-linear trends, highlight outliers or anomalies, and provide deeper insight into relationships that might only appear abstractly in a correlation heatmap. By plotting individual data points, scatter plots give a clear and intuitive view of how variables interact.

Example: Selling Price vs. Max Power

A scatter plot can be used to examine how a car's power influences its market value. In this case, the x-axis represents maximum power, while the y-axis represents selling price.

ggplot(data = df, aes(x = max_power, y = selling_price)) +
  geom_point() +
  labs(title = "Scatter Plot of Selling Price vs. Max Power",
       x = "Max Power",
       y = "Selling Price")

This visualization makes it easy to see whether a positive linear relationship exists, where higher power generally corresponds to a higher selling price.

question mark

What does a tight upward slope in a scatter plot indicate?

Select the correct answer

Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 3. ChapterΒ 7

Ask AI

expand

Ask AI

ChatGPT

Ask anything or try one of the suggested questions to begin our chat

Suggested prompts:

Can you explain how to interpret scatter plots for non-linear relationships?

What are some common patterns to look for in scatter plots?

Can you give more examples of variables that might show no correlation?

Awesome!

Completion rate improved to 4

bookVisualizing Correlations with Scatter Plots

Swipe to show menu

While correlation heatmaps offer a summary of linear relationships between variables, they can't be used to identify non-linear relationships. In these situations, scatter plots are more useful.

Why Use Scatter Plots?

Scatter plots are a simple yet powerful way to visualize the relationship between two numeric variables. They make it possible to detect both linear and non-linear trends, highlight outliers or anomalies, and provide deeper insight into relationships that might only appear abstractly in a correlation heatmap. By plotting individual data points, scatter plots give a clear and intuitive view of how variables interact.

Example: Selling Price vs. Max Power

A scatter plot can be used to examine how a car's power influences its market value. In this case, the x-axis represents maximum power, while the y-axis represents selling price.

ggplot(data = df, aes(x = max_power, y = selling_price)) +
  geom_point() +
  labs(title = "Scatter Plot of Selling Price vs. Max Power",
       x = "Max Power",
       y = "Selling Price")

This visualization makes it easy to see whether a positive linear relationship exists, where higher power generally corresponds to a higher selling price.

question mark

What does a tight upward slope in a scatter plot indicate?

Select the correct answer

Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 3. ChapterΒ 7
some-alt