Summary  
This chapter demonstrates how to compute pairwise correlation coefficients and generate a full correlation matrix for numeric variables—covering data type conversion, missing-value handling, and use of built-in functions.  

General domain of usage  
Data analysis

**Correlation analysis** is a statistical technique used to measure the strength and direction of a relationship between two numeric variables. It helps us understand how changes in one variable are associated with changes in another.

## What Is Correlation?
A correlation coefficient (usually represented as $$r$$) ranges between -1 and 1 and means:
- **1**: perfect positive correlation;
- **0**: no correlation;
- **−1**: perfect negative correlation.

There are several types of correlation methods, but Pearson correlation is the most commonly used for numeric continuous data in R.

## Correlation Between Two Variables
You can use the `cor()` function to compute the correlation coefficient between two variables. All you need is to provide two columns as parameters.
```
cor(df$selling_price, df$km_driven)
```
As a result, the function returns a value between -1 and 1.

## Correlation Matrix (Multiple Variables)
The same function can be used to examine relationships between multiple variables.
```
# Select only numeric columns
numeric_df <- df[, c("selling_price", "km_driven", "max_power", "mileage", "engine", "seats")]
# Compute correlation matrix
cor_matrix <- cor(numeric_df, use = "complete.obs")  # Ignores any rows with missing data
```
The result is stored as a matrix that shows pairwise correlation values between all selected numeric variables.

A correlation coefficient of **-0.9** indicates:


Gain practical experience in data analysis with R by learning how to clean, transform, and visualize datasets. Explore essential workflows such as selecting and filtering data, handling missing values, and summarizing results. Build confidence in preparing data for insights, reporting, and deeper statistical exploration.

Explore the foundations of data analysis with R. Learn how to install the tools, load and inspect datasets, select and filter information, sort and transform data, handle missing values, and summarize results for deeper insights.

Learn to create compelling visualizations with ggplot2. Build bar charts, histograms, density plots, and scatter plots, then customize and refine them with styling options and faceting to reveal deeper insights in your data.

Strengthen your understanding of statistics for data analysis. Apply descriptive measures, identify and treat outliers, and use correlation techniques with visual tools like heatmaps and scatter plots to uncover meaningful relationships.

Correlation Analysis

What Is Correlation?

Correlation Between Two Variables

Correlation Matrix (Multiple Variables)

Correlation Analysis

What Is Correlation?

Correlation Between Two Variables

Correlation Matrix (Multiple Variables)