Summary  
This chapter covers computing and interpreting model performance metrics by implementing code to calculate root mean squared error (RMSE) for regression and accuracy for classification, and demonstrates using specialized functions that accept data frames with truth and prediction columns. It also shows how to print the resulting metric values.

General domain of usage  
Machine learning model evaluation

When you build predictive models, you need a reliable way to determine how good your model is at making predictions. Quantifying model accuracy and error helps you compare different models, understand where your model might be failing, and decide whether your model is ready for deployment. Without clear metrics, you would have no objective basis for improvement or selection. Metrics like **root mean squared error (RMSE)** for regression and **accuracy** for classification are commonly used to summarize model performance in a single, interpretable number.

library(dplyr)

# Example regression predictions
reg_results <- tibble(
  truth = c(2.5, 0.0, 2.1, 1.6),
  prediction = c(3.0, -0.1, 2.0, 1.5)
)

# Compute RMSE (base R)
reg_rmse <- sqrt(mean((reg_results$truth - reg_results$prediction)^2))

# Example classification predictions
class_results <- tibble(
  truth = factor(c("cat", "dog", "cat", "dog")),
  prediction = factor(c("cat", "cat", "cat", "dog"))
)

# Compute accuracy (base R)
class_acc <- mean(class_results$truth == class_results$prediction)

# Print results
print(reg_rmse)
print(class_acc)

The `rmse()` and `accuracy()` functions from the `yardstick` package require a data frame or tibble with at least two columns: one for the true values (`truth`) and one for the predicted values (`prediction`). For `rmse()`, both columns should be numeric. For `accuracy()`, both columns should be factors with matching levels. You specify which columns represent the true values and predictions using the `truth =` and `estimate =` arguments. These functions return a tibble summarizing the metric, which includes the metric name, the type of estimator, and the calculated value.

Always ensure the column names you pass to `truth` and `estimate` match exactly with your data frame. For classification, make sure your factor levels are consistent between `truth` and `prediction` columns, or you may get misleading results or errors. Choose metrics appropriate for your problem type — do not use accuracy for regression, or RMSE for classification.

Note

Which statement best describes how to use RMSE and accuracy when evaluating model performance?

Master practical data science in R by learning data cleaning, modeling, evaluation, and machine learning workflows through hands-on code. Build fluency with R syntax, functions, and outputs for real-world data science tasks.

Learn to wrangle, clean, and prepare data in R using practical, code-driven workflows.

Engineer features and reshape data for modeling using R’s tidyverse tools.

Fit, interpret, and use regression and classification models with R code.

Evaluate models and build simple machine learning pipelines in R.

Evaluating Model Performance