Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Exploring Data with dplyr | Section
Data Wrangling with Tidyverse in R

bookExploring Data with dplyr

Swipe to show menu

When you work with data frames in R, the dplyr package gives you a powerful set of tools for exploring and manipulating your data. The most important dplyr verbs are select, filter, arrange, mutate, and summarize. Each verb performs a specific type of operation:

  • select: choose specific columns from your data;
  • filter: keep only rows that meet certain conditions;
  • arrange: reorder rows based on column values;
  • mutate: add new columns or transform existing ones;
  • summarize: reduce your data to summary statistics.

These verbs allow you to quickly inspect and explore your data frames, making it easier to focus on the information that matters most.

12345678910111213141516
library(dplyr) options(crayon.enabled = FALSE) # Create a sample tibble data <- tibble::tibble( name = c("Alice", "Bob", "Carol", "David"), age = c(25, 30, 22, 35), score = c(88, 92, 95, 85) ) # Use select and filter to subset the tibble result <- data %>% select(name, score) %>% filter(score > 90) print(result)
copy

A key feature of dplyr is the pipe operator %>%, which lets you chain together multiple operations in a clear, readable sequence. Instead of nesting functions inside each other, you pass the result of one operation directly into the next. This approach makes your code easier to read and understand, especially as your data wrangling tasks become more complex.

question mark

What is the main purpose of the pipe (%>%) operator in dplyr workflows?

Select the correct answer

Everything was clear?

How can we improve it?

Thanks for your feedback!

Section 1. Chapter 3

Ask AI

expand

Ask AI

ChatGPT

Ask anything or try one of the suggested questions to begin our chat

Section 1. Chapter 3
some-alt