Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Challenge: Data Wrangling Workflow | Section
Data Wrangling with Tidyverse in R
Section 1. Chapter 11
single

single

bookChallenge: Data Wrangling Workflow

Swipe to show menu

Imagine you are working with a dataset that contains information about sales transactions from a small retail shop. The data arrives in a messy form: some values are missing, column names are inconsistent, and there are duplicate rows. Your goal is to apply a complete data wrangling workflow using Tidyverse tools in R. You will need to import (simulate) the data, clean it by renaming columns and removing duplicates, transform it by creating new variables and handling missing values, and finally summarize the sales by product category to provide insights for the shop owner.

Task

Swipe to start coding

You are given a simulated messy dataset representing sales transactions. Your task is to apply a complete data wrangling workflow using Tidyverse functions to clean, transform, and summarize the data.

  • Rename the columns to consistent, tidy names.
  • Remove duplicate rows from the dataset.
  • Replace missing values in the sales_amount column with 0 and remove rows with missing date_sold.
  • Create a new variable sales_amount_usd by multiplying sales_amount by 2.
  • Summarize the total sales in USD by product category.

Solution

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

Section 1. Chapter 11
single

single

Ask AI

expand

Ask AI

ChatGPT

Ask anything or try one of the suggested questions to begin our chat

some-alt