Summary  
This chapter covers using pandas to extract data from CSV and JSON files into DataFrames, including configuring delimiters and encodings, handling parsing errors, and normalizing nested JSON structures.

General domain of usage  
Data pipelines

```python
import pandas as pd

# Read a CSV file and display its contents
df = pd.read_csv("data/sample_data.csv")
print(df.head())
```

Reading data from CSV files is a common task in data pipelines. You use the `read_csv` function from the **pandas** library to load the file into a DataFrame. This function automatically detects the delimiter (default is comma), but you can specify a different delimiter using the `delimiter` or `sep` parameter if your file uses something else, such as a tab or semicolon. File encoding is another important aspect; most CSV files use UTF-8 encoding, but you might encounter files with different encodings like ISO-8859-1. You can specify the encoding with the `encoding` parameter. If you try to read a file with the wrong encoding, you may see errors or garbled text. Error handling is crucial during extraction. The `read_csv` function provides options like `error_bad_lines=False` (deprecated in newer pandas versions) or `on_bad_lines="skip"` to skip problematic rows, and `warn_bad_lines=True` to display warnings. Always check the documentation for your pandas version to ensure you use the correct parameters.

```python
import pandas as pd

# Read a JSON file with nested structures
df = pd.read_json("data/nested_data.json")

# If the JSON file contains deeply nested data, use json_normalize
if "records" in df.columns:
    from pandas import json_normalize
    nested_df = json_normalize(df["records"])
    print(nested_df.head())
else:
    print(df.head())
```

Which statements correctly describe how to read CSV and JSON files using pandas

Master the practical skills needed to design, build, and automate robust data pipelines using Python. This course covers ETL and ELT fundamentals, batch processing, incremental loading, and orchestration patterns, equipping you to handle real-world data engineering tasks with confidence.

Establish a solid understanding of ETL and ELT concepts, pipeline architecture, and the core components of data workflows.

Dive into practical methods for extracting data from various sources, including files, APIs, and databases.

Master the core transformation techniques and learn how to load processed data into various destinations.

Advance your skills with incremental loading, modularization, testing, logging, and orchestration patterns.

Extracting Data from CSV and JSON Files

Extracting Data from CSV and JSON Files