Data Manipulation using pandas

Histograms

Let's move on to the first visualization steps. By now you already know how to clean, prepare, and aggregate data for further analysis. We'll start with histograms.

What is a histogram? A histogram is a graph that shows the frequencies of numerical data, usually grouped into numerical intervals. To build a histogram in `pandas`, apply the `.hist()` method to the selected data. For instance, let's build a histogram for the `'totinch'` column.
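Here is a minimal sketch of that call. The course dataset is not reproduced here, so the `df` DataFrame and its synthetic `'totinch'` values below are assumptions made only to keep the example runnable; `matplotlib` must be installed, since `pandas` uses it for plotting.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in data: 500 synthetic values for a 'totinch' column.
# Replace this with the real DataFrame from the course.
rng = np.random.default_rng(seed=0)
df = pd.DataFrame({'totinch': rng.gamma(shape=2.0, scale=3.0, size=500)})

# .hist() on a single column draws the histogram and returns the matplotlib Axes.
# With no arguments, the data is split into 10 intervals (bins).
ax = df['totinch'].hist()
```

In a notebook the plot is rendered automatically; in a plain script you would typically call `matplotlib.pyplot.show()` afterwards to open the window.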

Note that you don't need to use the `print()` function to output the plot.

As parameters, you can set `color` (the color of the rectangles, such as `'r'`, `'g'`, or `'b'`) or `bins` (the number of intervals to divide the data into). Let's make the rectangles red and set the number of intervals to 50.
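A sketch of the same histogram with both parameters set might look like this. As before, the `df` DataFrame and its synthetic `'totinch'` values are assumptions used only so the example runs on its own.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in data: 500 synthetic values for a 'totinch' column.
rng = np.random.default_rng(seed=0)
df = pd.DataFrame({'totinch': rng.gamma(shape=2.0, scale=3.0, size=500)})

# color='r' makes the rectangles red; bins=50 splits the data into 50 intervals.
ax = df['totinch'].hist(color='r', bins=50)
```

More bins give a finer view of the distribution, but with too many bins each rectangle holds only a few values and the shape becomes noisy.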
