Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Impara Find/Find_all | Beautiful Soup
Web Scraping with Python

Scorri per mostrare il menu

book
Find/Find_all

BeautifulSoup offers methods for going through HTML tags. One of them is the function .find(). It returns the first tag which matches the parameter or None if there are no matches:

12
print(soup.find("p")) print(soup.find("h9"))
copy

We will get the same result by accessing tags directly from the BeautifulSoup object: ​​print(soup.p).

To receive the list of all occurrences of the particular tag, we can use the built-in function of the BeautifulSoup object .find_all():

python

It returns the list of instances of the tag object provided by BeautifulSoup. Tag objects offer a comfortable interface to work with their contents.

One of the most important functions of BeautifulSoup is the ability to find the specific types of tags using their attributes:

12
print(soup.find_all("p", id = "id2")) print(soup.find_all(attrs = {"class":"afterbanner", "id": "id1"}))
copy

The functions .find() and .find_all() are more convenient in usage as they can work in combination with attributes and regexes.

Using scraping, you are always interested in a specific part of the website, and unique attributes can help to identify them.

Compito

Swipe to start coding

In this task, you will work with the following page.

  1. Create the BeautifulSoup object using as parameters html and "html.parser".
  2. Print the first div tag using the function .find() of the object soup.
  3. Print the p tag where the id equal to "id0" using the function .find_all() of the soup oblect.

Soluzione

Switch to desktopCambia al desktop per esercitarti nel mondo realeContinua da dove ti trovi utilizzando una delle opzioni seguenti
Tutto è chiaro?

Come possiamo migliorarlo?

Grazie per i tuoi commenti!

Sezione 2. Capitolo 3
Siamo spiacenti che qualcosa sia andato storto. Cosa è successo?

Chieda ad AI

expand
ChatGPT

Chieda pure quello che desidera o provi una delle domande suggerite per iniziare la nostra conversazione

book
Find/Find_all

BeautifulSoup offers methods for going through HTML tags. One of them is the function .find(). It returns the first tag which matches the parameter or None if there are no matches:

12
print(soup.find("p")) print(soup.find("h9"))
copy

We will get the same result by accessing tags directly from the BeautifulSoup object: ​​print(soup.p).

To receive the list of all occurrences of the particular tag, we can use the built-in function of the BeautifulSoup object .find_all():

python

It returns the list of instances of the tag object provided by BeautifulSoup. Tag objects offer a comfortable interface to work with their contents.

One of the most important functions of BeautifulSoup is the ability to find the specific types of tags using their attributes:

12
print(soup.find_all("p", id = "id2")) print(soup.find_all(attrs = {"class":"afterbanner", "id": "id1"}))
copy

The functions .find() and .find_all() are more convenient in usage as they can work in combination with attributes and regexes.

Using scraping, you are always interested in a specific part of the website, and unique attributes can help to identify them.

Compito

Swipe to start coding

In this task, you will work with the following page.

  1. Create the BeautifulSoup object using as parameters html and "html.parser".
  2. Print the first div tag using the function .find() of the object soup.
  3. Print the p tag where the id equal to "id0" using the function .find_all() of the soup oblect.

Soluzione

Switch to desktopCambia al desktop per esercitarti nel mondo realeContinua da dove ti trovi utilizzando una delle opzioni seguenti
Tutto è chiaro?

Come possiamo migliorarlo?

Grazie per i tuoi commenti!

Sezione 2. Capitolo 3
Switch to desktopCambia al desktop per esercitarti nel mondo realeContinua da dove ti trovi utilizzando una delle opzioni seguenti
Siamo spiacenti che qualcosa sia andato storto. Cosa è successo?
some-alt