Learn Challenge: Threaded Web Scraper | Advanced Patterns and Best Practices
Python Multithreading and Multiprocessing

Challenge: Threaded Web Scraper

Imagine you are tasked with collecting information from a list of web pages, such as fetching the latest news headlines or product prices from several sites. Doing this sequentially—one after another—would be slow, especially if some pages take longer to respond. To speed up the process, you want to fetch multiple pages at the same time using threads. This approach allows you to make the most of waiting times, retrieving data from several sites concurrently and processing results as soon as they arrive.
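To make the benefit of overlapping waiting times concrete, here is a minimal sketch that replaces real network requests with time.sleep calls. The names simulated_fetch, run_sequential, and run_threaded, as well as the delay values, are illustrative assumptions and are not part of the challenge.

```python
import threading
import time


def simulated_fetch(delay):
    """Stand-in for a slow page: just wait for `delay` seconds (hypothetical helper)."""
    time.sleep(delay)


def run_sequential(delays):
    """Run the simulated fetches one after another and return the elapsed time."""
    start = time.perf_counter()
    for delay in delays:
        simulated_fetch(delay)
    return time.perf_counter() - start


def run_threaded(delays):
    """Run each simulated fetch in its own thread and return the elapsed time."""
    start = time.perf_counter()
    threads = [threading.Thread(target=simulated_fetch, args=(d,)) for d in delays]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()  # wait until every simulated fetch has finished
    return time.perf_counter() - start
```

With delays such as [0.2, 0.2, 0.2], the sequential run should take roughly the sum of the delays, while the threaded run should take roughly as long as the slowest single delay, because the waits overlap.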

Task


Your goal is to implement a threaded web scraper that fetches the content of multiple URLs concurrently and processes the results.

  • Complete the function fetch_url_content(url) so that it downloads and returns the first 100 characters of the response content from the provided url.
  • In the main() function, use the threading module to start one new thread for each URL in the urls list. Each thread calls fetch_url_content(url) and prints the result in the format "Content from {url}: {snippet}", where {url} is the URL and {snippet} is the first 100 characters returned by fetch_url_content.
  • Ensure the main thread waits for all threads to finish before exiting.
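One possible way to approach the three steps above is sketched below. It is not the official solution: the use of urllib.request, the timeout parameter, the UTF-8 decoding, the fetch_and_print helper, the error handling, and the placeholder URLs in EXAMPLE_URLS are all assumptions made for illustration.

```python
import threading
import urllib.request

# Placeholder URLs (assumption); the challenge supplies its own urls list.
EXAMPLE_URLS = ["https://example.com", "https://example.org"]


def fetch_url_content(url, timeout=10):
    """Download `url` and return the first 100 characters of the response body."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read()
    # Assumption: treat the body as UTF-8 and replace undecodable bytes.
    return body.decode("utf-8", errors="replace")[:100]


def fetch_and_print(url):
    """Thread target (hypothetical helper): fetch one URL and print its snippet."""
    try:
        snippet = fetch_url_content(url)
    except OSError as exc:  # URLError and socket timeouts are OSError subclasses
        snippet = f"<error: {exc}>"
    print(f"Content from {url}: {snippet}")


def main(urls=EXAMPLE_URLS):
    """Start one thread per URL, then wait for all of them to finish."""
    threads = [threading.Thread(target=fetch_and_print, args=(url,)) for url in urls]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()  # the main thread blocks here until each worker is done
```

Calling main() runs the scraper against the placeholder URLs. Because the threads finish in whatever order the responses arrive, the printed lines may appear in a different order than the URLs in the list.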

Section 4. Chapter 4