Learn Analyzing Page Titles and Headings | On-Page and Technical SEO Analysis with Python

Swipe to show menu

Page titles and headings are critical elements for SEO because they help search engines and users understand the content and structure of a page. Well-crafted titles and headings improve click-through rates, clarify page relevance, and support accessible navigation. However, manually checking these elements across many pages is time-consuming. By using Python, you can automate the extraction and evaluation of page titles and headings, quickly identifying issues such as missing or duplicate tags that may impact SEO performance.


              12345678910111213141516171819202122232425
            
from bs4 import BeautifulSoup

html_content = """
<html>
  <head>
    <title>Best SEO Tools for 2024</title>
  </head>
  <body>
    <h1>Best SEO Tools for 2024</h1>
    <h1>Top Picks for Marketers</h1>
    <p>Discover the leading SEO tools to boost your website ranking.</p>
  </body>
</html>
"""

soup = BeautifulSoup(html_content, "html.parser")

# Extract all <title> tags
titles = [tag.get_text() for tag in soup.find_all("title")]

# Extract all <h1> tags
h1_headings = [tag.get_text() for tag in soup.find_all("h1")]

print("Titles:", titles)
print("H1 Headings:", h1_headings)

After extracting the titles and headings using BeautifulSoup, you can analyze them for SEO best practices. For example, you should check if there are multiple title or h1 tags, which can confuse search engines. Ideally, each page should have a single, descriptive title and one clear h1 heading. By parsing the extracted data, you can flag issues such as duplicates, missing tags, or overly generic titles and headings. Automating this analysis helps you maintain consistency and quickly spot problems that could affect your site's search visibility.


              12345678910111213141516171819
            
def are_title_and_h1_unique(html_content):
    soup = BeautifulSoup(html_content, "html.parser")
    titles = soup.find_all("title")
    h1_headings = soup.find_all("h1")
    return len(titles) == 1 and len(h1_headings) == 1

# Example usage
html_example = """
<html>
  <head>
    <title>SEO Guide</title>
  </head>
  <body>
    <h1>SEO Guide</h1>
  </body>
</html>
"""

print(are_title_and_h1_unique(html_example))  # Output: True

1. Why should page titles and h1 headings be unique?

2. What Python library is used for HTML parsing in this context?

3. Fill in the blank: To extract the text from a tag in BeautifulSoup, use _ _ _ .

Everything was clear?

Thanks for your feedback!

Section 3. Chapter 1

Ask AI

Ask anything or try one of the suggested questions to begin our chat

Section 3. Chapter 1