feat(fetch): include page title in extracted content by Christian-Sidak · Pull Request #3739 · modelcontextprotocol/servers

Christian-Sidak · 2026-03-28T05:07:18Z

Summary

When the fetch server extracts content from HTML pages, readabilipy already parses the <title> tag, but the title was being discarded. This means fetched pages lose important context about what the page is.

This PR prepends the page title as a markdown # heading when present:

# What's new in 2.1.0 (Aug 30, 2023)

Content of the page...

Handles missing, null, and whitespace-only titles gracefully (no heading prepended)
3 lines of code change in extract_content_from_html()
Both the fetch tool and the get-page prompt benefit automatically since they share the same extraction function

Fixes #2472

Test plan

3 new unit tests: title present, title missing, whitespace-only title
Tests mock readabilipy to avoid Node.js dependency in CI
All new tests pass

The readabilipy library already extracts the HTML page title, but it was being discarded. Now the title is prepended as a markdown H1 heading when present, giving consumers useful context about the page. Handles missing, null, and whitespace-only titles gracefully. Fixes modelcontextprotocol#2472

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(fetch): include page title in extracted content#3739

feat(fetch): include page title in extracted content#3739
Christian-Sidak wants to merge 1 commit intomodelcontextprotocol:mainfrom
Christian-Sidak:feat-fetch-include-page-title

Christian-Sidak commented Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Christian-Sidak commented Mar 28, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants