Skip to content

feat: add HTML loader to IngestAgent using html.parser#5

Open
Aryaneviloo wants to merge 1 commit into
aimanmalib:mainfrom
Aryaneviloo:feat/html-loader
Open

feat: add HTML loader to IngestAgent using html.parser#5
Aryaneviloo wants to merge 1 commit into
aimanmalib:mainfrom
Aryaneviloo:feat/html-loader

Conversation

@Aryaneviloo

Copy link
Copy Markdown

Addresses #4. Added a lightweight, dependency-free HTML branch to IngestAgent using the standard library html.parser. The parser surgically strips noise (<script>, <style>) while converting

-

tags into Markdown equivalents (#)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant