Skip to content

eceozturk00/smart-document-analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Smart Document Analyzer

A Python tool that analyzes Word (.docx) and PDF (.pdf) documents, extracts keywords, and exports results to Excel and JSON.

Features

  • Supports DOCX and PDF
  • Detects headings/sections (best-effort)
  • Extracts top keywords using basic NLP
  • Exports:
    • output.xlsx (structured table)
    • output.json (chunks + stats + keywords)

Install

python analyzer.py --input input.docx --excel output.xlsx --json output.json

About

Analyze Word/PDF documents, extract keywords, and export results to Excel/JSON.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages