Skip to content

The ultimate modernization for the english language - makes it finally possible to read in english without getting a massive headache every time... how? a fully simplified phonetic spelling for every single word used in modern times! And a bonus script to convert words from ancient english into modernized phonetic english

Notifications You must be signed in to change notification settings

mikeNickaloff/EnglishModernization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EnglishModernization

Tools for simplifying traditional English spellings using CMU pronunciations or direct dictionary lookups. Includes:

  • A browser-based dictionary translator (eng_to_us.html) that uses american.txt only.
  • The prior CMU-driven browser translator, preserved as eng_to_us_2.html.
  • A CLI modernizer (modernize.py) that reads stdin and writes simplified text (CMU-based).

Repo: https://github.com/mikeNickaloff/EnglishModernization.git

Modernized American Language

Tools for simplafying tradishenul English spellings using CMU prohnunseyashuns or direct dictshunery lookups. Includes:

  • A browser-based dictshunery translayta (eng_to_us.html) that usis american.txt only.
  • Tha prior CMU-driven browsa translayta, preserved as eng_to_us_2.html.
  • A CLI modernizer (modernize.py) that reads stdin and writes simplafyed text (CMU-based).

Quick Start

  1. Clone
git clone https://github.com/mikeNickaloff/EnglishModernization.git
cd EnglishModernization
  1. Ensure Python 3 is installed
  1. Get CMU dictionary
    CMU.txt is included. If you replace it, keep CMU formatting: WORD PH1 PH2 ....

Run the Web Translator (self-hosted)

You need a small HTTP server so the page can fetch the dictionaries (american.txt for eng_to_us.html, CMU.txt for eng_to_us_2.html).

Usage: Type or paste text; the translated version appears live. Unknown words stay unchanged.

Run the CLI Modernizer

Transform text via stdin → stdout using CMU pronunciations.

  • Linux/macOS (bash):

    echo "The neighborhood brought weighty sacks through the thorough fair." \
      | python3 modernize.py --dict CMU.txt
  • Windows (PowerShell):

    "The neighborhood brought weighty sacks through the thorough fair." `
      | python modernize.py --dict CMU.txt

Convert a file to a new file (bash):

python3 modernize.py --dict CMU.txt < input.txt > output.txt

Convert a file (PowerShell):

Get-Content input.txt | python modernize.py --dict CMU.txt | Set-Content output.txt

Why a US-focused spelling?

Simplified, phoneme-driven spellings reduce ambiguity and better reflect American pronunciation (e.g., weight → wate, through → throo, sacks → sax). Using CMU ensures consistent phoneme inputs, yielding predictable modernized outputs for education, readability, and tooling that targets US English norms.

Files

  • eng_to_us.html: Browser translator using american.txt (dictionary replacements only).
  • eng_to_us_2.html: Browser translator using CMU.txt (rule-based phoneme flow).
  • modernize.py: CLI modernizer (stdin → stdout).
  • CMU.txt: CMU Pronouncing Dictionary (primary dependency for CMU-based flows).
  • american.txt: Generated american dictionary from CMU.txt using modernize.py.
  • CMU-PHONES.txt, IPA.txt: Reference lists.

Credits

About

The ultimate modernization for the english language - makes it finally possible to read in english without getting a massive headache every time... how? a fully simplified phonetic spelling for every single word used in modern times! And a bonus script to convert words from ancient english into modernized phonetic english

Resources

Stars

Watchers

Forks

Packages

No packages published