Tools for simplifying traditional English spellings using CMU pronunciations or direct dictionary lookups. Includes:
- A browser-based dictionary translator (
eng_to_us.html) that usesamerican.txtonly. - The prior CMU-driven browser translator, preserved as
eng_to_us_2.html. - A CLI modernizer (
modernize.py) that reads stdin and writes simplified text (CMU-based).
Repo: https://github.com/mikeNickaloff/EnglishModernization.git
Tools for simplafying tradishenul English spellings using CMU prohnunseyashuns or direct dictshunery lookups. Includes:
- A browser-based dictshunery translayta (eng_to_us.html) that usis american.txt only.
- Tha prior CMU-driven browsa translayta, preserved as eng_to_us_2.html.
- A CLI modernizer (modernize.py) that reads stdin and writes simplafyed text (CMU-based).
- Clone
git clone https://github.com/mikeNickaloff/EnglishModernization.git
cd EnglishModernization- Ensure Python 3 is installed
- Linux/macOS:
python3 --version - Windows: install from https://www.python.org/downloads/ and ensure
python(orpython3) is on PATH.
- Get CMU dictionary
CMU.txtis included. If you replace it, keep CMU formatting:WORD PH1 PH2 ....
You need a small HTTP server so the page can fetch the dictionaries (american.txt for eng_to_us.html, CMU.txt for eng_to_us_2.html).
-
Linux/macOS:
python3 -m http.server 8000
Then open:
- http://localhost:8000/eng_to_us.html for the dictionary-only translator (american.txt)
- http://localhost:8000/eng_to_us_2.html for the CMU-based translator
-
Windows (PowerShell):
python -m http.server 8000
Then open:
Usage: Type or paste text; the translated version appears live. Unknown words stay unchanged.
Transform text via stdin → stdout using CMU pronunciations.
-
Linux/macOS (bash):
echo "The neighborhood brought weighty sacks through the thorough fair." \ | python3 modernize.py --dict CMU.txt
-
Windows (PowerShell):
"The neighborhood brought weighty sacks through the thorough fair." ` | python modernize.py --dict CMU.txt
Convert a file to a new file (bash):
python3 modernize.py --dict CMU.txt < input.txt > output.txtConvert a file (PowerShell):
Get-Content input.txt | python modernize.py --dict CMU.txt | Set-Content output.txtSimplified, phoneme-driven spellings reduce ambiguity and better reflect American pronunciation (e.g., weight → wate, through → throo, sacks → sax). Using CMU ensures consistent phoneme inputs, yielding predictable modernized outputs for education, readability, and tooling that targets US English norms.
eng_to_us.html: Browser translator usingamerican.txt(dictionary replacements only).eng_to_us_2.html: Browser translator usingCMU.txt(rule-based phoneme flow).modernize.py: CLI modernizer (stdin → stdout).CMU.txt: CMU Pronouncing Dictionary (primary dependency for CMU-based flows).american.txt: Generated american dictionary from CMU.txt using modernize.py.CMU-PHONES.txt,IPA.txt: Reference lists.
- Project: EnglishModernization — https://github.com/mikeNickaloff/EnglishModernization.git
- Pronunciations: CMU Pronouncing Dictionary.