Skip to content

eleldar/Punctuation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Punctuation API

Installation

git lfs install
git clone https://github.com/eleldar/Punctuation.git
cd Punctuation
python3 -m venv venv
. venv/bin/activate
pip install -r requirements.txt
cd models
git clone https://huggingface.co/eleldar/rubert-base-cased-sentence
git clone https://huggingface.co/eleldar/repunct-model_ft repunct-model_ft/weights/ 

Usage

(venv)$ python main.py

open http://127.0.0.1:8000/docs in browser!

How it works

Before inserting raw text into model it should be tokenized. Library handle it with BaseDataset.parse_tokens

Model architecture is pretty easy and straight forward:

Links

Article on habr.ru

This repository contains code (which was edited for production purposes) from xashru/punctuation-restoration.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published