Skip to content

TUDMLab/MakeRevisionsUnderstandable

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Make Revisions Understandable - A Survey of Edit Intentions, Methods, and Applications

Awesome PRs Welcome

Text revision is a core process in document creation, capturing how authors iteratively refine, reorganize, and improve written content. With the increasing availability of large-scale revision histories from platforms such as Wikipedia and arXiv, NLP research has begun to move beyond modeling what changes are made to understanding why they are made, i.e., the underlying edit intentions. To our knowledge, this is the first survey that synthesizes text revision research through the lens of edit intentions, providing a unified view of datasets, taxonomies, identification methods, and applications. We review prior work across the full revision workflow, including revision corpus construction, edit intention taxonomy design, and edit intention identification. We further categorize representative datasets and methods, summarize downstream applications such as writing assistance and document edit summarization, and highlight key open research directions.

Important

Good news! 🎉 Our survey paper has been successfully accepted by Findings of ACL 2026. 🔥🔥🔥

A curated collection of papers and resources on edit intentions behind text revisions.

Please refer to our survey "Make Revisions Understandable - A Survey of Edit Intentions, Methods, and Applications" for the detailed contents.

Please let us know if you discover any mistakes or have suggestions by emailing us: fangping.lan@temple.edu

Table of Contents

Taxonomy

MoE LLMs Taxonomy

Paper List

Languages

Multilingual

Granularities

Words/phrases-level

Sentence-level

Datasets

Sentence-level

Document-level

Multi-level, multi-domain

Multi-label for one revision

Edits with other features

Including layout

Description of edits

Response

Sentence Segmentation

Revised Content Alignment

Document(Page) alignment

Paragraph alignment algorithm based on Jaccard similarity

Sentence alignment is not necessarily one-to-one. It can also be one-to-many (Consolidation) and many-to-one (Distribution).

Sentence Alignment:

Word Alignment

Edit Extraction

Revision Classification

Binary Classification

Multi-class classification

Random Forest classifier

BERT

RoBERTa-based classifier

PURE model

Sequence labeling

Multi-label classification

Generative model

T5 model

Llama2-70B with ICL and CoT

Applications

Interactive text writing

Iterative text rewriting

Student argumentative writings assistance system

Text simplification

document edit summarization

Fact-facued Sentence modification

Citation


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors