In this project, I fine-tuned the Flan-T5 model with the LoRA approach to summarize dialogues from the DialogSum dataset. Next, using the toxicity evaluator "facebook/roberta-hate-speech-dynabench-r4-target", I computed a reward for each summary generated by the fine-tuned model. In the last step, I used the PPO reinforcement-learning approach to update the model weights, restricted to the small set of parameters defined by LoRA.
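A minimal sketch of the reward step described above. It assumes the hate-speech classifier returns two logits ordered as ("nothate", "hate"); in the actual pipeline these logits would come from the "facebook/roberta-hate-speech-dynabench-r4-target" model, but here they are stubbed so the conversion to a PPO reward is visible on its own.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def toxicity_reward(logits):
    """Turn the classifier's two logits ("nothate", "hate")
    into a scalar reward for PPO: a common choice is the raw
    "nothate" logit, so less toxic text earns a higher reward."""
    return logits[0]

# Stubbed classifier output for one generated summary
# (assumed values, not from the real model):
logits = [2.0, -1.0]
probs = softmax(logits)      # probability of ("nothate", "hate")
reward = toxicity_reward(logits)
```

In the full pipeline this scalar would be passed per sample to the PPO trainer so the LoRA parameters are nudged toward less toxic summaries.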
azmozaffari/Text_RLHF