# distilgpt2-text-generation

Fine-tunes DistilGPT-2 on a conversation dataset and compares base vs. fine-tuned text generation.
## Overview

This project fine-tunes a DistilGPT-2 language model on the HuggingFace UltraFeedback dataset and compares text generation between the original base model and the fine-tuned version. The script first trains the model for 3 epochs on 10,000 prompts, saves the checkpoint, then reloads both models to ensure a fair comparison. Task 1 generates 5 alternative completions per prompt with both models side by side. Task 2 uses a HuggingFace pipeline with the fine-tuned model to produce ranked completions.
## Requirements

- Python 3.8+
- transformers >= 4.30
- datasets >= 2.12
- tensorflow >= 2.12
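A `requirements.txt` consistent with the versions above might look like this (minimum-version constraints, not exact pinned releases):

```
transformers>=4.30
datasets>=2.12
tensorflow>=2.12
```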
## Installation

```bash
pip install -r requirements.txt
```

## Project structure

```
distilgpt2-text-generation/
├── generate_text.py    # Fine-tuning and text generation script
├── requirements.txt    # Python dependencies
├── .gitignore
└── README.md
```
## Usage

```bash
python generate_text.py
```

Steps performed:
- Downloads the DistilGPT-2 model and the UltraFeedback dataset from HuggingFace.
- Fine-tunes the model for 3 epochs on 10 000 conversation prompts.
- Reloads a fresh base model and the saved fine-tuned checkpoint.
- Task 1: generates 5 completions per prompt for both base and fine-tuned models; prints comparison dictionary to stdout.
- Task 2: runs a text-generation pipeline with the fine-tuned model on 3 prompts; prints ranked completions to stdout.
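The side-by-side comparison in Task 1 can be sketched as below. `generate_completions` is a hypothetical stand-in for the real tokenize → `model.generate` → decode round trip through the base and fine-tuned DistilGPT-2 models; only the structure of the comparison dictionary is taken from this README.

```python
# Sketch of Task 1's comparison loop (stub model calls, real dict shape).

def generate_completions(model_name, prompt, n=5):
    """Stub: the real script samples n completions from the named model."""
    return [f"[{model_name}] completion {i} for: {prompt}" for i in range(1, n + 1)]

def compare_models(prompts, n=5):
    """Build the nested dictionary printed by Task 1:
    {prompt: {"not_tuned": [n completions], "tuned": [n completions]}}."""
    return {
        prompt: {
            "not_tuned": generate_completions("base", prompt, n),
            "tuned": generate_completions("fine_tuned", prompt, n),
        }
        for prompt in prompts
    }

if __name__ == "__main__":
    for prompt, outputs in compare_models(["How do I learn Python?"]).items():
        print(prompt, "->", len(outputs["not_tuned"]), "base /",
              len(outputs["tuned"]), "tuned completions")
```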
## Output

All output is printed to stdout. Task 1 produces a nested dictionary keyed by prompt, with `not_tuned` and `tuned` sub-dictionaries each containing 5 generated completions. Task 2 produces ranked completion lists per prompt. The fine-tuned model reflects the conversational style present in the UltraFeedback training data.
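Task 2's ranked lists could be assembled from the pipeline's return value roughly as follows. A HuggingFace `text-generation` pipeline called with `num_return_sequences=5` returns a list of `{"generated_text": ...}` dicts per prompt; the ranking key shown here (completion length) is an assumed placeholder, since this README does not state how the script scores completions.

```python
def rank_completions(pipeline_outputs):
    # pipeline_outputs: list of {"generated_text": str} dicts, the shape a
    # transformers text-generation pipeline returns for a single prompt.
    texts = [out["generated_text"] for out in pipeline_outputs]
    # Assumed ranking: longest completion first. The real script may rank
    # by a different criterion (e.g. sequence log-probability).
    return sorted(texts, key=len, reverse=True)

if __name__ == "__main__":
    sample = [
        {"generated_text": "A short reply."},
        {"generated_text": "A considerably longer, more detailed reply."},
    ]
    print(rank_completions(sample))
```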
## Author

Biswajeet Sahoo

## License

MIT License