AIR is a framework for generating complex, constraint-rich instructions, significantly improving the ability of Large Language Models (LLMs) to follow complex instructions. Our approach uses a two-stage process:
- Initial Instruction Generation: Generate base instructions from documents
- Iterative Refinement: Enhance instructions through LLM-as-judge guidance
The framework produces more challenging and realistic instructions, leading to improved model performance on complex tasks.
- Automatic Iterative Refinement: Novel approach to generate complex instructions
- Constraint-aware Generation: Instructions that better reflect real-world scenarios
- Large-scale Dataset: AIR-10K dataset with 10,000 complex instructions
- Enhanced Performance: Significant improvements over existing instruction-following methods
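The two-stage process above (generate, then iteratively refine with LLM-as-judge guidance) can be sketched as follows. All helper functions here are stubbed stand-ins, not the repo's actual API; the real pipeline drives LLM calls through the scripts shown below.

```python
# Minimal sketch of AIR's generate-then-refine loop. The helpers are toy
# stubs standing in for real LLM calls (hypothetical names, not repo code).

def generate_instruction(document: str) -> str:
    # Stage 1: draft a base instruction from a source document (stubbed).
    return f"Summarize the following document: {document[:30]}"

def judge(instruction: str) -> tuple[int, str]:
    # LLM-as-judge: score the instruction and suggest a missing constraint.
    # Here the "score" is just the number of constraints already attached.
    n_constraints = instruction.count(";")
    return n_constraints, "limit the answer to 100 words"

def refine(instruction: str, feedback: str) -> str:
    # Stage 2: fold the judge's feedback in as an explicit constraint.
    return f"{instruction}; {feedback}"

def air_refine(document: str, max_iters: int = 5, target_score: int = 3) -> str:
    instruction = generate_instruction(document)
    for _ in range(max_iters):  # the pipeline caps refinement at 5 rounds
        score, feedback = judge(instruction)
        if score >= target_score:
            break
        instruction = refine(instruction, feedback)
    return instruction
```

With these stubs, each round appends one constraint until the judge's target is met, mirroring the bounded refinement loop run by `./judge_data_gene/run_main.sh`.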
```bash
git clone https://github.com/WeiLiuAH/AIR-Automatic-Iterative-Refinement
cd AIR
pip install -r requirements.txt
```
```bash
# Download dataset chunks
huggingface-cli download --repo-type dataset --local-dir-use-symlinks False \
    emozilla/dolma-v1_7-cc_en_head --local-dir ./data/dolma \
    --include "*{000,001,002}_00000.parquet*"

# Convert data format
python ./init_process/data_acquire.py \
    --input_path ./data/dolma \
    --output_path ./data/dolma.jsonl
```
```bash
# Generate embeddings
python ./init_process/embeds_gene.py \
    --input_path ./data/dolma.jsonl \
    --output_path ./data/doc_embeds.jsonl
```
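Conceptually, this step maps each JSONL document to an embedding record. A minimal sketch of that shape, with `embed` as a toy stand-in (the real script calls an actual embedding model, and the field names here are assumptions):

```python
# Sketch of batched embedding generation over a JSONL corpus.
import json

def embed(text: str) -> list[float]:
    # Toy stand-in: character-frequency features (NOT a real embedding model).
    vec = [0.0] * 26
    for ch in text.lower():
        if "a" <= ch <= "z":
            vec[ord(ch) - ord("a")] += 1.0
    return vec

def embed_corpus(input_path: str, output_path: str) -> None:
    # One embedding record per input document, written back as JSONL.
    with open(input_path) as fin, open(output_path, "w") as fout:
        for line in fin:
            doc = json.loads(line)
            record = {"id": doc.get("id"), "embedding": embed(doc.get("text", ""))}
            fout.write(json.dumps(record) + "\n")
```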
```bash
# Select diverse documents
python ./init_process/select_diverse_based_doc_embeds.py \
    --input_path ./data/dolma.jsonl \
    --embedding_path ./data/doc_embeds.jsonl \
    --output_path ./data/dolma_60k.jsonl
```
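One common way to pick a diverse subset from embeddings, as this step does, is greedy farthest-point (max-min) selection. This is an illustrative sketch, not necessarily the criterion the repo's script uses:

```python
# Greedy farthest-point selection: repeatedly pick the document farthest
# (in cosine distance) from everything already selected. Assumes nonzero
# embedding vectors.
import math

def cosine_distance(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (na * nb + 1e-12)

def select_diverse(embeds: list[list[float]], k: int) -> list[int]:
    selected = [0]  # seed with the first document
    # distance of every candidate to its nearest selected point
    dists = [cosine_distance(embeds[0], e) for e in embeds]
    while len(selected) < k:
        nxt = max(range(len(embeds)), key=lambda i: dists[i])
        selected.append(nxt)
        for i, e in enumerate(embeds):
            dists[i] = min(dists[i], cosine_distance(embeds[nxt], e))
    return selected
```

Each round adds the candidate whose nearest selected neighbor is farthest away, so near-duplicate documents are skipped in favor of coverage.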
```bash
# Generate initial instructions
CUDA_VISIBLE_DEVICES=0,1,2,3 python ./init_process/instruct_generate.py \
    -i ./data/dolma_60k.jsonl \
    -o ./data/dolma_init_process.jsonl \
    -m /path/llama3_70b_instruct
```
```bash
# Filter and score instructions
python ./init_process/instruct_score_filter.py \
    --input_path ./data/dolma_init_process.jsonl \
    --output_path ./data/dolma_init_process.jsonl
```
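The filtering step boils down to keeping only instructions whose judge score clears a threshold. A sketch of that idea over JSONL records (the field name `score` and the threshold are assumptions, not the script's actual schema):

```python
# Score-threshold filtering over JSONL records (hypothetical schema).
import json

def filter_by_score(records: list[dict], min_score: float = 7.0) -> list[dict]:
    # Keep records whose judge score meets the threshold; missing scores drop.
    return [r for r in records if r.get("score", 0.0) >= min_score]

def filter_jsonl(input_path: str, output_path: str, min_score: float = 7.0) -> int:
    # Read, filter, rewrite; returns how many records survived.
    with open(input_path) as fin:
        records = [json.loads(line) for line in fin]
    kept = filter_by_score(records, min_score)
    with open(output_path, "w") as fout:
        for r in kept:
            fout.write(json.dumps(r) + "\n")
    return len(kept)
```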
```bash
# Generate judge data (max 5 iterations)
bash ./judge_data_gene/run_main.sh
```
```bash
# Process for SFT training
bash ./judge_data_process/data_process.sh
```

Judge models used for refinement:
- Meta-Llama-3-70B-Instruct (for Llama series)
- Qwen2.5-72B-Instruct (for Qwen series)

Download our processed dataset directly from Hugging Face:

We support training using LlamaFactory with the following models:
- Llama-3-8B-UltraChat
- Qwen-2.5-7B-UltraChat (Custom fine-tuned version)
- Llama-3-8B-Tulu
If you find this work helpful, please cite our paper:
```bibtex
@article{air2025,
  title={AIR: Complex Instruction Generation via Automatic Iterative Refinement},
  author={Wei Liu and Yancheng He and Hui Huang and Chengwei Hu and Jiaheng Liu and Shilong Li and Wenbo Su and Bo Zheng},
  journal={arXiv preprint arXiv:2502.17787},
  year={2025}
}
```

- LlamaFactory - For providing the training framework
- Dolma Dataset - For the base dataset
