NLBSE'24 Tool Competition on Issue Report Classification

Team

Gabriel Aracena, CS Student
Kyle Luster, CS Student
Fabio Marcos de Abreu Santos, PhD Machine Learning Professor

Introduction

The given challenge in this competition involved constructing and evaluating a set of multi-class classification models tailored for the purpose of identifying which category a given issue report belonged in.

Participants were provided with a dataset comprised of 3,000 labeled issue reports sourced from five authentic open-source projects, spanning bugs, enhancements, and questions. Leveraging this dataset, participants were tasked creating a novel method of classification to compare the results from their model with a provided baseline.

The multi-class classification models were tuned and evaluated using the provided training and test sets.

Note: Please refer to the tool competition page to read more about the tool competition.

Dataset

Our dataset is comprised of 3,000 genuine issue reports extracted for this task.

Each issue report includes the following essential components:

Repository
Label
Title
Body

Every issue is labeled with a single class denoting its type, specifically categorized as bug, feature, or question.

Notably, issues bearing multiple labels have been omitted from our dataset to ensure clarity and consistency.

Subsequently, the dataset is partitioned into a training set (50%) and a test set (50%).

Note: The dataset has been curated by the competition's committee, ensuring its authenticity and relevance to the competition.

Training

We employed the OpenAI API to fine-tune the GPT-3.5-turbo model for each repository. In total, five models were fine-tuned and evaluated against their corresponding testing datasets.

Reproducibility

To reproduce the results one will need to create an OpenAI API key. If you don't have an OpenAI account, you can create one for free. Then, go to https://platform.openai.com/docs/overview on the left side select "API Keys". Then click on "+ Create new secret key" and enter a name. After the key is created you can paste it in jupyter notebook where it says "open-ai-key" on cells 11 and 47.

Note: As jupyter notebooks saves the outputs and results it is not necessary to create an API key to view our results.

Instructions for the Jupyter Notebook:

Kindly adhere to the guidance provided within the Markdown cells of the Jupyter Notebook. Avoid executing "Run all cells" as many cells rely on API calls that necessitate some processing time, even after execution.

Dependencies and Requirements

To run the code successfully, ensure you have the following dependencies installed:

Python 3.x
OpenAI API Key
Required Python libraries (specified in requirements.txt)

Error Handling and Troubleshooting

If you encounter any errors or issues while running the code, consider the following troubleshooting tips:

Check that all dependencies are installed correctly and up-to-date.
Ensure you have provided a valid API key for the OpenAI API.
Verify that the input data is correctly formatted and contains the necessary information.
Refer to the documentation and code comments for additional guidance on specific functions or methods.

Note: If you are unable to resolve the issue, feel free to reach out to the team for assistance.

Results

[Open AI API]

Repository	Label	Precision	Recall	F1
facebook/react	bug	0.8333	0.9500	0.8878
facebook/react	feature	0.8557	0.8900	0.8725
facebook/react	question	0.9024	0.7400	0.8132
facebook/react	average	0.8635	0.8600	0.8579

tensorflow/tensorflow	bug	0.9072	0.8800	0.8934
tensorflow/tensorflow	feature	0.9318	0.8200	0.8723
tensorflow/tensorflow	question	0.7913	0.9100	0.8465
tensorflow/tensorflow	average	0.8768	0.8700	0.8708

microsoft/vscode	bug	0.8511	0.8000	0.8247
microsoft/vscode	feature	0.8131	0.8700	0.8406
microsoft/vscode	question	0.7980	0.7900	0.7938
microsoft/vscode	average	0.8207	0.8200	0.8198

bitcoin/bitcoin	bug	0.7339	0.8000	0.7656
bitcoin/bitcoin	feature	0.8318	0.8900	0.8599
bitcoin/bitcoin	question	0.7381	0.6200	0.6739
bitcoin/bitcoin	average	0.7679	0.7700	0.7665

opencv/opencv	bug	0.7288	0.8600	0.7890
opencv/opencv	feature	0.9091	0.8000	0.8511
opencv/opencv	question	0.8617	0.8100	0.8351
opencv/opencv	average	0.8332	0.8233	0.8250

overall	bug	0.8109	0.8580	0.8321
overall	feature	0.8683	0.8540	0.8593
overall	question	0.8183	0.7740	0.7925
overall	average	0.8324	0.8287	0.8280

Baselines

[SetFit]

Repository	Label	Precision	Recall	F1
facebook/react	bug	0.9048	0.9500	0.9268
facebook/react	feature	0.8491	0.9000	0.8738
facebook/react	question	0.8652	0.7700	0.8148
facebook/react	average	0.8729	0.8733	0.8718

tensorflow/tensorflow	bug	0.9565	0.8800	0.9167
tensorflow/tensorflow	feature	0.8558	0.8900	0.8725
tensorflow/tensorflow	question	0.7885	0.8200	0.8039
tensorflow/tensorflow	average	0.8669	0.8633	0.8644

microsoft/vscode	bug	0.8485	0.8400	0.8442
microsoft/vscode	feature	0.7627	0.9000	0.8257
microsoft/vscode	question	0.8916	0.7400	0.8087
microsoft/vscode	average	0.8343	0.8267	0.8262

bitcoin/bitcoin	bug	0.7604	0.7300	0.7449
bitcoin/bitcoin	feature	0.8723	0.8200	0.8454
bitcoin/bitcoin	question	0.6455	0.7100	0.6762
bitcoin/bitcoin	average	0.7594	0.7533	0.7555

opencv/opencv	bug	0.7619	0.8000	0.7805
opencv/opencv	feature	0.8842	0.8400	0.8615
opencv/opencv	question	0.8100	0.8100	0.8100
opencv/opencv	average	0.8187	0.8167	0.8173

overall	bug	0.8464	0.8400	0.8426
overall	feature	0.8448	0.8700	0.8558
overall	question	0.8001	0.7700	0.7827
overall	average	0.8305	0.8267	0.8270

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
data		data
metrics		metrics
outputs		outputs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
issueclassificationgpt.ipynb		issueclassificationgpt.ipynb
issueclassificationgptextension.ipynb		issueclassificationgptextension.ipynb
niche_dataset.jsonl		niche_dataset.jsonl
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLBSE'24 Tool Competition on Issue Report Classification

Team

Introduction

Dataset

Training

Reproducibility

Dependencies and Requirements

Error Handling and Troubleshooting

Results

[Open AI API]

Baselines

[SetFit]

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NLBSE'24 Tool Competition on Issue Report Classification

Team

Introduction

Dataset

Training

Reproducibility

Dependencies and Requirements

Error Handling and Troubleshooting

Results

[Open AI API]

Baselines

[SetFit]

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages