Python Programming Chatbot

Welcome to the Python Programming Chatbot project repository! This project is a robust and intelligent chatbot designed to assist Python developers and learners with real-time solutions to programming challenges. By leveraging cutting-edge natural language processing models, the chatbot delivers accurate, context-aware, and actionable responses to coding queries.

1. Project Overview

The Python Programming Chatbot is developed to assist users in solving Python-related queries. It is fine-tuned on Salesforce's codegen-350M-multi model and uses a custom dataset of real-world Python challenges. This chatbot can be deployed in educational platforms, developer tools, and coding assistants.

2. Features

Interactive Python Programming Assistance: Responds to Python programming queries with tailored solutions.
Real-World Problem Solving: Handles real-world coding scenarios, including debugging, optimization, and scripting.
Developer-Friendly Interface: Seamless integration for developers needing real-time coding support.
Scalable Backend: Built using Flask for API development and MongoDB for chat history storage.

3. Technologies Used

Programming Languages

Python

Libraries and Frameworks

Flask
Hugging Face Transformers
Pandas
NumPy
Scikit-learn

Machine Learning Models

Base Model: Salesforce's codegen-350M-multi
Fine-tuned Model: Optimized for Python coding dialogues

Database

MongoDB (For chat history storage)

4. Dataset

The custom dataset for this project includes Python programming challenges in a question-answer format.

Dataset Structure

Instruction: Describes the task or query.
Input: Provides additional context.
Output: Contains the expected Python code solution.

Preprocessing Steps

Combined Instruction and Input into a single dialogue format:
```
User: [Instruction + Input]  
Chatbot: [Output]  
```
Split into 80% training and 20% evaluation subsets.
Converted to Hugging Face Dataset format.

5. Model Training and Fine-Tuning

The chatbot model was fine-tuned using the Hugging Face Trainer API.

Training Parameters

Batch Size: 4
Learning Rate: 5e-5
Epochs: 3
Gradient Accumulation Steps: 8

Code Example

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    tokenizer=tokenizer,
    data_collator=data_collator,
)
trainer.train()

6. Applications

Personalized Learning Platforms

Enhances e-learning by providing detailed programming solutions and personalized guidance for Python learners.

Developer Support Systems

Assists developers with debugging, error resolution, and best practice suggestions in real-time.

Automated Coding Assistants

Boosts productivity by offering quick responses to Python coding challenges, saving time on searching and troubleshooting.

7. Project Structure

├── dataset/
│   ├── train.json
│   ├── eval.json
├── model/
│   ├── fine_tuned_model/
│   ├── tokenizer/
├── api/
│   ├── app.py
│   ├── requirements.txt
├── README.md

dataset/: Contains training and evaluation datasets.
model/: Stores the fine-tuned model and tokenizer.
api/: Flask-based API files for chatbot interaction.

8. Installation and Usage

Prerequisites

Python 3.8 or higher
MongoDB

Steps

Clone the Repository

git clone https://github.com/your-username/python-chatbot.git
cd python-chatbot

Install Dependencies
```
pip install -r api/requirements.txt
```
Set Up the Database
- Install and configure MongoDB.
- Update the connection string in app.py.
Run the Flask App
```
python api/app.py
```
Interact with the Chatbot
- Use a REST client like Postman to send queries to the chatbot API.

9. Future Work

Expand dataset to include more programming languages.
Implement a web-based front end for easier user interaction.
Enhance model capabilities for handling advanced coding tasks.

10. Contributing

We welcome contributions!

Fork the repository.
Create a new branch:
```
git checkout -b feature-name
```
Make changes and commit:
```
git commit -m "Add feature-name"
```
Push the branch and open a pull request.

11. License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
__pycache__		__pycache__
data		data
models		models
src		src
templates		templates
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
app.py		app.py
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
tailwind.config.js		tailwind.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Python Programming Chatbot

Table of Contents

1. Project Overview

2. Features

3. Technologies Used

Programming Languages

Libraries and Frameworks

Machine Learning Models

Database

4. Dataset

Dataset Structure

Preprocessing Steps

5. Model Training and Fine-Tuning

Training Parameters

Code Example

6. Applications

Personalized Learning Platforms

Developer Support Systems

Automated Coding Assistants

7. Project Structure

8. Installation and Usage

Prerequisites

Steps

9. Future Work

10. Contributing

11. License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Python Programming Chatbot

Table of Contents

1. Project Overview

2. Features

3. Technologies Used

Programming Languages

Libraries and Frameworks

Machine Learning Models

Database

4. Dataset

Dataset Structure

Preprocessing Steps

5. Model Training and Fine-Tuning

Training Parameters

Code Example

6. Applications

Personalized Learning Platforms

Developer Support Systems

Automated Coding Assistants

7. Project Structure

8. Installation and Usage

Prerequisites

Steps

9. Future Work

10. Contributing

11. License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages