🛡️ AI-Powered Network Anomaly Detection using K-Means

📖 Project Overview

This project demonstrates how Unsupervised Machine Learning can be applied to Blue Team operations. By using the K-Means Clustering algorithm, we analyze network traffic to automatically establish a baseline and detect security anomalies (outliers) that could indicate malicious activity like unauthorized data transfers or scanning.

🚀 Key Features

Real-World Data: Analyzes traffic captured directly from Wireshark.
AI Implementation: Uses Scikit-learn to perform automated clustering.
Interactive Visualization: Generates scatter plots showing traffic groups and centroids.
Threat Hunting: Helps identify suspicious packets that deviate from the normal baseline.

🛠️ Step-by-Step Guide: How to Capture Data

To use this project with your own network data, follow these steps in Wireshark:

1. Start Capture

Open Wireshark and select your active interface (Wi-Fi or Ethernet).
Click the Blue Shark Fin icon to start live capturing.

2. Create a Baseline

Perform normal activities (browsing, streaming, work) for 5-10 minutes so the AI can learn what "Normal" looks like.

3. Export to CSV

Click the Red Stop Button.
Go to File > Export Packet Dissections > As CSV...
Select "All packets" and save the file as test_cap.csv in your project folder.

📊 Results & Visualization

The AI successfully groups thousands of packets into clusters. Below is the visual representation of the analysis:

Note: Isolated data points (Outliers) far from the centroids represent anomalies that a SOC Analyst must investigate.

💻 How to Run

The Python script (kmeans_script.py) automatically fetches data from your test_cap.csv file.

Prerequisites

Install the necessary Python libraries:

pip install pandas scikit-learn matplotlib

Execution
Open your terminal/CMD in the project directory and run:

Bash
python kmeans_script.py test_cap.csv



## 📊 Results & Visualization
The AI successfully groups thousands of packets into clusters. Below is the visual representation of the analysis:

![Network Analysis Graph](Analysis_graph.png)

> **Note:** Isolated data points (Outliers) far from the centroids represent anomalies that a SOC Analyst must investigate.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Analysis_graph.png		Analysis_graph.png
README.md		README.md
test_cap.csv		test_cap.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ AI-Powered Network Anomaly Detection using K-Means

📖 Project Overview

🚀 Key Features

🛠️ Step-by-Step Guide: How to Capture Data

1. Start Capture

2. Create a Baseline

3. Export to CSV

📊 Results & Visualization

💻 How to Run

Prerequisites

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

🛡️ AI-Powered Network Anomaly Detection using K-Means

📖 Project Overview

🚀 Key Features

🛠️ Step-by-Step Guide: How to Capture Data

1. Start Capture

2. Create a Baseline

3. Export to CSV

📊 Results & Visualization

💻 How to Run

Prerequisites

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages