Skip to content

albertomateus9/web-server-log-analyzer

Repository files navigation

Web Server Log Analyzer

Regex-based web-log parser for synthetic access logs, route counts, status codes, and hourly traffic.

Educational project inspired by EETEPA Vilhena Alves. It is not an official institutional system and does not use real student data.

Overview

Curricular code: R-12
Course: Computer Networks
Discipline: Network Programming
Difficulty: Intermediate

This repository is a runnable Python MVP for portfolio and classroom practice. The default command uses only safe sample data committed to data/sample/.

Concepts Practiced

  • regular expressions
  • log parsing
  • status codes
  • traffic peaks
  • security observation

Repository Structure

data/
  sample/       # safe synthetic samples
  raw/          # optional external files, ignored except .gitkeep
  processed/    # generated outputs, ignored except .gitkeep
notebooks/
  01_exploracao.ipynb
scripts/
  download_data.py
src/
  main.py
charts/
reports/

Quick Start

python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
python -m src.main --sample

Linux/macOS activation:

source .venv/bin/activate

Safe Operations

  • The CI workflow never scans live networks, scrapes sites, runs speed tests, sends packets, or calls external APIs.
  • Real-world data collection, when applicable, must be performed only in authorized lab environments.
  • Generated outputs are written to data/processed/, charts/, or reports/.

Web Server Log Analyzer

Regex-based web-log parser for synthetic access logs, route counts, status codes, and hourly traffic.

Projeto educacional inspirado na EETEPA Vilhena Alves. Nao e sistema oficial institucional e nao usa dados reais de estudantes.

Visao Geral

Codigo curricular: R-12
Curso: Tecnico em Redes de Computadores
Disciplina: Programacao para Ambientes de Redes
Dificuldade: Intermediario

Este repositorio e um MVP Python executavel para portfolio e pratica em sala. O comando padrao usa apenas dados de amostra seguros em data/sample/.

Conceitos Praticados

  • regular expressions
  • log parsing
  • status codes
  • traffic peaks
  • security observation

Como Rodar

python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
python -m src.main --sample

Operacao Segura

  • O CI nunca varre redes reais, faz scraping, executa speed tests, envia pacotes ou chama APIs externas.
  • Coleta real de dados, quando aplicavel, deve ocorrer apenas em ambientes de laboratorio autorizados.
  • Saidas geradas ficam em data/processed/, charts/ ou reports/.

License

MIT. See LICENSE.

About

Regex-based web-log parser for synthetic access logs, route counts, status codes, and hourly traffic.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors