gucordeiro26/portfolio

Portfolio | Backend, Big Data & BI

Welcome! This repository is a curated collection of my projects as a full-stack developer focused on data engineering and backend infrastructure.

I am a 20-year-old developer based in Brazil, specializing in building scalable pipelines, relational modeling in third normal form (3NF), and transforming raw data into business intelligence.


Featured Projects

Melbourne Housing - End-to-End ETL Pipeline

Tech Stack: PySpark, Databricks, Delta Lake, Python.

A robust implementation of a Medallion Architecture to process real estate data.

  • Ingestion: Handled heterogeneous sources (CSV/JSON) using PySpark.
  • Data Engineering: Implemented schema harmonization, handled corrupted records, and automated data cleaning.
  • Business Intelligence: Created a Gold layer with aggregated metrics for price-per-square-meter and regional trends.
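The layering described above can be sketched in a few lines. This is a minimal illustration only, written in plain Python rather than PySpark so it runs without a cluster; the sample records, the field names, and the `to_silver` and `to_gold` helpers are all hypothetical and are not taken from the project itself.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical raw records from two sources that name their fields differently.
csv_rows = [
    {"Suburb": "Richmond", "Price": "1200000", "BuildingArea": "150"},
    {"Suburb": "Richmond", "Price": "not available", "BuildingArea": "100"},  # corrupted
]
json_rows = [
    {"suburb": "richmond", "price": 900000, "building_area_m2": 100},
    {"suburb": "Carlton", "price": 800000, "building_area_m2": 80},
]

# Maps each source-specific field name to one shared Silver-layer name (assumed names).
FIELD_MAP = {
    "Suburb": "suburb", "Price": "price", "BuildingArea": "area_m2",
    "suburb": "suburb", "price": "price", "building_area_m2": "area_m2",
}


def to_silver(rows):
    """Harmonize field names, cast types, and drop records that fail to parse."""
    clean = []
    for row in rows:
        record = {FIELD_MAP[k]: v for k, v in row.items() if k in FIELD_MAP}
        try:
            record["suburb"] = str(record["suburb"]).strip().title()
            record["price"] = float(record["price"])
            record["area_m2"] = float(record["area_m2"])
        except (KeyError, ValueError, TypeError):
            continue  # treat unparseable rows as corrupted and skip them
        if record["area_m2"] > 0:
            clean.append(record)
    return clean


def to_gold(silver_rows):
    """Aggregate the average price per square meter for each suburb."""
    by_suburb = defaultdict(list)
    for r in silver_rows:
        by_suburb[r["suburb"]].append(r["price"] / r["area_m2"])
    return {suburb: round(mean(values), 2) for suburb, values in by_suburb.items()}


bronze = csv_rows + json_rows   # Bronze: raw data kept as ingested
silver = to_silver(bronze)      # Silver: one schema, valid rows only
gold = to_gold(silver)          # Gold: business-level aggregate
print(gold)
```

In a Spark setting the same three steps would typically map to DataFrame reads, column renames with casts, and a `groupBy` aggregation written to Delta tables.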

View Project Repository


Web Scraping & Sentiment Analysis

Tech Stack: Python, Selenium, Pandas, TextBlob, MySQL.

An automated pipeline to extract and analyze consumer sentiment from e-commerce platforms.

  • Scraping: Developed a resilient scraper with user-agent rotation to bypass anti-bot systems.
  • NLP: Applied sentiment analysis (Polarity/Subjectivity) and text normalization to unstructured reviews.
  • Relational Modeling: Designed a database schema to store processed comments, ready for BI dashboard consumption.
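As a rough sketch of the normalization and storage steps above, the example below cleans a review and writes it to a two-table schema. It makes several assumptions: SQLite (via the standard library) stands in for MySQL so the snippet is self-contained, the `product` and `review` tables and the `normalize_review` and `store_review` helpers are hypothetical, and the sentiment scores are passed in by the caller (for instance from TextBlob's `sentiment.polarity` and `sentiment.subjectivity`) rather than computed here.

```python
import re
import sqlite3
import unicodedata


def normalize_review(text):
    """Lowercase, strip accents, drop URLs and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"https?://\S+", " ", text.lower())
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()


# Hypothetical schema: product names live in one table, so each review
# references a product by key instead of repeating its name.
SCHEMA = """
CREATE TABLE product (
    product_id INTEGER PRIMARY KEY,
    name       TEXT NOT NULL UNIQUE
);
CREATE TABLE review (
    review_id    INTEGER PRIMARY KEY,
    product_id   INTEGER NOT NULL REFERENCES product(product_id),
    body         TEXT NOT NULL,
    polarity     REAL NOT NULL CHECK (polarity BETWEEN -1 AND 1),
    subjectivity REAL NOT NULL CHECK (subjectivity BETWEEN 0 AND 1)
);
"""

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(SCHEMA)


def store_review(conn, product_name, raw_text, polarity, subjectivity):
    """Insert the product if it is new, then store the normalized review."""
    conn.execute("INSERT OR IGNORE INTO product (name) VALUES (?)", (product_name,))
    (product_id,) = conn.execute(
        "SELECT product_id FROM product WHERE name = ?", (product_name,)
    ).fetchone()
    conn.execute(
        "INSERT INTO review (product_id, body, polarity, subjectivity) "
        "VALUES (?, ?, ?, ?)",
        (product_id, normalize_review(raw_text), polarity, subjectivity),
    )
    conn.commit()


# Example scores are made up for illustration.
store_review(conn, "Headphones X", "  Ótimo produto!!! Veja https://example.com/x  ", 0.8, 0.75)
store_review(conn, "Headphones X", "Stopped working after 2 weeks...", -0.4, 0.5)
```

Keeping products and reviews in separate tables means a dashboard can join and aggregate by product without the product name being duplicated on every review row.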

View Project Repository


Skills & Tools

Backend & Architecture

  • Languages & Runtimes: JavaScript (Node.js), PHP (Laravel), Python.
  • Databases: MySQL (expert in 3NF schema design), PostgreSQL, NoSQL.
  • Design: RESTful APIs, System Refactoring, Scalable Infrastructure.

Big Data & Engineering

  • Frameworks: Apache Spark (PySpark), Delta Lake.
  • Platforms: Databricks, XAMPP.
  • ETL/ELT: Pipeline orchestration and data quality enforcement.

Business Intelligence

  • Analytics: SQL for deep data exploration.
  • Visualization: Power BI, data-driven KPI definition.

Connect with me

About

Personal portfolio showcasing end-to-end projects in Backend Development, Big Data Pipelines, and BI. Focused on Python, PySpark, Node.js, and SQL optimization.
