Building AI systems and research-driven software

ranjithguggilla/README.md

Ranjith Guggilla

Research Data Engineer

Building tools for oceanographic data curation, metadata compliance, and FAIR data publication.

Python · xarray · CF-1.8 · ISO 19115


Oceanographic Data Engineering Portfolio

Slocum glider mission archive pipeline. Ingests IOOS Glider DAC segments, merges into CF-1.8 trajectory NetCDF, computes TEOS-10 derived variables, applies QARTOD QC, and produces archive packages with interactive maps, reports, and DataCite DOI metadata.

CF-1.8 · TEOS-10 · QARTOD · DataCite 4.4 · CI
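
The QC step can be illustrated with a QARTOD-style gross range test. This is a minimal sketch, not the pipeline's actual code: the function name `gross_range_flags` and the temperature thresholds are assumptions chosen for illustration, and only the flag values (1 pass, 3 suspect, 4 fail, 9 missing) follow the IOOS QARTOD convention.

```python
import math

# QARTOD flag values (IOOS QARTOD convention)
GOOD, NOT_EVALUATED, SUSPECT, FAIL, MISSING = 1, 2, 3, 4, 9


def gross_range_flags(values, fail_span, suspect_span):
    """Return one QARTOD flag per value using a gross range test.

    fail_span:    (low, high) limits outside of which a value fails.
    suspect_span: (low, high) narrower limits outside of which a value
                  is only marked suspect.
    """
    fail_lo, fail_hi = fail_span
    sus_lo, sus_hi = suspect_span
    flags = []
    for v in values:
        if v is None or math.isnan(v):
            flags.append(MISSING)
        elif v < fail_lo or v > fail_hi:
            flags.append(FAIL)
        elif v < sus_lo or v > sus_hi:
            flags.append(SUSPECT)
        else:
            flags.append(GOOD)
    return flags


# Hypothetical temperature samples in degrees Celsius (made up for illustration).
temps = [12.4, 35.2, float("nan"), -3.0, 18.9]
flags = gross_range_flags(temps, fail_span=(-2.5, 40.0), suspect_span=(0.0, 32.0))
print(flags)  # [1, 3, 9, 4, 1]
```

Keeping the flags as a parallel list mirrors how QC results are usually stored next to the data variable rather than by dropping values.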

ISO 19115-2 metadata compliance engine. XSD structure checks, Schematron policy rules, CF-1.8 and ACDD-1.3 attribute validation, YAML rules DSL for custom policies, FAIR self-scoring, and a FastAPI web dashboard.

ISO 19115-2 · ACDD-1.3 · FAIR · FastAPI · CI
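
Rule-driven attribute validation might look roughly like the sketch below. It is an assumption-laden illustration, not the validator's real rule format: the `RULES` list stands in for what a parsed YAML rules file could contain, and the keys `attr`, `level`, and `contains` are hypothetical. The attribute names themselves come from ACDD-1.3.

```python
# Stand-in for a parsed YAML rules file (hypothetical schema).
RULES = [
    {"attr": "title", "level": "highly_recommended"},
    {"attr": "summary", "level": "highly_recommended"},
    {"attr": "keywords", "level": "highly_recommended"},
    {"attr": "Conventions", "level": "highly_recommended", "contains": "CF-1.8"},
    {"attr": "creator_name", "level": "recommended"},
]


def check_attributes(attrs, rules):
    """Return a list of (level, attribute, message) findings."""
    findings = []
    for rule in rules:
        name = rule["attr"]
        value = str(attrs.get(name, "")).strip()
        if not value:
            findings.append((rule["level"], name, "missing or empty"))
        elif "contains" in rule and rule["contains"] not in value:
            findings.append(
                (rule["level"], name, f"expected to contain {rule['contains']!r}")
            )
    return findings


# Hypothetical global attributes of a NetCDF file (made up for illustration).
attrs = {
    "title": "Glider mission 2024",
    "summary": "",
    "Conventions": "ACDD-1.3",
    "creator_name": "Example Lab",
}
for level, name, message in check_attributes(attrs, RULES):
    print(f"{level}: {name}: {message}")
```

Expressing the policy as data keeps custom rules editable without touching the checking code.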

CTD cast processing pipeline. Reads Sea-Bird CNV files, applies TEOS-10 conversions via GSW, flags outliers, bins to standard depths, and exports CF-compliant NetCDF with full provenance.

TEOS-10 · Sea-Bird · CF-1.8 · CI
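
Binning to standard depths can be sketched as a simple bin average. The function `bin_to_depths`, the bin half-width, and the sample cast are assumptions for illustration and do not reflect the processor's actual interface.

```python
from statistics import mean


def bin_to_depths(depths, values, standard_depths, half_width):
    """Average the samples that fall within +/- half_width of each standard depth.

    Returns a dict mapping each standard depth to its bin mean,
    or None when no sample falls inside the bin.
    """
    binned = {}
    for target in standard_depths:
        in_bin = [v for d, v in zip(depths, values) if abs(d - target) <= half_width]
        binned[target] = mean(in_bin) if in_bin else None
    return binned


# Hypothetical cast: depth in metres, temperature in degrees Celsius.
depths = [1.0, 4.0, 6.0, 9.0, 11.0, 24.0]
temps = [20.0, 19.0, 18.0, 17.0, 16.0, 12.0]
print(bin_to_depths(depths, temps, standard_depths=[5, 10, 20], half_width=2.5))
# {5: 18.5, 10: 16.5, 20: None}
```

Empty bins are reported as None rather than interpolated, so gaps in the cast stay visible downstream.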

NDBC buoy observation ETL pipeline. Extracts Gulf of Mexico station data, transforms with unit normalization and QC flagging, loads into partitioned Parquet with DuckDB analytics.

NDBC · DuckDB · Parquet · CI
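
The transform step can be pictured with the sketch below, which handles missing values, converts one unit, and derives a Hive-style partition key. The record layout, output field names, and partition scheme are assumptions for illustration, and the sample values are made up. It relies on NDBC real-time files marking missing data with `MM` and reporting pressure in hPa.

```python
from datetime import datetime, timezone

QC_NOT_EVALUATED, QC_MISSING = 2, 9  # QARTOD-style flag values


def parse_value(token):
    """Convert an NDBC text token to float; "MM" marks a missing value."""
    return None if token == "MM" else float(token)


def normalize(record):
    """Normalize one raw observation into SI-style fields plus a partition key."""
    ts = datetime.strptime(record["time"], "%Y-%m-%dT%H:%M:%SZ").replace(
        tzinfo=timezone.utc
    )
    pres_hpa = parse_value(record["PRES"])
    atmp_c = parse_value(record["ATMP"])
    return {
        "station": record["station"],
        "time": ts,
        # hPa -> Pa
        "pressure_pa": None if pres_hpa is None else pres_hpa * 100.0,
        "air_temp_c": atmp_c,
        "air_temp_qc": QC_MISSING if atmp_c is None else QC_NOT_EVALUATED,
        # Hypothetical partition layout: station / year / month
        "partition": f"station={record['station']}/year={ts.year}/month={ts.month:02d}",
    }


# Hypothetical raw record (values made up for illustration).
raw = {"station": "42019", "time": "2024-01-15T06:50:00Z", "PRES": "1018.5", "ATMP": "MM"}
row = normalize(raw)
print(row["pressure_pa"], row["air_temp_qc"], row["partition"])
```

A partition key of this shape lets a query engine such as DuckDB prune files by station and month when reading the Parquet dataset.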

Reusable framework for building oceanographic data curation pipelines. Provides base classes for ingest, transform, validate, and publish stages with plugin architecture, checksum verification, and structured logging.

Pipeline · SHA-256 · CI
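
A stage-based design with checksum tracking could be sketched as follows. The class and function names (`Stage`, `run_pipeline`, `verify`) are hypothetical and do not describe the framework's real API; only `hashlib.sha256` and `abc` from the standard library are used.

```python
import hashlib
from abc import ABC, abstractmethod


class Stage(ABC):
    """Base class for a pipeline stage (hypothetical interface)."""

    name = "stage"

    @abstractmethod
    def run(self, payload: bytes) -> bytes:
        ...


class Ingest(Stage):
    name = "ingest"

    def run(self, payload: bytes) -> bytes:
        return payload.strip()  # drop surrounding whitespace


class Transform(Stage):
    name = "transform"

    def run(self, payload: bytes) -> bytes:
        return payload.upper()  # placeholder transformation


def sha256_hex(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def run_pipeline(stages, payload: bytes):
    """Run stages in order and record a SHA-256 digest after each one."""
    manifest = []
    for stage in stages:
        payload = stage.run(payload)
        manifest.append({"stage": stage.name, "sha256": sha256_hex(payload)})
    return payload, manifest


def verify(payload: bytes, expected_sha256: str) -> bool:
    """Check that a payload still matches a recorded digest."""
    return sha256_hex(payload) == expected_sha256


result, manifest = run_pipeline([Ingest(), Transform()], b"  temp,12.4\n")
print(result, verify(result, manifest[-1]["sha256"]))
```

Recording a digest per stage makes it possible to tell at which step an archived file stopped matching its manifest.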


Tech Stack

| Category | Tools |
|---|---|
| Languages | Python, Bash, SQL |
| Ocean Data | xarray, netCDF4, GSW (TEOS-10), CF-conventions, ERDDAP |
| Standards | ISO 19115-2, ACDD-1.3, IOOS QARTOD, DataCite 4.4, FAIR |
| Data | Pandas, NumPy, DuckDB, Parquet |
| Web | FastAPI, Jinja2, Folium |
| Quality | pytest, ruff, GitHub Actions CI |
| XML/Schema | lxml, XSD, Schematron, XPath |

Contact

Email · GitHub · LinkedIn

Pinned

  1. glider-data-curation (Public)

    Slocum glider mission archive pipeline: ingest, merge, QC, and package IOOS DAC trajectory data

    Python

  2. gulf-buoy-etl (Public)

    Production-grade autonomous ETL pipeline for Gulf of Mexico TABS + NDBC buoy data: idempotent CF-1.8 NetCDF archives, SHA-256 fixity, Prometheus metrics, Zenodo DOI minting

    Python

  3. ctd-cast-processor (Public)

    Sea-Bird CTD processing pipeline: raw .cnv to CF-1.8 NetCDF with TEOS-10 derived variables, QC (loop edit, despike, bin-average), and IOOS compliance validation

    Python

  4. iso19115-validator (Public)

    ISO 19115-2, CF-1.8, and ACDD-1.3 metadata linter with Schematron policy rules, YAML DSL, FAIR scoring, and FastAPI web UI

    Python

  5. ocean-curation-pipeline-toolkit (Public)

    FAIR oceanographic dataset packaging pipeline for scientific data repositories (ISO 19115-2, NetCDF, checksums)

    Python

  6. EDITH-voice-assistant (Public)

    EDITH – A Python-based virtual voice assistant with speech recognition, text-to-speech, weather, news, Wikipedia, WhatsApp, email, YouTube, translation, and system control. Packaged with CI, docs, …

    Python