YahyaSoker/README.md

Hi there, I'm Yahya Söker 👋

AI Solutions Architect | Edge-AI & Real-Time Inference


I bridge the gap between research-grade ML models and production hardware. My focus is on air-gapped multimodal agents, medical diagnostic pipelines, and high-concurrency streaming systems.


🚀 Key Architecture & Projects

Local NotebookLM & Mind-Mapper

Built a local research assistant using LLM-aided OCR to extract handwritten "stickers" and notes. Powered by a local LLM, it dynamically generates interactive, expandable flowcharts and a vectorized knowledge base for local RAG.

⚙️ Flow: Image Ingest ➔ LLM OCR ➔ Entity Extraction ➔ Dynamic Expanding Flowchart

🛠️ Tech: Local LLM, Dynamic UI, OCR, Local RAG

Nebulai Ecosystem

Architected a local LLM cluster and an agentic routing system. Cut token costs by 30% and operational costs by 60%.

⚙️ Flow: User Query ➔ Router Agent ➔ vLLM Cluster ➔ Cost-Optimized Output

🛠️ Tech: vLLM, Agentic Orchestration, Cost Optimization
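The routing step in this flow can be illustrated with a minimal Python sketch. It is not the Nebulai implementation: the backend names, relative costs, complexity ceilings, and the keyword heuristic are all hypothetical, chosen only to show how a router might send simple queries to a cheaper model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Backend:
    name: str                   # hypothetical model identifier
    cost_per_1k_tokens: float   # hypothetical relative cost
    max_complexity: int         # highest complexity score this backend should serve


# Hypothetical backends, ordered from cheapest to most expensive.
BACKENDS = [
    Backend("small-model", 0.1, 3),
    Backend("medium-model", 0.5, 7),
    Backend("large-model", 2.0, 10),
]

# Hypothetical keywords that suggest the query needs multi-step reasoning.
REASONING_HINTS = ("why", "explain", "compare", "prove", "step by step")


def estimate_complexity(query: str) -> int:
    """Crude heuristic score in the range 0-10: length plus reasoning keywords."""
    lowered = query.lower()
    score = min(len(lowered.split()) // 10, 5)
    score += 2 * sum(hint in lowered for hint in REASONING_HINTS)
    return min(score, 10)


def route(query: str) -> Backend:
    """Return the cheapest backend whose complexity ceiling covers the query."""
    complexity = estimate_complexity(query)
    for backend in BACKENDS:
        if complexity <= backend.max_complexity:
            return backend
    return BACKENDS[-1]
```

In a real deployment the selected backend name would presumably map to a model served by the vLLM cluster, and the heuristic could be replaced by a small classifier or an LLM-based router agent.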

OpTomo (Medical Diagnostic Pipeline)

Designed an end-to-end inference pipeline for breast cancer detection, migrating heavy Python logic to hardware-accelerated bindings for a 40% latency reduction.

⚙️ Flow: Medical Scan ➔ Python Preprocessing ➔ C++ Inference Bindings ➔ Real-Time Diagnosis

🛠️ Tech: C++, Python, Hardware Optimization, Edge Inference

Secure Enterprise RAG

Built an air-gapped "Talk-to-your-Data" tool utilizing hybrid search and robust data engineering pipelines.

⚙️ Flow: Enterprise Data ➔ Apache NiFi Ingest ➔ Hybrid Search (BM25 + Vector) ➔ Secure LLM

🛠️ Tech: Apache NiFi, Milvus / Pinecone, Hybrid Search, Air-gapped LLM
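The README does not say how the BM25 and vector results are combined. One common option is reciprocal rank fusion (RRF), sketched below as an assumption and not as the project's actual method. The function name and the document IDs in the usage note are illustrative; `k=60` is the constant commonly used with RRF.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def reciprocal_rank_fusion(rankings: Sequence[Sequence[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document receives 1 / (k + rank) from every list it appears in,
    so documents ranked highly by both retrievers rise to the top.
    """
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by descending fused score; break ties by ID for deterministic output.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))
```

For example, fusing a BM25 ranking `["a", "b", "c"]` with a vector ranking `["b", "c", "d"]` puts `"b"` first, because it ranks highly in both lists.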

Mobile Edge-AI Engine

Engineered a hot-swappable mobile inference engine capable of switching neural architectures at runtime without requiring app store updates.

⚙️ Flow: Android App ➔ Kotlin Orchestrator ➔ ONNX Model Hot-Swap ➔ On-Device Inference

🛠️ Tech: Kotlin, ONNX, Android SDK, Neural Architecture
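The hot-swap idea can be sketched independently of platform. The project itself is described as Kotlin with ONNX; the Python version below is only a conceptual illustration. `HotSwapEngine` and its methods are hypothetical names, and plain callables stand in for loaded ONNX sessions.

```python
import threading
from typing import Callable, Sequence, Tuple

# A "model" here is any callable from inputs to an output; in the real engine
# this would presumably wrap an ONNX inference session (assumption).
Model = Callable[[Sequence[float]], float]


class HotSwapEngine:
    """Holds the active model and lets it be replaced while the app keeps running."""

    def __init__(self, model: Model, version: str) -> None:
        self._lock = threading.Lock()
        self._model = model
        self._version = version

    def swap(self, model: Model, version: str) -> None:
        """Replace the active model; requests already in flight finish on the old one."""
        with self._lock:
            self._model = model
            self._version = version

    def infer(self, inputs: Sequence[float]) -> Tuple[str, float]:
        """Run the currently active model and report which version answered."""
        # Snapshot model and version together so they always match.
        with self._lock:
            model, version = self._model, self._version
        # Inference runs outside the lock so a swap never waits on a slow model.
        return version, model(inputs)
```

Usage: create the engine with an initial model, call `infer` as usual, and call `swap` with a newly downloaded model when one becomes available. Callers do not need to restart or re-create the engine.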

🐍 Contribution Snake


πŸ› οΈ Tech Stack

Edge AI & Hardware Optimization

Generative AI, Agents & Audio

Data Engineering, RAG & Databases

Computer Vision, Analysis & Visualization

Full Stack & DevOps


📊 GitHub Stats

Stats Langs

📫 Connect

LinkedIn Email Kaggle


Pinned

  1. Medical_Detection (Public)

    This worktree combines multiple medical detection and AI applications, integrating deep learning models, computer vision techniques, and large language models (LLMs) to create comprehensive medical…

    Jupyter Notebook 2

  2. Object_Detection (Public)

    This repository contains multiple computer vision projects including object detection, image classification, and facial emotion recognition. Each project focuses on detecting or classifying specifi…

    Jupyter Notebook 2

  3. LLM-Projects (Public)

    Jupyter Notebook