Skip to content
View kimsijin33's full-sized avatar
😊
I may be slow to respond.
😊
I may be slow to respond.

Block or report kimsijin33

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kimsijin33/README.md

I'm AI Engineer Kim Si Jin.

AI Engineer · LLM/RAG · Government R&D Project Lead · End-to-End ML


👨‍💻 About Me

I like deep neural nets. - AI Engineer with 4+ years of specialized AI/ML experience and 19+ years of IT infrastructure background.
I design and build RAG systems, LLM applications, and end-to-end AI pipelines — from data collection and model training to cloud deployment and monitoring.

  • 🏆 1st place — Upstage AI OCR Competition (2025.12)
  • 📄 SCOPUS-indexed international journal publication — GCN-based Fire Situation Recognition (2024)
  • 🏛 Led government R&D AI projects (ETRI, IITP) as AI Project Lead
  • 🤖 Built RAG/NLP systems deployed in real-world services (112/119 emergency inference, B2B matching platform)
  • ☁️ Experienced in MLOps: MLflow, GitHub Actions, AWS EC2/S3, Docker-based CI/CD/CT pipelines

🔑 Core Skills

LLM · RAG
LangChain LangGraph ChromaDB FAISS Prompt Engineering

ML · DL
PyTorch TensorFlow Scikit-learn XGBoost GCN BERT

MLOps · Cloud
AWS Docker MLflow GitHub Actions

Backend · Infra
Python FastAPI Spring Boot Node.js PostgreSQL


🚀 Featured Projects

🔬 GCN-Based Fire Situation Recognition (SCOPUS, 2024)

  • Researched and developed a fire-situation recognition model based on a Graph Convolutional Network (GCN)
  • 95% prediction accuracy achieved · Published in a SCOPUS-indexed international journal
  • AI Project Lead · Owned the full pipeline: data collection, preprocessing, labeling, and model training

🏆 Upstage AI OCR Competition — 1st Place (2025.12)

  • Designed a receipt text-detection model and optimized polygon-coordinate detection
  • Built a robust detection pipeline across 3,600+ images
  • Maximized generalization performance through data augmentation and hyperparameter tuning

🚨 112/119 Emergency Urgency Inference — ETRI (2020–2024)

  • Developed an NLP model and API to infer urgency level (Code 0–4) and incident type from emergency-call text
  • AI Project Lead · Led a team of 3 data scientists and 2 AI developers

📊 Digital Signal Processing (DSP) Desktop Software — ETRI (2022–2024)

  • Converted a MATLAB-based signal-processing application developed at ETRI (Electronics and Telecommunications Research Institute) into C++ software for a Windows desktop application
  • AI Project Lead · Led a team of 1 data scientist and 3 AI developers

🌐 Global B2B Matching AI Platform — IITP R&D (2023)

  • Designed an ETL pipeline for 400GB of Russian patent data → produced 30,000 high-quality JSON records
  • Developed a Multilingual BERT-based enterprise-matching classification model · 90% classification accuracy
  • Devised a three-stage cross-mapping table (IPC ↔ KSIC) and built a VPN-based crawling engine to collect a database of 100,000 Russian companies

🔍 Missing Child Detection Project (YOLO) — ETRI (2020–2021)

  • CCTV video-based missing-child detection project using a YOLO model, conducted at ETRI (Electronics and Telecommunications Research Institute)
  • AI Project Lead · Led a team of 3 data scientists and 2 AI developers

🤖 RAG Chatbot — LangChain + Upstage Solar LLM (2025)

  • Built an end-to-end RAG service: constructed a ChromaDB/FAISS vector database from self-scraped LangChain technical documentation
  • Improved Top-k retrieval accuracy by 30% · Docker Compose CI/CD · Implemented a streaming UI

📊 Apartment Price Prediction — Seoul (2025)

  • Combined Ministry of Land, Infrastructure and Transport transaction data with subway/bus accessibility data through feature engineering
  • Built an XGBoost/LightGBM ensemble model · Reduced RMSE by 35% versus the baseline

📚 Research & Publications

Year Title Venue
2024 Recognition of Fire Situation Using Graph Convolutional Network Model SCOPUS-indexed International Journal
2023 Fire Situation Recognition Using a GCN Model Korea Information Processing Society Conference
2023 Fire Situation Recognition Using a Graph Convolutional Network Model Master's Thesis

🏅 Certifications & Awards

🥇 Upstage AI OCR Competition 1st Place 2025.12
📜 Engineer Information Processing 2012
🌐 CCNA (Score: 1000/1000) 2006
🐧 LPIC-1 2013

📬 Contact

Pinned Loading

  1. ClusterGCN ClusterGCN Public

    Forked from benedekrozemberczki/ClusterGCN

    A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).

    Python 1

  2. -AI- -AI- Public

    Jupyter Notebook

  3. 8wk-dl-dano-ai.github.io 8wk-dl-dano-ai.github.io Public

    Forked from dano-ai/8wk-dl-dano-ai.github.io

    8-week deep dive into deep learning

    CSS

  4. AIforallthepeople AIforallthepeople Public

    Shell

  5. nngraph nngraph Public

    Forked from torch/nngraph

    Graph Computation for nn

    Lua

  6. pygcn pygcn Public

    Forked from tkipf/pygcn

    Graph Convolutional Networks in PyTorch

    Python