Skip to content
View bernardinolintang's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report bernardinolintang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bernardinolintang/README.md

πŸ‘‹ Hi, I'm Bernardino Lintang

Year 3 Data Science & Analytics student at NUS building explainable AI systems, LLM-powered agents, and production-grade ML applications.

I focus on:

  • Agentic AI workflows
  • Retrieval-Augmented Generation (RAG) systems
  • Hybrid ML + LLM architectures
  • Explainable company intelligence systems

Currently exploring AI systems design, evaluation frameworks, and scalable AI product deployment.

visitors


Welcome! πŸ‘‹

πŸ§‘β€πŸ’» About Me

  • 🌱 Pursuing a B.Sc. (Hons) in Data Science & Analytics
  • πŸ’‘ Strong interests in AI/ML, AI Agents, Machine Learning, and Product Development
  • πŸš€ Experienced in data science, operations, and leadership through internships and student organisations
  • πŸ† Finalist at NUS SDS Datathon 2026 (Top 1/76 teams)
  • πŸ† Finalist at NUS SDS Hackathon 2025 (Top 3/40 teams)
  • πŸ† Finalist at NUS SDS Datathon 2025 (Top 5/75 teams)
  • ✨ Believer in teamwork, adaptability, and continuous learning

🌱 Currently Exploring

  • Multi-agent orchestration frameworks
  • Vector database optimisation strategies
  • Embedding evaluation techniques
  • LLM cost-latency tradeoffs
  • AI evaluation benchmarks (precision, hallucination detection)
  • AI product management frameworks

πŸ› οΈ Tech Stack

βš™οΈ Languages

Python R SQL Java TypeScript JavaScript HTML5 CSS3

πŸ€– AI & Machine Learning

scikit-learn TensorFlow Keras LangChain LangGraph OpenAI Claude Groq Ollama

πŸ“Š Data Analysis & Visualisation

NumPy Pandas Matplotlib Plotly Jupyter Tableau Power BI

🌐 Web & APIs

React Next.js FastAPI Flask Node.js

πŸ’Ύ Databases & Storage

PostgreSQL Supabase Snowflake BigQuery SQLite pgvector

πŸ—‚οΈ DevOps & Tools

Docker Git GitHub Figma Canva


πŸ’Ό Experience

  • Artificial Intelligence Entrepreneur Intern @ Crayon Data (Dec 2025 – Jan 2026) β†’ Built LLM-powered data extraction and agentic workflows to transform unstructured merchant offers into schema-compliant datasets, reducing manual processing and improving data quality at scale.

  • Operations (Data Science) Intern @ Superbank (May – Aug 2025) β†’ Engineered 200+ fraud detection features in Python/SQL, optimised queries on 50M+ records (~40% faster), built real-time monitoring pipelines in Snowflake.

  • Head of Branding & Marketing @ Developer Group, NUS Computing (Jul 2025 – Present) β†’ Leading branding initiatives and managing associates for impactful events.

  • Curriculum Executive @ NUS Product Club (Jul 2025 – Present) β†’ Organised flagship Product Management Executive (PME) programme with industry PMs.

  • Marketing Head @ NUS Science Club (2024–25) β†’ Secured sponsorships, co-led 7 executives, and streamlined sponsor email processes (50% faster).


πŸš€ Flagship Project: IntelliCompany AI (1st Place Winner)

Explainable AI-powered company intelligence system combining classical ML clustering with LLM-based reasoning β€” fully grounded, no hallucination.

Problem

Traditional company intelligence tools lack explainability and rely on black-box scoring systems.

Solution

Built a hybrid ML + LLM architecture:

  • K-Prototypes clustering on 8,559 companies
  • 20+ engineered financial and IT intensity features
  • Structured RAG-style retrieval (15 records/query)
  • Guardrail-constrained Llama 3.3 70B via Groq
  • Temperature-controlled outputs (0.1)
  • Dockerised full-stack system

Technical Highlights

  • Mixed-data clustering (numeric + categorical)
  • Silhouette-based K selection
  • Dynamic knowledge base generation
  • Strict dataset-only grounding
  • No hallucination policy enforcement
  • React + FastAPI production pipeline

Why This Matters

Demonstrates ability to:

  • Combine classical ML with modern LLMs
  • Design explainable AI systems
  • Build safe and grounded AI workflows
  • Ship end-to-end AI applications

πŸ€– LLM & Agent Experience

Crayon Data – AI Intern

  • Built LLM-powered workflows to extract structured insights from unstructured merchant datasets
  • Designed prompt pipelines for schema-compliant data generation
  • Implemented guardrails for output consistency
  • Worked with production-grade AI deployment workflows

CPF Board – Data & AI

  • Worked on AI-driven data pipelines
  • Applied ML models in regulated public sector environments
  • Focused on accuracy, auditability, and stakeholder trust

πŸ† Achievements

  • πŸ… NUS Datathon 2026 Finalist (Top 1/76 teams)
  • πŸ“š Principal's Honours Roll (2019 & 2020)
  • πŸŽ“ Top cohort results for O-Level Mathematics & Science
  • ⭐ Outstanding academic performance in JC1 & JC2

🧠 My AI Systems Approach

When building AI systems, I prioritise:

  1. Grounding over hallucination
  2. Classical ML + LLM hybrid architectures
  3. Explainability before automation
  4. Low-temperature deterministic outputs for business settings
  5. Clear guardrails and retrieval constraints
  6. Containerised deployment-ready systems

I believe production AI must be: Safe Β· Interpretable Β· Scalable Β· Cost-aware


πŸ“§ Contacts / Profiles

Reach out to me via the platforms below! My resume is available on request.

LinkedIn Email


πŸ“ˆ GitHub Stats

Bernardino's github activity graph

GitHub Stats Top Languages GitHub Streak

Pinned Loading

  1. BernardinoLintangWebsite BernardinoLintangWebsite Public

    Personal portfolio website showcasing my projects, skills, and achievements as a Data Science & Analytics student at NUS. Built with React, Vite, and TailwindCSS, and deployed on Vercel.

    TypeScript

  2. Colour-Classification-Computer-Vision Colour-Classification-Computer-Vision Public

    A computer vision project for colour classification using Python and OpenCV.

    Python

  3. IT1244-Project-DNA-Binding-Protein IT1244-Project-DNA-Binding-Protein Public

    This project explores the use of machine learning techniques to classify DNA-binding proteins (DBPs) from non-DNA-binding proteins. Accurate DBP identification is critical for understanding biologi…

    Python

  4. Quiz-Bank Quiz-Bank Public

    Team-maintained DSA3101 Quiz Bank designed for collaborative revision. Includes categorized questions, difficulty levels, and reusable templates to support students preparing for Data Science in Pr…

    Python

  5. SDS-Hackathon-Insurance-Costs SDS-Hackathon-Insurance-Costs Public

    NUS SDS Mini-Datathon 2025 - Top 3. End-to-end ML project for predicting medical insurance costs using linear (Ridge, Lasso, Elastic Net) and tree-based (Random Forest, XGBoost) models. Includes AI…

    Jupyter Notebook

  6. DSA2101-Taylor-Swift-Music-Analysis DSA2101-Taylor-Swift-Music-Analysis Public

    Data Science analysis of Taylor Swift’s discography using Spotify audio features, Metacritic scores, and user ratings. Explores how acousticness, valence, collaborations, and other song characteris…