Skip to content

suryast/indonesia-civic-stack

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

65 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ‡ฎ๐Ÿ‡ฉ indonesia-civic-stack

PyPI MCP Registry CI Python License

Production-ready scrapers, normalizers, and API wrappers for Indonesian government data sources.

The infrastructure layer beneath halalkah.id, legalkah.id, and a public good for the Indonesian civic tech and developer community.


Why

Indonesian public data is nominally open but practically inaccessible. Every developer building civic tooling re-solves the same scraping problems independently: BPOM product registrations, BPJPH halal certificates, AHU company records. Scrapers bit-rot within months as portals change. There is no shared, maintained layer.

This repo is that layer. One pip install to query Indonesian government portals โ€” no more bespoke scrapers.

AI-Agent First

This SDK is designed for both humans and AI agents:

  • ๐Ÿค– 46 MCP tools โ€” plug into Claude, GPT, or any MCP-compatible agent
  • ๐Ÿ“‹ SKILL.md โ€” AI agent skill discovery (AgentSkills format)
  • ๐Ÿง‘โ€๐Ÿ’ป AGENTS.md โ€” architecture guide for coding agents (Claude Code, Codex, Cursor)
  • ๐Ÿ“ CLAUDE.md โ€” Claude Code-specific instructions
  • โœ… Typed responses โ€” CivicStackResponse envelope, never raw dicts
  • ๐Ÿ” Consistent patterns โ€” every module follows the same contract

Architecture

graph TB
    subgraph "Your App"
        A[halalkah.id] 
        B[legalkah.id]
        C[Your Project]
    end

    subgraph "civic-stack"
        SDK[Python SDK]
        MCP[MCP Servers]
        API[REST API]
        
        subgraph "Shared Layer"
            SC[shared/schema.py<br/>CivicStackResponse]
            HC[shared/http.py<br/>Rate limiting ยท Retries ยท Proxy]
        end

        subgraph "Phase 1"
            BPOM[bpom<br/>Food & Drug]
            BPJPH[bpjph<br/>Halal Certs]
            AHU[ahu<br/>Company Registry]
        end

        subgraph "Phase 2"
            OJK[ojk<br/>Financial Licenses]
            OSS[oss_nib<br/>Business ID]
            LPSE[lpse<br/>Procurement]
            KPU[kpu<br/>Elections]
        end

        subgraph "Phase 3"
            LHKPN[lhkpn<br/>Wealth Declarations]
            BPS[bps<br/>Statistics]
            BMKG[bmkg<br/>Weather & Disasters]
            SIMBG[simbg<br/>Building Permits]
        end
    end

    subgraph "Government Portals"
        P1[cekbpom.pom.go.id]
        P2[sertifikasi.halal.go.id]
        P3[ahu.go.id]
        P4[ojk.go.id]
        P5[oss.go.id]
        P6[lpse.*.go.id]
        P7[infopemilu.kpu.go.id]
        P8[elhkpn.kpk.go.id]
        P9[webapi.bps.go.id]
        P10[data.bmkg.go.id]
        P11[simbg.pu.go.id]
    end

    A & B & C --> SDK & MCP & API
    SDK & MCP & API --> SC
    SC --> BPOM & BPJPH & AHU & OJK & OSS & LPSE & KPU & LHKPN & BPS & BMKG & SIMBG
    BPOM & BPJPH & AHU & OJK & OSS & LPSE & KPU & LHKPN & BPS & BMKG & SIMBG --> HC
    BPOM --> P1
    BPJPH --> P2
    AHU --> P3
    OJK --> P4
    OSS --> P5
    LPSE --> P6
    KPU --> P7
    LHKPN --> P8
    BPS --> P9
    BMKG --> P10
    SIMBG --> P11
Loading

Request Flow

sequenceDiagram
    participant App as Your App
    participant SDK as Civic SDK
    participant HTTP as shared/http.py
    participant Proxy as Proxy (optional)
    participant Portal as Gov Portal

    App->>SDK: search("paracetamol")
    SDK->>HTTP: civic_client(proxy_url)
    Note over HTTP: Auto-reads PROXY_URL<br/>from environment
    alt rewrite mode (CF Worker)
        HTTP->>Proxy: GET ?url=encoded_target
        Proxy->>Portal: Forwarded request
        Portal-->>Proxy: HTML/JSON response
        Proxy-->>HTTP: Response
    else connect mode (SOCKS/HTTP)
        HTTP->>Proxy: CONNECT tunnel
        Proxy->>Portal: Proxied request
        Portal-->>HTTP: Response
    else no proxy
        HTTP->>Portal: Direct request
        Portal-->>HTTP: Response
    end
    HTTP-->>SDK: httpx.Response
    SDK->>SDK: Parse + Normalize
    SDK-->>App: CivicStackResponse
Loading

Module Status

Module Source Data Proxy Status
bpom cekbpom.pom.go.id Food, drug, cosmetic registrations ๐ŸŒ โœ… Active
bpjph cmsbl.halal.go.id Halal certificates (1.98M+ records) ๐ŸŒ โœ… Active โ€” migrated to REST API (v1.0.0)
ahu ahu.go.id Company registry โ€” PT, CV, Yayasan, Koperasi ๐Ÿ‡ฎ๐Ÿ‡ฉ โš ๏ธ Page restructured โ€” search input changed (Apr 2026)
ojk www.ojk.go.id/waspada-investasi Licensed financial institutions + Waspada list ๐Ÿ‡ฎ๐Ÿ‡ฉ โš ๏ธ Portal migrated to SharePoint (Apr 2026) โ€” scraper needs rewrite
oss_nib oss.go.id Business identity (NIB) ๐Ÿ‡ฎ๐Ÿ‡ฉ โš ๏ธ Page restructured โ€” Playwright can't find inputs (Apr 2026)
lpse spse.inaproc.id Government procurement ๐Ÿ‡ฎ๐Ÿ‡ฉ โœ… Active โ€” un-deprecated (v1.0.0)
kpu infopemilu.kpu.go.id Election data โ€” candidates, results, finance ๐ŸŒ โœ… Active
bps webapi.bps.go.id Statistical datasets (1,000+) ๐ŸŒ โœ… Active (requires BPS_API_KEY)
bmkg data.bmkg.go.id Weather, earthquake, and disaster data ๐ŸŒ โœ… Active
simbg simbg.pu.go.id Building permits (PBG) โ€” multi-portal ๐ŸŒ โœ… Active
jdih peraturan.go.id National legal database โ€” UU, PP, Perpres, Permen ๐Ÿ‡ฎ๐Ÿ‡ฉ โœ… New โ€” Playwright scraping
ksei web.ksei.co.id Securities statistics (62 monthly PDFs) + registered securities ๐ŸŒ โœ… New โ€” HTML scraping (no proxy needed)
djpb data-apbn.kemenkeu.go.id APBN budget themes โ€” target/realization/achievement ๐Ÿ‡ฎ๐Ÿ‡ฉ โœ… New โ€” clean REST JSON API
lhkpn elhkpn.kpk.go.id Wealth declarations (officials) โ€” โœ… Active โ€” reCAPTCHA v3 solved via Playwright

๐ŸŒ = works globally ย  ๐Ÿ‡ฎ๐Ÿ‡ฉ = requires Indonesian proxy (set PROXY_URL)

Every module returns the same CivicStackResponse envelope โ€” swap data sources without touching application logic.

Module Maturity

Module Scraper Normalizer MCP Tests Portal Status
bpom โœ… โœ… โœ… โœ… โœ…
bpjph โœ… โœ… โœ… โœ… โœ… REST API
ahu โœ… โœ… โœ… โœ… โš ๏ธ page restructured
ojk โœ… โœ… โœ… โœ… โš ๏ธ SharePoint migration
oss_nib โœ… โœ… โœ… โœ… โš ๏ธ page restructured
lpse โœ… โœ… โœ… โœ… ๐Ÿ‡ฎ๐Ÿ‡ฉ geo-blocked
kpu โœ… โœ… โœ… โœ… โœ…
bps โœ… โœ… โœ… โœ… โœ…
bmkg โœ… โœ… โœ… โœ… โœ…
simbg โœ… โœ… โœ… โœ… โœ…
jdih โœ… โœ… โŒ โŒ ๐Ÿ‡ฎ๐Ÿ‡ฉ Playwright
ksei โœ… โœ… โŒ โŒ โœ… (no proxy needed)
djpb โœ… โœ… โŒ โŒ โœ… REST JSON API
lhkpn โœ… โœ… โœ… โœ… โœ… Active (Playwright)

Quick Start

Install

pip install indonesia-civic-stack          # Core SDK
pip install "indonesia-civic-stack[mcp]"   # + MCP server (40 tools)
pip install "indonesia-civic-stack[api]"   # + REST API (FastAPI + uvicorn)
pip install "indonesia-civic-stack[all]"   # Everything

Python SDK

import asyncio
from civic_stack.bpom.scraper import search as bpom_search
from civic_stack.bmkg.scraper import get_latest_earthquake

async def main():
    # Search BPOM product registry
    results = await bpom_search("paracetamol")
    for r in results:
        if r.found:
            print(r.result)

    # Get latest earthquake
    eq = await get_latest_earthquake()
    print(eq.result)  # {'date': '...', 'magnitude': '5.2', ...}

asyncio.run(main())

MCP Server (for AI agents)

All 14 modules expose 46 MCP tools for use with Claude, GPT, or any MCP-compatible agent.

# Install locally:
pip install "indonesia-civic-stack[mcp]"
claude mcp add civic-stack -- civic-stack-mcp

# Or deploy your own remote server (Railway one-click):
# See "Self-Hosted MCP Server" section below

MCP server classes support two init styles:

# Style 1: Explicit init
class BpomMCPServer(CivicStackMCPBase):
    def __init__(self):
        super().__init__("bpom")

# Style 2: Class attribute
class BmkgMCPServer(CivicStackMCPBase):
    module_name = "bmkg"

REST API

# Run all modules
uvicorn app:app --port 8000

# With API key auth (recommended)
CIVIC_API_KEY=your-secret-key uvicorn app:app --port 8000

# Individual module
uvicorn modules.bpom.app:app --port 8001

# With proxy
PROXY_URL=socks5://id-proxy:1080 uvicorn app:app --port 8000
# Endpoints
GET /bpom/check/MD123456789012
GET /bpom/search?q=paracetamol
GET /bpjph/check/BPJPH-12345
GET /ahu/search?q=PT+Contoh+Indonesia
GET /ojk/check?name=Bank+BCA
GET /kpu/candidate/search?q=Joko
GET /lhkpn/search?q=Anies          # โœ… reCAPTCHA v3 solved via Playwright
GET /bps/search?q=inflasi           # Requires BPS_API_KEY
GET /bmkg/weather?city=jakarta
GET /simbg/search?q=Jakarta+Selatan

Response Envelope

Every module returns CivicStackResponse:

{
  "result": {"product_name": "...", "registration_status": "ACTIVE"},
  "found": true,
  "status": "ACTIVE",
  "confidence": 1.0,
  "source_url": "https://cekbpom.pom.go.id/...",
  "fetched_at": "2026-03-14T06:30:00Z",
  "module": "bpom"
}

Status values: ACTIVE, EXPIRED, SUSPENDED, REVOKED, NOT_FOUND, ERROR.

When a module can't reach its portal or is missing configuration (e.g., BPS_API_KEY), it returns an error envelope instead of crashing:

{
  "result": null,
  "found": false,
  "status": "ERROR",
  "confidence": 0.0,
  "source_url": "https://webapi.bps.go.id",
  "module": "bps",
  "detail": "BPS_API_KEY not set. Register at https://webapi.bps.go.id/developer/register"
}

Module Internals

civic_stack/bpom/
โ”œโ”€โ”€ __init__.py
โ”œโ”€โ”€ app.py          # FastAPI application
โ”œโ”€โ”€ normalizer.py   # Raw HTML/JSON โ†’ structured dict
โ”œโ”€โ”€ router.py       # FastAPI routes
โ”œโ”€โ”€ scraper.py      # fetch() + search() โ€” core logic
โ”œโ”€โ”€ server.py       # FastMCP MCP server
โ”œโ”€โ”€ Dockerfile
โ””โ”€โ”€ README.md

The shared/ layer provides:

  • schema.py โ€” CivicStackResponse Pydantic model, status enum, helper constructors
  • http.py โ€” civic_client() factory with auto-proxy, rate limiter, exponential backoff retry, URL rewriting for CF Worker proxies
  • mcp.py โ€” CivicStackMCPBase abstract base class for MCP servers

Deployment Notes

Geo-blocking & Proxy Requirements

Most Indonesian government portals (*.go.id) restrict access to Indonesian IP addresses. If deploying outside Indonesia, you must set PROXY_URL to route requests through an Indonesian endpoint.

# Option 1: Indonesian VPS/SOCKS proxy (recommended for production)
export PROXY_URL="socks5://id-proxy.example.com:1080"
export PROXY_MODE="connect"

# Option 2: CF Worker proxy (free, but limited โ€” see below)
export PROXY_URL="https://your-proxy.workers.dev"
# PROXY_MODE auto-detects "rewrite" for *.workers.dev

Without a proxy, expect: DNS resolution failures, connection timeouts, or HTTP 403/404 responses from most modules.

The SDK auto-reads PROXY_URL from environment โ€” no code changes needed in scrapers or MCP servers.

Proxy Modes

Mode PROXY_URL example How it works
connect socks5://id-proxy:1080 Standard HTTP/SOCKS CONNECT proxy via httpx transport
rewrite https://x.workers.dev Rewrites URLs to ?url=<target> (auto-detected for *.workers.dev)
none (unset) Direct connection

Override auto-detection with PROXY_MODE=connect|rewrite.

CF Worker Proxy

A ready-to-deploy CF Worker proxy is included in proxy/. Deploy with:

cd proxy && npx wrangler deploy

โš ๏ธ CF Worker limitation: Many .go.id portals are themselves behind Cloudflare. CF Workers making fetch() calls to other CF-protected origins receive 403/522 errors. This is a known Cloudflare limitation.

Verified through CF Worker proxy:

Portal Status Notes
data.bmkg.go.id โœ… Works JSON API, not behind CF
cekbpom.pom.go.id โŒ 403/522 Portal is CF-protected
api.ojk.go.id โŒ DNS dead NXDOMAIN since March 2026
infopemilu.kpu.go.id โŒ 403 CF-protected
lpse.*.go.id โŒ 403 CF-protected
elhkpn.kpk.go.id โœ… 200 reCAPTCHA v3 solved via Playwright headless browser

For production with CF-protected portals, use an Indonesian VPS with a SOCKS5/HTTP proxy and set PROXY_MODE=connect.

Geo-Restriction Test Results (March 2026)

Tested from three locations to map which portals enforce geo-blocking vs WAF:

Portal Sydney (AU) Singapore Jakarta (ID) Verdict
ahu.go.id โŒ โœ… โœ… Geo-blocked (SEA+ OK)
elhkpn.kpk.go.id โŒ โœ… โœ… Geo-blocked (SEA+ OK)
ojk.go.id โŒ 403 โŒ 403 โœ… ID-only
jaga.id (KPK) โœ… โœ… โœ… No restriction
data.bmkg.go.id โœ… โœ… โœ… No restriction
cekbpom.pom.go.id โš ๏ธ โš ๏ธ โš ๏ธ CF-protected (all locations)
webapi.bps.go.id โŒ 403 โŒ 403 โŒ 403 WAF, not geo (needs API key)
lpse.lkpp.go.id โŒ โŒ โŒ Unreliable (all locations)
coretaxdjp.pajak.go.id โŒ โŒ โŒ Unreliable (all locations)

Takeaway: An Indonesian proxy (e.g., CloudKilat Jakarta) unlocks OJK โ€” the most important geo-restricted portal. Singapore unlocks AHU + LHKPN. BPS and LPSE failures are not geo-related.

VPS Hardening Lesson

โš ๏ธ Never disable password auth and restart sshd in one automated script on a fresh VPS. If the SSH key wasn't copied correctly, you're locked out with no recovery path except a web console. Always: (1) copy key, (2) verify key login works in a separate session, (3) then disable password auth.

Portal URL Stability

Indonesian government portals frequently change their URL structure without notice. Known changes as of March 2026:

Module Old URL New URL Status
BPOM /index.php/home/produk/1/{keyword}/... /all-produk?q={keyword} โœ… Updated
KPU /Pemilu/caleg/list /Pemilu/Peserta_pemilu โœ… Updated
BMKG /DataMKG/MEWS/Warning/cuacasignifikan.json /DataMKG/TEWS/gempadirasakan.json โœ… Updated
LHKPN /portal/user/check_search_announ reCAPTCHA v3 (Playwright) ๐ŸŸข Active

Modules that fail for 60 days are flagged DEGRADED and may be archived.

Browser-Based Modules

Some portals require a real browser (JavaScript rendering, anti-bot protection):

Module Browser Anti-bot
bpjph Playwright (Chromium) Standard
ahu Playwright + Camoufox Bot management (datacenter IP blocking)
oss_nib Playwright (Chromium) Standard

Install browser dependencies:

pip install ".[playwright]"
playwright install chromium

# For AHU (optional, improves success rate):
pip install camoufox && python -m camoufox fetch

API Keys

Module Key Required Env Var Registration
BPS Yes BPS_API_KEY webapi.bps.go.id/developer/register (free)
All others No โ€” โ€”

Without BPS_API_KEY, the BPS module returns an error envelope (not a crash):

{"status": "ERROR", "detail": "BPS_API_KEY not set. Register at ..."}

MCP Tool Inventory

All 11 modules expose 40 MCP tools total:

Module Tools Count
bpom check_bpom, search_bpom, get_bpom_status 3
bpjph check_halal_cert, lookup_halal_by_product, get_halal_status, cross_reference_halal_bpom 4
ahu lookup_company_ahu, get_company_directors, verify_company_status, search_companies_ahu 4
ojk check_ojk_license, search_ojk_institutions, get_ojk_status, check_ojk_waspada 4
oss_nib lookup_nib, verify_nib, search_oss_businesses 3
lpse lookup_vendor_lpse, search_lpse_vendors, search_lpse_tenders, get_lpse_portals 4
kpu get_candidate, search_kpu_candidates, get_election_results_kpu, get_campaign_finance_kpu 4
lhkpn get_lhkpn, search_lhkpn, compare_lhkpn, get_lhkpn_pdf 4
bps search_bps_datasets, get_bps_indicator, list_bps_regions 3
bmkg get_bmkg_alerts, get_weather_forecast, get_earthquake_history, get_latest_earthquake 4
simbg lookup_building_permit, search_permits_by_area, list_simbg_portals 3

AI Agent Integration

This repo is built for AI agents as first-class consumers.

For AI Coding Agents

File Purpose Agent
AGENTS.md Architecture, patterns, critical rules, gotchas All coding agents
CLAUDE.md Commands, do/don't rules, style guide Claude Code
.cursorrules Project rules for Cursor Cursor
.github/copilot-instructions.md Instructions for Copilot GitHub Copilot
CONTRIBUTING.md Module contract + PR checklist All
SKILL.md Skill discovery (AgentSkills format) Skill-aware agents
PROMPTS.md Example prompts + interactive artifact recipes All AI agents

Connect MCP Tools (Pick One)

Option A โ€” Self-hosted remote server (deploy your own):

Deploy on Railway

# After deploying to Railway/Fly/Render, add to Claude Code:
claude mcp add civic-stack --transport http https://your-deployment.up.railway.app/mcp

# Or Claude Desktop โ€” add to claude_desktop_config.json:
{
  "mcpServers": {
    "civic-stack": {
      "transport": "streamable-http",
      "url": "https://your-deployment.up.railway.app/mcp"
    }
  }
}

Note: There is no shared hosted server. Each user deploys their own instance to control proxy settings, rate limits, and API keys.

Option B โ€” Local install via pip:

pip install "indonesia-civic-stack[mcp]"
claude mcp add civic-stack -- civic-stack-mcp

Option C โ€” Clone repo (auto-discovery):

git clone https://github.com/suryast/indonesia-civic-stack.git
cd indonesia-civic-stack
pip install -e ".[mcp]"
claude  # Claude Code auto-detects .mcp.json โ€” 40 tools available immediately

All three options give you the same 40 tools. Then ask:

"Check if BPOM registration MD 123456789 is still active" "Search for companies named 'Maju Bersama' in the AHU registry" "What was the latest earthquake in Indonesia?"

See PROMPTS.md for more example prompts and interactive artifact recipes.

REST API

pip install "indonesia-civic-stack[api]"
civic-stack api --port 8000
# GET http://localhost:8000/bpom/search?q=paracetamol

Example Prompts

Once MCP tools are connected, try these with your AI agent:

Food Safety "Check if BPOM registration number MD 123456789 is still active" "Search for all paracetamol products registered with BPOM"

Halal Verification "Is product XYZ halal certified? Cross-reference with BPOM registration" "Find all halal certificates issued to PT Indofood"

Company Due Diligence "Look up PT Maju Bersama in the AHU company registry and check who the directors are" "Is this company OJK-licensed? Check both the license registry and the waspada (warning) list"

Public Finance "Search LHKPN wealth declarations for officials in Jakarta" "Find government procurement tenders for road construction on LPSE"

Disaster & Weather "What was the latest earthquake in Indonesia?" "Get the weather forecast for DKI Jakarta from BMKG"

Statistics "Find BPS datasets about poverty rates by province" "Get the inflation indicator for the last 5 years"

Multi-Source Queries "I want to verify a food company: check AHU for registration, OJK for financial license, BPOM for product registrations, and BPJPH for halal certificates" "Compare LHKPN wealth declarations for these two officials over the last 3 reporting periods"

Design Decisions for AI Agents

  1. Uniform response envelope โ€” every tool returns CivicStackResponse with the same fields. Agents don't need module-specific parsing logic.
  2. Error envelopes, not exceptions โ€” agents receive structured error info they can reason about, not stack traces.
  3. Self-documenting tools โ€” MCP tool descriptions include parameter types, expected values, and response format.
  4. Deterministic naming โ€” check_<module>, search_<module>, get_<module>_status pattern across all modules.

Security

Feature Config Default
API key auth CIVIC_API_KEY env var Disabled (open)
Rate limiting CIVIC_RATE_LIMIT env var 60 req/min per IP
Proxy allowlist CIVIC_ALLOWED_PROXIES env var Any non-private IP
SSRF prevention Built-in Blocks RFC 1918 + localhost
Container user Dockerfile Non-root (civicapp, uid 1000)
# Production deployment
export CIVIC_API_KEY="your-secret-key"
export CIVIC_RATE_LIMIT=30                          # 30 req/min
export CIVIC_ALLOWED_PROXIES="proxy.example.com"    # optional proxy allowlist
export PROXY_URL="socks5://id-proxy:1080"           # Indonesian proxy
uvicorn app:app --host 0.0.0.0 --port 8000

Docker

docker compose up                             # All modules
docker build -t civic-bpom civic_stack/bpom/      # Individual
docker run -p 8001:8000 -e CIVIC_API_KEY=secret -e PROXY_URL=socks5://proxy:1080 civic-bpom

Development

git clone https://github.com/suryast/indonesia-civic-stack.git
cd indonesia-civic-stack
python -m venv .venv && source .venv/bin/activate
pip install -e ".[all,dev]"
playwright install chromium

pytest -v              # VCR replay โ€” no live portal calls
ruff check .           # Lint
ruff format --check .  # Format check
mypy shared/           # Type check

Tests

pytest -v                       # 89 tests, VCR replay (no live calls)
pytest tests/bpom/ -v           # Single module
pytest --tb=short -q            # Quick summary
pie title Test Coverage (89 tests)
    "BPOM" : 7
    "BPJPH" : 8
    "AHU" : 12
    "OJK" : 4
    "KPU" : 5
    "LPSE" : 9
    "OSS-NIB" : 6
    "LHKPN" : 10
    "BPS" : 7
    "BMKG" : 8
    "SIMBG" : 7
    "Schema" : 6
Loading

Contributing

See CONTRIBUTING.md. Every module PR must include:

  • fetch() and search() returning CivicStackResponse
  • FastAPI router + FastMCP server
  • 3+ VCR test fixtures
  • Module README

A module that breaks for 60 days is flagged DEGRADED and archived.


Used By

  • halalkah.id โ€” Halal product verification (9.57M products)
  • legalkah.id โ€” Financial institution legality checker
  • datarakyat.id โ€” Landing page & documentation

Sample Architectures

Simple: Halal Product Checker

A single-page app that checks if a product is halal-certified. One module, no proxy needed for Indonesian users.

graph LR
    subgraph Client
        A[Mobile App / Web]
    end

    subgraph Your Server
        B[FastAPI]
        C[bpjph module]
    end

    subgraph Government Portal
        D[sertifikasi.halal.go.id]
    end

    A -->|POST /check| B
    B --> C
    C -->|scrape| D
    D -->|HTML| C
    C -->|CivicStackResponse| B
    B -->|JSON| A

    style A fill:#f9f9f9,stroke:#333
    style B fill:#e8f5e9,stroke:#2e7d32
    style C fill:#e8f5e9,stroke:#2e7d32
    style D fill:#fff3e0,stroke:#e65100
Loading
# app.py โ€” 15 lines, production-ready
from fastapi import FastAPI
from civic_stack.bpjph.scraper import fetch

app = FastAPI()

@app.get("/check/{product_id}")
async def check_halal(product_id: str):
    result = await fetch(product_id)
    return {"halal": result.found, "data": result.result}

Intermediate: Multi-Source Due Diligence API

A compliance tool that cross-checks a company across multiple government databases. Runs behind a proxy for overseas deployment.

graph TB
    subgraph Client
        A[Compliance Dashboard]
    end

    subgraph Your Infrastructure
        B[API Gateway]
        C[Due Diligence Service]
        D[ahu module]
        E[ojk module]
        F[bpom module]
        G[oss_nib module]
        H[(Redis Cache)]
    end

    subgraph Proxy Layer
        I[CF Worker Proxy]
    end

    subgraph Government Portals
        J[ahu.go.id]
        K[www.ojk.go.id]
        L[cekbpom.pom.go.id]
        M[oss.go.id]
    end

    A -->|GET /company/:name| B
    B --> C
    C --> H
    C --> D & E & F & G
    D & E & F & G -->|via PROXY_URL| I
    I --> J & K & L & M

    style A fill:#f9f9f9,stroke:#333
    style B fill:#e3f2fd,stroke:#1565c0
    style C fill:#e8f5e9,stroke:#2e7d32
    style D fill:#e8f5e9,stroke:#2e7d32
    style E fill:#e8f5e9,stroke:#2e7d32
    style F fill:#e8f5e9,stroke:#2e7d32
    style G fill:#e8f5e9,stroke:#2e7d32
    style H fill:#fce4ec,stroke:#c62828
    style I fill:#fff8e1,stroke:#f57f17
    style J fill:#fff3e0,stroke:#e65100
    style K fill:#fff3e0,stroke:#e65100
    style L fill:#fff3e0,stroke:#e65100
    style M fill:#fff3e0,stroke:#e65100
Loading
# due_diligence.py โ€” parallel checks across 4 portals
import asyncio
from civic_stack.ahu.scraper import search as ahu_search
from civic_stack.ojk.scraper import search as ojk_search
from civic_stack.bpom.scraper import search as bpom_search
from civic_stack.oss_nib.scraper import search as nib_search

async def check_company(name: str) -> dict:
    ahu, ojk, bpom, nib = await asyncio.gather(
        ahu_search(name),
        ojk_search(name),
        bpom_search(name),
        nib_search(name),
    )
    return {
        "company": name,
        "registered": any(r.found for r in ahu),
        "ojk_licensed": any(r.found for r in ojk),
        "bpom_products": len([r for r in bpom if r.found]),
        "nib_valid": any(r.found for r in nib),
        "risk_flags": _assess_risk(ahu, ojk, bpom, nib),
    }

Advanced: AI Agent with MCP Tools

An AI assistant that answers natural language questions about Indonesian civic data using MCP tools. The agent reasons about which portals to query.

sequenceDiagram
    participant User
    participant Agent as AI Agent (Claude/GPT)
    participant MCP as MCP Server
    participant SDK as civic-stack modules
    participant Proxy as CF Worker Proxy
    participant Gov as Government Portals

    User->>Agent: "Is PT Maju Bersama a legitimate company<br/>with halal certification?"

    Note over Agent: Agent reasons: need AHU (company)<br/>+ BPJPH (halal) + OJK (finance)

    Agent->>MCP: search_companies_ahu("PT Maju Bersama")
    MCP->>SDK: ahu.search()
    SDK->>Proxy: GET ahu.go.id/...
    Proxy->>Gov: Forward request
    Gov-->>Proxy: HTML response
    Proxy-->>SDK: Response
    SDK-->>MCP: CivicStackResponse
    MCP-->>Agent: {found: true, status: "ACTIVE", ...}

    Agent->>MCP: check_halal_cert("PT Maju Bersama")
    MCP->>SDK: bpjph.fetch()
    SDK->>Proxy: GET sertifikasi.halal.go.id/...
    Proxy-->>SDK: Response
    SDK-->>MCP: CivicStackResponse
    MCP-->>Agent: {found: true, status: "ACTIVE", ...}

    Agent->>MCP: check_ojk_license("PT Maju Bersama")
    MCP->>SDK: ojk.fetch()
    SDK-->>MCP: {found: false, status: "NOT_FOUND"}

    Note over Agent: Agent synthesizes results

    Agent->>User: "PT Maju Bersama is a registered company (AHU โœ…)<br/>with active halal certification (BPJPH โœ…).<br/>No OJK financial license found โ€” this is normal<br/>for non-financial companies."
Loading
# Connect MCP servers to Claude Desktop โ€” one command per module
claude mcp add civic-ahu   -- python -m civic_stack.ahu.server
claude mcp add civic-bpjph -- python -m civic_stack.bpjph.server
claude mcp add civic-ojk   -- python -m civic_stack.ojk.server

# Or run unified REST API for HTTP-based agents
PROXY_URL=https://your-proxy.workers.dev uvicorn app:app

Related

License

MIT โ€” see LICENSE

About

๐Ÿ‡ฎ๐Ÿ‡ฉ Production-ready Python SDK + MCP server for 14 Indonesian government data sources โ€” halal certs, company registry, procurement, budget, legal docs & more. 46 AI-agent tools.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Contributors