Production-ready scrapers, normalizers, and API wrappers for Indonesian government data sources.
The infrastructure layer beneath halalkah.id, legalkah.id, and a public good for the Indonesian civic tech and developer community.
Indonesian public data is nominally open but practically inaccessible. Every developer building civic tooling re-solves the same scraping problems independently: BPOM product registrations, BPJPH halal certificates, AHU company records. Scrapers bit-rot within months as portals change. There is no shared, maintained layer.
This repo is that layer. One pip install to query Indonesian government portals โ no more bespoke scrapers.
This SDK is designed for both humans and AI agents:
- ๐ค 46 MCP tools โ plug into Claude, GPT, or any MCP-compatible agent
- ๐ SKILL.md โ AI agent skill discovery (AgentSkills format)
- ๐งโ๐ป AGENTS.md โ architecture guide for coding agents (Claude Code, Codex, Cursor)
- ๐ CLAUDE.md โ Claude Code-specific instructions
- โ
Typed responses โ
CivicStackResponseenvelope, never raw dicts - ๐ Consistent patterns โ every module follows the same contract
graph TB
subgraph "Your App"
A[halalkah.id]
B[legalkah.id]
C[Your Project]
end
subgraph "civic-stack"
SDK[Python SDK]
MCP[MCP Servers]
API[REST API]
subgraph "Shared Layer"
SC[shared/schema.py<br/>CivicStackResponse]
HC[shared/http.py<br/>Rate limiting ยท Retries ยท Proxy]
end
subgraph "Phase 1"
BPOM[bpom<br/>Food & Drug]
BPJPH[bpjph<br/>Halal Certs]
AHU[ahu<br/>Company Registry]
end
subgraph "Phase 2"
OJK[ojk<br/>Financial Licenses]
OSS[oss_nib<br/>Business ID]
LPSE[lpse<br/>Procurement]
KPU[kpu<br/>Elections]
end
subgraph "Phase 3"
LHKPN[lhkpn<br/>Wealth Declarations]
BPS[bps<br/>Statistics]
BMKG[bmkg<br/>Weather & Disasters]
SIMBG[simbg<br/>Building Permits]
end
end
subgraph "Government Portals"
P1[cekbpom.pom.go.id]
P2[sertifikasi.halal.go.id]
P3[ahu.go.id]
P4[ojk.go.id]
P5[oss.go.id]
P6[lpse.*.go.id]
P7[infopemilu.kpu.go.id]
P8[elhkpn.kpk.go.id]
P9[webapi.bps.go.id]
P10[data.bmkg.go.id]
P11[simbg.pu.go.id]
end
A & B & C --> SDK & MCP & API
SDK & MCP & API --> SC
SC --> BPOM & BPJPH & AHU & OJK & OSS & LPSE & KPU & LHKPN & BPS & BMKG & SIMBG
BPOM & BPJPH & AHU & OJK & OSS & LPSE & KPU & LHKPN & BPS & BMKG & SIMBG --> HC
BPOM --> P1
BPJPH --> P2
AHU --> P3
OJK --> P4
OSS --> P5
LPSE --> P6
KPU --> P7
LHKPN --> P8
BPS --> P9
BMKG --> P10
SIMBG --> P11
sequenceDiagram
participant App as Your App
participant SDK as Civic SDK
participant HTTP as shared/http.py
participant Proxy as Proxy (optional)
participant Portal as Gov Portal
App->>SDK: search("paracetamol")
SDK->>HTTP: civic_client(proxy_url)
Note over HTTP: Auto-reads PROXY_URL<br/>from environment
alt rewrite mode (CF Worker)
HTTP->>Proxy: GET ?url=encoded_target
Proxy->>Portal: Forwarded request
Portal-->>Proxy: HTML/JSON response
Proxy-->>HTTP: Response
else connect mode (SOCKS/HTTP)
HTTP->>Proxy: CONNECT tunnel
Proxy->>Portal: Proxied request
Portal-->>HTTP: Response
else no proxy
HTTP->>Portal: Direct request
Portal-->>HTTP: Response
end
HTTP-->>SDK: httpx.Response
SDK->>SDK: Parse + Normalize
SDK-->>App: CivicStackResponse
| Module | Source | Data | Proxy | Status |
|---|---|---|---|---|
bpom |
cekbpom.pom.go.id | Food, drug, cosmetic registrations | ๐ | โ Active |
bpjph |
cmsbl.halal.go.id | Halal certificates (1.98M+ records) | ๐ | โ Active โ migrated to REST API (v1.0.0) |
ahu |
ahu.go.id | Company registry โ PT, CV, Yayasan, Koperasi | ๐ฎ๐ฉ | |
ojk |
www.ojk.go.id/waspada-investasi | Licensed financial institutions + Waspada list | ๐ฎ๐ฉ | |
oss_nib |
oss.go.id | Business identity (NIB) | ๐ฎ๐ฉ | |
lpse |
spse.inaproc.id | Government procurement | ๐ฎ๐ฉ | โ Active โ un-deprecated (v1.0.0) |
kpu |
infopemilu.kpu.go.id | Election data โ candidates, results, finance | ๐ | โ Active |
bps |
webapi.bps.go.id | Statistical datasets (1,000+) | ๐ | โ
Active (requires BPS_API_KEY) |
bmkg |
data.bmkg.go.id | Weather, earthquake, and disaster data | ๐ | โ Active |
simbg |
simbg.pu.go.id | Building permits (PBG) โ multi-portal | ๐ | โ Active |
jdih |
peraturan.go.id | National legal database โ UU, PP, Perpres, Permen | ๐ฎ๐ฉ | โ New โ Playwright scraping |
ksei |
web.ksei.co.id | Securities statistics (62 monthly PDFs) + registered securities | ๐ | โ New โ HTML scraping (no proxy needed) |
djpb |
data-apbn.kemenkeu.go.id | APBN budget themes โ target/realization/achievement | ๐ฎ๐ฉ | โ New โ clean REST JSON API |
lhkpn |
elhkpn.kpk.go.id | Wealth declarations (officials) | โ | โ Active โ reCAPTCHA v3 solved via Playwright |
๐ = works globally ย ๐ฎ๐ฉ = requires Indonesian proxy (set PROXY_URL)
Every module returns the same CivicStackResponse envelope โ swap data sources without touching application logic.
| Module | Scraper | Normalizer | MCP | Tests | Portal Status |
|---|---|---|---|---|---|
| bpom | โ | โ | โ | โ | โ |
| bpjph | โ | โ | โ | โ | โ REST API |
| ahu | โ | โ | โ | โ | |
| ojk | โ | โ | โ | โ | |
| oss_nib | โ | โ | โ | โ | |
| lpse | โ | โ | โ | โ | ๐ฎ๐ฉ geo-blocked |
| kpu | โ | โ | โ | โ | โ |
| bps | โ | โ | โ | โ | โ |
| bmkg | โ | โ | โ | โ | โ |
| simbg | โ | โ | โ | โ | โ |
| jdih | โ | โ | โ | โ | ๐ฎ๐ฉ Playwright |
| ksei | โ | โ | โ | โ | โ (no proxy needed) |
| djpb | โ | โ | โ | โ | โ REST JSON API |
| lhkpn | โ | โ | โ | โ | โ Active (Playwright) |
pip install indonesia-civic-stack # Core SDK
pip install "indonesia-civic-stack[mcp]" # + MCP server (40 tools)
pip install "indonesia-civic-stack[api]" # + REST API (FastAPI + uvicorn)
pip install "indonesia-civic-stack[all]" # Everythingimport asyncio
from civic_stack.bpom.scraper import search as bpom_search
from civic_stack.bmkg.scraper import get_latest_earthquake
async def main():
# Search BPOM product registry
results = await bpom_search("paracetamol")
for r in results:
if r.found:
print(r.result)
# Get latest earthquake
eq = await get_latest_earthquake()
print(eq.result) # {'date': '...', 'magnitude': '5.2', ...}
asyncio.run(main())All 14 modules expose 46 MCP tools for use with Claude, GPT, or any MCP-compatible agent.
# Install locally:
pip install "indonesia-civic-stack[mcp]"
claude mcp add civic-stack -- civic-stack-mcp
# Or deploy your own remote server (Railway one-click):
# See "Self-Hosted MCP Server" section belowMCP server classes support two init styles:
# Style 1: Explicit init
class BpomMCPServer(CivicStackMCPBase):
def __init__(self):
super().__init__("bpom")
# Style 2: Class attribute
class BmkgMCPServer(CivicStackMCPBase):
module_name = "bmkg"# Run all modules
uvicorn app:app --port 8000
# With API key auth (recommended)
CIVIC_API_KEY=your-secret-key uvicorn app:app --port 8000
# Individual module
uvicorn modules.bpom.app:app --port 8001
# With proxy
PROXY_URL=socks5://id-proxy:1080 uvicorn app:app --port 8000# Endpoints
GET /bpom/check/MD123456789012
GET /bpom/search?q=paracetamol
GET /bpjph/check/BPJPH-12345
GET /ahu/search?q=PT+Contoh+Indonesia
GET /ojk/check?name=Bank+BCA
GET /kpu/candidate/search?q=Joko
GET /lhkpn/search?q=Anies # โ
reCAPTCHA v3 solved via Playwright
GET /bps/search?q=inflasi # Requires BPS_API_KEY
GET /bmkg/weather?city=jakarta
GET /simbg/search?q=Jakarta+SelatanEvery module returns CivicStackResponse:
{
"result": {"product_name": "...", "registration_status": "ACTIVE"},
"found": true,
"status": "ACTIVE",
"confidence": 1.0,
"source_url": "https://cekbpom.pom.go.id/...",
"fetched_at": "2026-03-14T06:30:00Z",
"module": "bpom"
}Status values: ACTIVE, EXPIRED, SUSPENDED, REVOKED, NOT_FOUND, ERROR.
When a module can't reach its portal or is missing configuration (e.g., BPS_API_KEY), it returns an error envelope instead of crashing:
{
"result": null,
"found": false,
"status": "ERROR",
"confidence": 0.0,
"source_url": "https://webapi.bps.go.id",
"module": "bps",
"detail": "BPS_API_KEY not set. Register at https://webapi.bps.go.id/developer/register"
}civic_stack/bpom/
โโโ __init__.py
โโโ app.py # FastAPI application
โโโ normalizer.py # Raw HTML/JSON โ structured dict
โโโ router.py # FastAPI routes
โโโ scraper.py # fetch() + search() โ core logic
โโโ server.py # FastMCP MCP server
โโโ Dockerfile
โโโ README.md
The shared/ layer provides:
schema.pyโCivicStackResponsePydantic model, status enum, helper constructorshttp.pyโcivic_client()factory with auto-proxy, rate limiter, exponential backoff retry, URL rewriting for CF Worker proxiesmcp.pyโCivicStackMCPBaseabstract base class for MCP servers
Most Indonesian government portals (*.go.id) restrict access to Indonesian IP addresses. If deploying outside Indonesia, you must set PROXY_URL to route requests through an Indonesian endpoint.
# Option 1: Indonesian VPS/SOCKS proxy (recommended for production)
export PROXY_URL="socks5://id-proxy.example.com:1080"
export PROXY_MODE="connect"
# Option 2: CF Worker proxy (free, but limited โ see below)
export PROXY_URL="https://your-proxy.workers.dev"
# PROXY_MODE auto-detects "rewrite" for *.workers.devWithout a proxy, expect: DNS resolution failures, connection timeouts, or HTTP 403/404 responses from most modules.
The SDK auto-reads PROXY_URL from environment โ no code changes needed in scrapers or MCP servers.
| Mode | PROXY_URL example |
How it works |
|---|---|---|
connect |
socks5://id-proxy:1080 |
Standard HTTP/SOCKS CONNECT proxy via httpx transport |
rewrite |
https://x.workers.dev |
Rewrites URLs to ?url=<target> (auto-detected for *.workers.dev) |
none |
(unset) | Direct connection |
Override auto-detection with PROXY_MODE=connect|rewrite.
A ready-to-deploy CF Worker proxy is included in proxy/. Deploy with:
cd proxy && npx wrangler deploy
โ ๏ธ CF Worker limitation: Many.go.idportals are themselves behind Cloudflare. CF Workers makingfetch()calls to other CF-protected origins receive 403/522 errors. This is a known Cloudflare limitation.
Verified through CF Worker proxy:
| Portal | Status | Notes |
|---|---|---|
| data.bmkg.go.id | โ Works | JSON API, not behind CF |
| cekbpom.pom.go.id | โ 403/522 | Portal is CF-protected |
| api.ojk.go.id | โ DNS dead | NXDOMAIN since March 2026 |
| infopemilu.kpu.go.id | โ 403 | CF-protected |
| lpse.*.go.id | โ 403 | CF-protected |
| elhkpn.kpk.go.id | โ 200 | reCAPTCHA v3 solved via Playwright headless browser |
For production with CF-protected portals, use an Indonesian VPS with a SOCKS5/HTTP proxy and set PROXY_MODE=connect.
Tested from three locations to map which portals enforce geo-blocking vs WAF:
| Portal | Sydney (AU) | Singapore | Jakarta (ID) | Verdict |
|---|---|---|---|---|
| ahu.go.id | โ | โ | โ | Geo-blocked (SEA+ OK) |
| elhkpn.kpk.go.id | โ | โ | โ | Geo-blocked (SEA+ OK) |
| ojk.go.id | โ 403 | โ 403 | โ | ID-only |
| jaga.id (KPK) | โ | โ | โ | No restriction |
| data.bmkg.go.id | โ | โ | โ | No restriction |
| cekbpom.pom.go.id | CF-protected (all locations) | |||
| webapi.bps.go.id | โ 403 | โ 403 | โ 403 | WAF, not geo (needs API key) |
| lpse.lkpp.go.id | โ | โ | โ | Unreliable (all locations) |
| coretaxdjp.pajak.go.id | โ | โ | โ | Unreliable (all locations) |
Takeaway: An Indonesian proxy (e.g., CloudKilat Jakarta) unlocks OJK โ the most important geo-restricted portal. Singapore unlocks AHU + LHKPN. BPS and LPSE failures are not geo-related.
โ ๏ธ Never disable password auth and restart sshd in one automated script on a fresh VPS. If the SSH key wasn't copied correctly, you're locked out with no recovery path except a web console. Always: (1) copy key, (2) verify key login works in a separate session, (3) then disable password auth.
Indonesian government portals frequently change their URL structure without notice. Known changes as of March 2026:
| Module | Old URL | New URL | Status |
|---|---|---|---|
| BPOM | /index.php/home/produk/1/{keyword}/... |
/all-produk?q={keyword} |
โ Updated |
| KPU | /Pemilu/caleg/list |
/Pemilu/Peserta_pemilu |
โ Updated |
| BMKG | /DataMKG/MEWS/Warning/cuacasignifikan.json |
/DataMKG/TEWS/gempadirasakan.json |
โ Updated |
| LHKPN | /portal/user/check_search_announ |
reCAPTCHA v3 (Playwright) | ๐ข Active |
Modules that fail for 60 days are flagged DEGRADED and may be archived.
Some portals require a real browser (JavaScript rendering, anti-bot protection):
| Module | Browser | Anti-bot |
|---|---|---|
| bpjph | Playwright (Chromium) | Standard |
| ahu | Playwright + Camoufox | Bot management (datacenter IP blocking) |
| oss_nib | Playwright (Chromium) | Standard |
Install browser dependencies:
pip install ".[playwright]"
playwright install chromium
# For AHU (optional, improves success rate):
pip install camoufox && python -m camoufox fetch| Module | Key Required | Env Var | Registration |
|---|---|---|---|
| BPS | Yes | BPS_API_KEY |
webapi.bps.go.id/developer/register (free) |
| All others | No | โ | โ |
Without BPS_API_KEY, the BPS module returns an error envelope (not a crash):
{"status": "ERROR", "detail": "BPS_API_KEY not set. Register at ..."}All 11 modules expose 40 MCP tools total:
| Module | Tools | Count |
|---|---|---|
| bpom | check_bpom, search_bpom, get_bpom_status |
3 |
| bpjph | check_halal_cert, lookup_halal_by_product, get_halal_status, cross_reference_halal_bpom |
4 |
| ahu | lookup_company_ahu, get_company_directors, verify_company_status, search_companies_ahu |
4 |
| ojk | check_ojk_license, search_ojk_institutions, get_ojk_status, check_ojk_waspada |
4 |
| oss_nib | lookup_nib, verify_nib, search_oss_businesses |
3 |
| lpse | lookup_vendor_lpse, search_lpse_vendors, search_lpse_tenders, get_lpse_portals |
4 |
| kpu | get_candidate, search_kpu_candidates, get_election_results_kpu, get_campaign_finance_kpu |
4 |
| lhkpn | get_lhkpn, search_lhkpn, compare_lhkpn, get_lhkpn_pdf |
4 |
| bps | search_bps_datasets, get_bps_indicator, list_bps_regions |
3 |
| bmkg | get_bmkg_alerts, get_weather_forecast, get_earthquake_history, get_latest_earthquake |
4 |
| simbg | lookup_building_permit, search_permits_by_area, list_simbg_portals |
3 |
This repo is built for AI agents as first-class consumers.
| File | Purpose | Agent |
|---|---|---|
AGENTS.md |
Architecture, patterns, critical rules, gotchas | All coding agents |
CLAUDE.md |
Commands, do/don't rules, style guide | Claude Code |
.cursorrules |
Project rules for Cursor | Cursor |
.github/copilot-instructions.md |
Instructions for Copilot | GitHub Copilot |
CONTRIBUTING.md |
Module contract + PR checklist | All |
SKILL.md |
Skill discovery (AgentSkills format) | Skill-aware agents |
PROMPTS.md |
Example prompts + interactive artifact recipes | All AI agents |
Option A โ Self-hosted remote server (deploy your own):
# After deploying to Railway/Fly/Render, add to Claude Code:
claude mcp add civic-stack --transport http https://your-deployment.up.railway.app/mcp
# Or Claude Desktop โ add to claude_desktop_config.json:{
"mcpServers": {
"civic-stack": {
"transport": "streamable-http",
"url": "https://your-deployment.up.railway.app/mcp"
}
}
}Note: There is no shared hosted server. Each user deploys their own instance to control proxy settings, rate limits, and API keys.
Option B โ Local install via pip:
pip install "indonesia-civic-stack[mcp]"
claude mcp add civic-stack -- civic-stack-mcpOption C โ Clone repo (auto-discovery):
git clone https://github.com/suryast/indonesia-civic-stack.git
cd indonesia-civic-stack
pip install -e ".[mcp]"
claude # Claude Code auto-detects .mcp.json โ 40 tools available immediatelyAll three options give you the same 40 tools. Then ask:
"Check if BPOM registration MD 123456789 is still active" "Search for companies named 'Maju Bersama' in the AHU registry" "What was the latest earthquake in Indonesia?"
See PROMPTS.md for more example prompts and interactive artifact recipes.
pip install "indonesia-civic-stack[api]"
civic-stack api --port 8000
# GET http://localhost:8000/bpom/search?q=paracetamolOnce MCP tools are connected, try these with your AI agent:
Food Safety "Check if BPOM registration number
MD 123456789is still active" "Search for all paracetamol products registered with BPOM"
Halal Verification "Is product XYZ halal certified? Cross-reference with BPOM registration" "Find all halal certificates issued to PT Indofood"
Company Due Diligence "Look up PT Maju Bersama in the AHU company registry and check who the directors are" "Is this company OJK-licensed? Check both the license registry and the waspada (warning) list"
Public Finance "Search LHKPN wealth declarations for officials in Jakarta" "Find government procurement tenders for road construction on LPSE"
Disaster & Weather "What was the latest earthquake in Indonesia?" "Get the weather forecast for DKI Jakarta from BMKG"
Statistics "Find BPS datasets about poverty rates by province" "Get the inflation indicator for the last 5 years"
Multi-Source Queries "I want to verify a food company: check AHU for registration, OJK for financial license, BPOM for product registrations, and BPJPH for halal certificates" "Compare LHKPN wealth declarations for these two officials over the last 3 reporting periods"
- Uniform response envelope โ every tool returns
CivicStackResponsewith the same fields. Agents don't need module-specific parsing logic. - Error envelopes, not exceptions โ agents receive structured error info they can reason about, not stack traces.
- Self-documenting tools โ MCP tool descriptions include parameter types, expected values, and response format.
- Deterministic naming โ
check_<module>,search_<module>,get_<module>_statuspattern across all modules.
| Feature | Config | Default |
|---|---|---|
| API key auth | CIVIC_API_KEY env var |
Disabled (open) |
| Rate limiting | CIVIC_RATE_LIMIT env var |
60 req/min per IP |
| Proxy allowlist | CIVIC_ALLOWED_PROXIES env var |
Any non-private IP |
| SSRF prevention | Built-in | Blocks RFC 1918 + localhost |
| Container user | Dockerfile | Non-root (civicapp, uid 1000) |
# Production deployment
export CIVIC_API_KEY="your-secret-key"
export CIVIC_RATE_LIMIT=30 # 30 req/min
export CIVIC_ALLOWED_PROXIES="proxy.example.com" # optional proxy allowlist
export PROXY_URL="socks5://id-proxy:1080" # Indonesian proxy
uvicorn app:app --host 0.0.0.0 --port 8000docker compose up # All modules
docker build -t civic-bpom civic_stack/bpom/ # Individual
docker run -p 8001:8000 -e CIVIC_API_KEY=secret -e PROXY_URL=socks5://proxy:1080 civic-bpomgit clone https://github.com/suryast/indonesia-civic-stack.git
cd indonesia-civic-stack
python -m venv .venv && source .venv/bin/activate
pip install -e ".[all,dev]"
playwright install chromium
pytest -v # VCR replay โ no live portal calls
ruff check . # Lint
ruff format --check . # Format check
mypy shared/ # Type checkpytest -v # 89 tests, VCR replay (no live calls)
pytest tests/bpom/ -v # Single module
pytest --tb=short -q # Quick summarypie title Test Coverage (89 tests)
"BPOM" : 7
"BPJPH" : 8
"AHU" : 12
"OJK" : 4
"KPU" : 5
"LPSE" : 9
"OSS-NIB" : 6
"LHKPN" : 10
"BPS" : 7
"BMKG" : 8
"SIMBG" : 7
"Schema" : 6
See CONTRIBUTING.md. Every module PR must include:
fetch()andsearch()returningCivicStackResponse- FastAPI router + FastMCP server
- 3+ VCR test fixtures
- Module README
A module that breaks for 60 days is flagged DEGRADED and archived.
- halalkah.id โ Halal product verification (9.57M products)
- legalkah.id โ Financial institution legality checker
- datarakyat.id โ Landing page & documentation
A single-page app that checks if a product is halal-certified. One module, no proxy needed for Indonesian users.
graph LR
subgraph Client
A[Mobile App / Web]
end
subgraph Your Server
B[FastAPI]
C[bpjph module]
end
subgraph Government Portal
D[sertifikasi.halal.go.id]
end
A -->|POST /check| B
B --> C
C -->|scrape| D
D -->|HTML| C
C -->|CivicStackResponse| B
B -->|JSON| A
style A fill:#f9f9f9,stroke:#333
style B fill:#e8f5e9,stroke:#2e7d32
style C fill:#e8f5e9,stroke:#2e7d32
style D fill:#fff3e0,stroke:#e65100
# app.py โ 15 lines, production-ready
from fastapi import FastAPI
from civic_stack.bpjph.scraper import fetch
app = FastAPI()
@app.get("/check/{product_id}")
async def check_halal(product_id: str):
result = await fetch(product_id)
return {"halal": result.found, "data": result.result}A compliance tool that cross-checks a company across multiple government databases. Runs behind a proxy for overseas deployment.
graph TB
subgraph Client
A[Compliance Dashboard]
end
subgraph Your Infrastructure
B[API Gateway]
C[Due Diligence Service]
D[ahu module]
E[ojk module]
F[bpom module]
G[oss_nib module]
H[(Redis Cache)]
end
subgraph Proxy Layer
I[CF Worker Proxy]
end
subgraph Government Portals
J[ahu.go.id]
K[www.ojk.go.id]
L[cekbpom.pom.go.id]
M[oss.go.id]
end
A -->|GET /company/:name| B
B --> C
C --> H
C --> D & E & F & G
D & E & F & G -->|via PROXY_URL| I
I --> J & K & L & M
style A fill:#f9f9f9,stroke:#333
style B fill:#e3f2fd,stroke:#1565c0
style C fill:#e8f5e9,stroke:#2e7d32
style D fill:#e8f5e9,stroke:#2e7d32
style E fill:#e8f5e9,stroke:#2e7d32
style F fill:#e8f5e9,stroke:#2e7d32
style G fill:#e8f5e9,stroke:#2e7d32
style H fill:#fce4ec,stroke:#c62828
style I fill:#fff8e1,stroke:#f57f17
style J fill:#fff3e0,stroke:#e65100
style K fill:#fff3e0,stroke:#e65100
style L fill:#fff3e0,stroke:#e65100
style M fill:#fff3e0,stroke:#e65100
# due_diligence.py โ parallel checks across 4 portals
import asyncio
from civic_stack.ahu.scraper import search as ahu_search
from civic_stack.ojk.scraper import search as ojk_search
from civic_stack.bpom.scraper import search as bpom_search
from civic_stack.oss_nib.scraper import search as nib_search
async def check_company(name: str) -> dict:
ahu, ojk, bpom, nib = await asyncio.gather(
ahu_search(name),
ojk_search(name),
bpom_search(name),
nib_search(name),
)
return {
"company": name,
"registered": any(r.found for r in ahu),
"ojk_licensed": any(r.found for r in ojk),
"bpom_products": len([r for r in bpom if r.found]),
"nib_valid": any(r.found for r in nib),
"risk_flags": _assess_risk(ahu, ojk, bpom, nib),
}An AI assistant that answers natural language questions about Indonesian civic data using MCP tools. The agent reasons about which portals to query.
sequenceDiagram
participant User
participant Agent as AI Agent (Claude/GPT)
participant MCP as MCP Server
participant SDK as civic-stack modules
participant Proxy as CF Worker Proxy
participant Gov as Government Portals
User->>Agent: "Is PT Maju Bersama a legitimate company<br/>with halal certification?"
Note over Agent: Agent reasons: need AHU (company)<br/>+ BPJPH (halal) + OJK (finance)
Agent->>MCP: search_companies_ahu("PT Maju Bersama")
MCP->>SDK: ahu.search()
SDK->>Proxy: GET ahu.go.id/...
Proxy->>Gov: Forward request
Gov-->>Proxy: HTML response
Proxy-->>SDK: Response
SDK-->>MCP: CivicStackResponse
MCP-->>Agent: {found: true, status: "ACTIVE", ...}
Agent->>MCP: check_halal_cert("PT Maju Bersama")
MCP->>SDK: bpjph.fetch()
SDK->>Proxy: GET sertifikasi.halal.go.id/...
Proxy-->>SDK: Response
SDK-->>MCP: CivicStackResponse
MCP-->>Agent: {found: true, status: "ACTIVE", ...}
Agent->>MCP: check_ojk_license("PT Maju Bersama")
MCP->>SDK: ojk.fetch()
SDK-->>MCP: {found: false, status: "NOT_FOUND"}
Note over Agent: Agent synthesizes results
Agent->>User: "PT Maju Bersama is a registered company (AHU โ
)<br/>with active halal certification (BPJPH โ
).<br/>No OJK financial license found โ this is normal<br/>for non-financial companies."
# Connect MCP servers to Claude Desktop โ one command per module
claude mcp add civic-ahu -- python -m civic_stack.ahu.server
claude mcp add civic-bpjph -- python -m civic_stack.bpjph.server
claude mcp add civic-ojk -- python -m civic_stack.ojk.server
# Or run unified REST API for HTTP-based agents
PROXY_URL=https://your-proxy.workers.dev uvicorn app:app- indonesia-civic-signal-monitor โ Anomaly detection engine built on this SDK, monitors 11 government data sources for newsworthy changes
- indonesia-gov-apis โ Reference docs for 50+ Indonesian government APIs
- datarakyat.id โ Project homepage with full module documentation
MIT โ see LICENSE