 - [x] Training data v0.2 (balance + explicit negatives)
+  - Goal: reduce early UTI bias; encode negative GU evidence in respiratory cases and vice‑versa; strengthen URI patterns and mild/early variants.
+  - Acceptance criteria:
+    - [x] Generator emits JSONL to schema with balanced class counts (or class weights are configured). See `data/generate_v02.py` → `data/v02/cases_v02.jsonl`.
+    - [x] Respiratory cases explicitly mark `dysuria=0`, `frequency=0`; GU cases often mark `cough=0`, `rhinorrhea=0`, `congestion=0`.
+    - [x] URI patterns included (cough + rhinorrhea + congestion ± low fever, sore throat); GU patterns (dysuria + frequency).
+    - [x] Mild/early and atypical variants present; unknowns used appropriately.
+    - [x] Dataset versioned as v0.2; training pipeline `tools/train_pipeline.py` writes model to `models/enhanced_medical_model_v02.json`.
+    - [x] Re‑train + calibration performed; quick confusion/ECE report at `reports/metrics_v02.json` shows improved Resp vs GU separation.
 - Label policy
   - For each disease: criteria for “confirmed” (ICD‑10 + test) vs “presumptive”; clinician adjudication rules.
   - Acceptance criteria:
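
The v0.2 acceptance criteria above (balanced class counts, explicit cross-system negatives, unknowns where appropriate) can be illustrated with a minimal generator sketch. This is not the contents of `data/generate_v02.py`: the `label` field, the `URI`/`UTI` label strings, the `negative_rate` parameter, and the use of `null` for unknowns are all assumptions made for illustration.

```python
import json
import random
from collections import Counter

# Hypothetical feature groups; the real schema is defined in the repository.
RESP = ("cough", "rhinorrhea", "congestion")
GU = ("dysuria", "frequency")


def make_case(label, rng, negative_rate=0.8):
    """Build one synthetic case with explicit cross-system negatives."""
    case = {"label": label}
    if label == "URI":
        for name in RESP:
            case[name] = 1
        for name in GU:
            case[name] = 0  # respiratory cases always carry explicit GU negatives
    else:  # "UTI"
        for name in GU:
            case[name] = 1
        for name in RESP:
            # GU cases "often" mark respiratory negatives; otherwise unknown (None)
            case[name] = 0 if rng.random() < negative_rate else None
    return case


def generate(n_per_class, seed=0):
    """Emit the same number of cases per class, shuffled with a fixed seed."""
    rng = random.Random(seed)
    cases = [make_case(label, rng)
             for label in ("URI", "UTI")
             for _ in range(n_per_class)]
    rng.shuffle(cases)
    return cases


def to_jsonl(cases):
    """Serialize one JSON object per line."""
    return "\n".join(json.dumps(c, sort_keys=True) for c in cases)


cases = generate(n_per_class=50)
counts = Counter(c["label"] for c in cases)
jsonl_text = to_jsonl(cases)
```

Balancing by construction (a fixed `n_per_class`) sidesteps class weights entirely; if the real generator samples classes at their natural rates instead, the weights would need to be configured in training as the criterion allows.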
@@ -103,6 +114,7 @@ Acceptance:
   - Acceptance criteria:
     - Config toggles in `configs/training.yaml` enable Adam/L2/dropout; seed fixed.
     - Training summary logs include optimizer, regularization, and early stopping status.
+    - Class weighting or balanced sampling applied if class counts are imbalanced; documented in DATA_CARD.
 - Metrics & calibration
   - AUROC, AUPRC, F1, Top‑k, confusion per class.
   - Reliability diagrams + ECE; re‑tune temperature on held‑out set; subgroup calibration (age/sex/season). Add drift monitors for class priors and feature distributions.
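
The "ECE; re‑tune temperature on held‑out set" item can be sketched in plain Python as follows. This is an illustrative sketch, not the project's metrics module: the function names, the equal-width binning, and the grid search over temperatures are assumptions, and the toy logits are invented to show an overconfident model.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax with max-subtraction for stability."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def expected_calibration_error(probs, labels, n_bins=10):
    """Bin by top-class confidence; sum |accuracy - confidence| weighted by bin size."""
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        conf = max(p)
        pred = p.index(conf)
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, pred == y))
    n = len(labels)
    ece = 0.0
    for b in bins:
        if b:
            avg_conf = sum(c for c, _ in b) / len(b)
            acc = sum(1 for _, ok in b if ok) / len(b)
            ece += len(b) / n * abs(acc - avg_conf)
    return ece


def nll(logits_list, labels, temperature):
    """Mean negative log-likelihood of the true labels at a given temperature."""
    total = sum(math.log(softmax(z, temperature)[y])
                for z, y in zip(logits_list, labels))
    return -total / len(labels)


def tune_temperature(logits_list, labels, grid=None):
    """Pick the temperature on a grid (0.5 to 5.0) that minimizes held-out NLL."""
    grid = grid or [0.5 + 0.1 * i for i in range(46)]
    return min(grid, key=lambda t: nll(logits_list, labels, t))


# Toy held-out set: the model is ~98% confident but only 75% accurate.
held_out_logits = [[4.0, 0.0], [4.0, 0.0], [4.0, 0.0], [0.0, 4.0]]
held_out_labels = [0, 0, 1, 1]

ece_before = expected_calibration_error(
    [softmax(z) for z in held_out_logits], held_out_labels)
best_t = tune_temperature(held_out_logits, held_out_labels)
ece_after = expected_calibration_error(
    [softmax(z, best_t) for z in held_out_logits], held_out_labels)
```

Temperature scaling leaves the predicted class unchanged, so accuracy and the confusion matrix are unaffected; only the confidence values move. Subgroup calibration would repeat the ECE computation on each age/sex/season slice.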
@@ -532,14 +544,15 @@ medical_diagnosis_model/
 ## First actionable tasks (suggested order)

 1. Create `configs/clinical_schema.yaml` with symptom/vitals/labs mappings.
-2. Write `docs/label_policy.md`; wire gold labels (confirmed vs presumptive) into dataset.
-4. Implement patient‑ and time‑based splits with stratification; add class weighting.
-5. Add training toggles (Adam, L2, dropout) and fixed seeds via `configs/training.yaml`.
-6. Add metrics module (AUROC/AUPRC/F1/Confusion) and reliability diagram + ECE.
-7. Expand rules: Centor + CURB‑65; add “need more info” if entropy exceeds, or confidence falls below, a threshold.
-8. Build batch CLI to score CSV and emit results JSON/CSV.
-
-9. Implement adaptive questioning selector stub (`backend/selector/eig_selector.py`) with unit tests on toy distributions; wire a no‑UI CLI demo.
+2. Regenerate training data v0.2 (balanced counts or class weights; explicit negative GU for respiratory and vice‑versa; URI patterns; mild/early/atypical). Retrain + recalibrate; update DATA_CARD.
+3. Write `docs/label_policy.md`; wire gold labels (confirmed vs presumptive) into dataset.
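
Two items in the task list above, the entropy-based “need more info” rule and the adaptive questioning selector, rest on the same quantity: the entropy of the current disease posterior. The sketch below shows one way they might fit together on a toy distribution. It is not the contents of `backend/selector/eig_selector.py`; the function names, the yes/no question model, the likelihood values, and the 0.8-bit threshold are all hypothetical.

```python
import math


def entropy(p):
    """Shannon entropy in bits of a discrete distribution."""
    return -sum(x * math.log2(x) for x in p if x > 0)


def posterior(prior, likelihood_yes, answer):
    """Bayes update for a yes/no answer; likelihood_yes[i] = P(yes | disease i)."""
    like = [l if answer else 1 - l for l in likelihood_yes]
    joint = [p * l for p, l in zip(prior, like)]
    z = sum(joint)
    return [j / z for j in joint]


def expected_information_gain(prior, likelihood_yes):
    """Prior entropy minus the expected posterior entropy over both answers."""
    p_yes = sum(p * l for p, l in zip(prior, likelihood_yes))
    gain = entropy(prior)
    for answer, p_ans in ((True, p_yes), (False, 1 - p_yes)):
        if p_ans > 0:
            gain -= p_ans * entropy(posterior(prior, likelihood_yes, answer))
    return gain


def select_question(prior, questions):
    """Return the question name with the highest expected information gain."""
    return max(questions,
               key=lambda name: expected_information_gain(prior, questions[name]))


def needs_more_info(p, max_entropy_bits=0.8):
    """Flag a posterior as too uncertain to report a diagnosis."""
    return entropy(p) > max_entropy_bits


# Toy setup: two diseases (URI, UTI) and two candidate yes/no questions.
prior = [0.5, 0.5]
questions = {
    "dysuria": [0.05, 0.90],  # rarely yes in URI, usually yes in UTI
    "fever": [0.40, 0.40],    # equally likely in both, so uninformative
}
best = select_question(prior, questions)
updated = posterior(prior, questions[best], answer=True)
```

In this toy case the selector prefers `dysuria`, because an answer to `fever` cannot shift the posterior. Unit tests on toy distributions can assert exactly that kind of property: zero gain for uninformative questions, and a gain no larger than the prior entropy.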