Commit 3bc8350

docs: update ROADMAP with Phase 6 (done) and Phase 7 (open), all items linked to issues
1 parent dd9c219 commit 3bc8350

1 file changed: +105 -0 lines changed

ROADMAP.md

Lines changed: 105 additions & 0 deletions
@@ -636,3 +636,108 @@ Curated, searchable database of published biomarker signatures for cross-study c
- Compare mode: Plotly performance bars, Jaccard overlap matrix, common features, feature presence chart
- Route `/signature-zoo` in router.js, navbar link in App.vue
- 35 i18n keys in EN/FR
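
The Jaccard overlap matrix listed under compare mode can be illustrated with a minimal Python sketch. The function names and the dictionary layout are hypothetical, and treating two empty signatures as fully overlapping is an assumption, not something the roadmap specifies.

```python
def jaccard(a, b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two feature collections."""
    a, b = set(a), set(b)
    union = a | b
    # Assumption: two empty signatures count as identical (index 1.0).
    return len(a & b) / len(union) if union else 1.0


def jaccard_matrix(signatures):
    """Pairwise Jaccard index for a {name: features} mapping."""
    names = list(signatures)
    return {(p, q): jaccard(signatures[p], signatures[q])
            for p in names for q in names}
```

The matrix is symmetric with ones on the diagonal, so a UI only needs to render one triangle.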

---

## Phase 6: gpredomics v1.0.0 Integration & Regression

_Updated: 2026-03-31_

### 50. Regression Support ✅ [gpredomics#15](https://github.com/predomics/gpredomics/issues/15), [gpredomics#79](https://github.com/predomics/gpredomics/issues/79)
**Priority:** HIGH | **Effort:** High | **Status:** Done

`Data.y` changed from `Vec<u8>` to `Vec<f64>`. Four regression fitness functions (`spearman`, `pearson`, `rmse`, `mutual_information`). Regression-aware display, test metrics, and CV fold splitting; voting is skipped for regression.
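
As a rough illustration of three of the fitness functions named above, here is a pure-Python sketch. It is not the gpredomics implementation (which is not shown in this roadmap); the function names are hypothetical, and `mutual_information` is left out because it depends on a discretization choice the roadmap does not state.

```python
import math


def pearson(x, y):
    """Pearson correlation; returns 0.0 when either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def rmse(pred, y):
    """Root mean squared error between predictions and targets."""
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, y)) / len(y))
```

Note the different directions: the correlations are maximized, while RMSE is minimized, so a fitness wrapper would typically negate RMSE before comparing models.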

### 51. 4 New Optimization Algorithms ✅ [gpredomics#61](https://github.com/predomics/gpredomics/issues/61), [#62](https://github.com/predomics/gpredomics/issues/62), [#65](https://github.com/predomics/gpredomics/issues/65), [#54](https://github.com/predomics/gpredomics/issues/54)
**Priority:** HIGH | **Effort:** High | **Status:** Done

SA (Simulated Annealing), ILS (Iterated Local Search), LASSO/Elastic Net, ACO (Ant Colony Optimization). All four are integrated in the web app.

### 52. MCMC Gibbs Variable Selection ✅ [gpredomics#73](https://github.com/predomics/gpredomics/issues/73), [#70](https://github.com/predomics/gpredomics/issues/70)
**Priority:** HIGH | **Effort:** High | **Status:** Done

Joint feature and coefficient sampling replaces SBS. LASSO prescreen, parallel chains, golden-section optimizer. MCMC runs went from out-of-memory failures or minutes of runtime to 5-11 seconds.
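
The roadmap does not say which quantity the golden-section optimizer tunes, so the following is only a generic sketch of golden-section search for a one-dimensional unimodal function. The function name and tolerance are hypothetical.

```python
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # 1/phi, about 0.618


def golden_section_min(f, lo, hi, tol=1e-6):
    """Approximate minimizer of a unimodal function f on [lo, hi]."""
    c = hi - INV_PHI * (hi - lo)
    d = lo + INV_PHI * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            # Minimum lies in [lo, d]; the old c becomes the new d.
            hi, d, fd = d, c, fc
            c = hi - INV_PHI * (hi - lo)
            fc = f(c)
        else:
            # Minimum lies in [c, hi]; the old d becomes the new c.
            lo, c, fc = c, d, fd
            d = lo + INV_PHI * (hi - lo)
            fd = f(d)
    return (lo + hi) / 2
```

Each iteration reuses one of the two interior evaluations, so the interval shrinks by a factor of about 0.618 per single new call to `f`.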

### 53. Code Audit & Fixes ✅ [gpredomics#75](https://github.com/predomics/gpredomics/issues/75)
**Priority:** HIGH | **Effort:** Medium | **Status:** Done

12 critical and 14 medium issues fixed. `HashMap` replaced with `BTreeMap` for deterministic iteration order. `ClassificationMetrics` refactored. 280 lines of dead code removed.

### 54. Metadata Upload & Variable Selection ✅ [predomicsapp#1](https://github.com/predomics/predomicsapp/issues/1)
**Priority:** HIGH | **Effort:** Medium | **Status:** Done

Upload a metadata TSV and select a numeric column as the regression target `y`; the app switches to regression mode automatically. Backend APIs for metadata column parsing and `y` extraction.

### 55. Feature Selection in UI ✅
**Priority:** MEDIUM | **Effort:** Low | **Status:** Done

Exposed prevalence, `adj_pvalue`, and selection method in the Parameters tab. Documented adaptive FDR relaxation.

### 56. Adaptive BH-FDR ✅
**Priority:** MEDIUM | **Effort:** Low | **Status:** Done

When strict FDR selects fewer than 10 features, alpha relaxes progressively (0.05→0.1→0.2→0.5) with warnings. If no level yields 10 features, fall back to the top 10 by raw p-value.
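
The relaxation schedule described above can be sketched as follows. This is a minimal illustration assuming the standard Benjamini-Hochberg step-up rule; the function names, return shape, and warning text are hypothetical and not taken from the project.

```python
def bh_select(pvalues, alpha):
    """Indices rejected by the Benjamini-Hochberg step-up rule at level alpha."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff = 0  # largest rank k with p_(k) <= alpha * k / m
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= alpha * rank / m:
            cutoff = rank
    return order[:cutoff]


def adaptive_bh(pvalues, min_features=10, alphas=(0.05, 0.1, 0.2, 0.5)):
    """Relax alpha until at least min_features pass, else use raw p-values.

    Returns (selected indices, alpha used or None for the fallback, warnings).
    """
    warnings = []
    for alpha in alphas:
        selected = bh_select(pvalues, alpha)
        if len(selected) >= min_features:
            return selected, alpha, warnings
        warnings.append(f"alpha={alpha}: only {len(selected)} features pass")
    # Fallback: top features by raw p-value, with no FDR guarantee.
    order = sorted(range(len(pvalues)), key=lambda i: pvalues[i])
    warnings.append(f"fallback: top {min_features} by raw p-value")
    return order[:min_features], None, warnings
```

Returning the alpha that was actually used lets the caller surface how much the threshold was relaxed alongside the selected features.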

### 57. Wetlab Protocol Dataset ✅ [gpredomics#60](https://github.com/predomics/gpredomics/issues/60)
**Priority:** MEDIUM | **Effort:** Low | **Status:** Done

Paired study (459 subjects, 2 extraction protocols). 1,981 MSPs. Subject-level train/test split. Metadata with age, sex, BMI, Gram+/- counts, gene_count.
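
In a paired design, splitting at the subject level keeps both samples from one subject in the same partition, which avoids leakage between train and test. A minimal sketch, assuming a hypothetical mapping from sample ID to subject ID (the dataset's real identifiers and split ratio are not given in the roadmap):

```python
import random


def subject_level_split(sample_to_subject, test_fraction=0.2, seed=0):
    """Split sample IDs so that no subject appears in both partitions."""
    subjects = sorted(set(sample_to_subject.values()))
    rng = random.Random(seed)  # fixed seed for a reproducible split
    rng.shuffle(subjects)
    n_test = max(1, round(len(subjects) * test_fraction))
    test_subjects = set(subjects[:n_test])
    train = [s for s, subj in sample_to_subject.items()
             if subj not in test_subjects]
    test = [s for s, subj in sample_to_subject.items()
            if subj in test_subjects]
    return train, test
```

For example, with two protocols per subject, a sample named `S03_protocolA` and its partner `S03_protocolB` always land on the same side of the split.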

### 58. Samples/Data Separation ✅
**Priority:** MEDIUM | **Effort:** Low | **Status:** Done

Bundled demos in `samples/` (baked into the image). User workspace in `data/` (persistent volume for K8s).

### 59. Documentation ✅
**Priority:** MEDIUM | **Effort:** Medium | **Status:** Done

17-page PDF documentation. Vignette tutorial. All 7 algorithm docs with references. Fully documented `param.yaml`.

---

## Phase 7: Upcoming

### 60. Multi-user Workspace Management [predomicsapp#4](https://github.com/predomics/predomicsapp/issues/4)
**Priority:** HIGH | **Effort:** High | **Status:** Open

Job concurrency limits (semaphore), per-user disk quotas, job timeouts, dataset deduplication, admin dashboard.
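
A semaphore-based concurrency limit might look like the sketch below. It uses Python's standard `threading.BoundedSemaphore`; the limit value, the `run_job` body, and the peak counter are hypothetical, and the app's actual job runner may be structured differently.

```python
import threading
import time

MAX_CONCURRENT_JOBS = 2  # hypothetical limit
_job_slots = threading.BoundedSemaphore(MAX_CONCURRENT_JOBS)
_lock = threading.Lock()
_running = 0
peak_running = 0  # highest number of jobs observed running at once


def run_job(job_id, duration=0.05):
    """Run one job, waiting for a free slot before starting."""
    global _running, peak_running
    with _job_slots:  # blocks while MAX_CONCURRENT_JOBS jobs are active
        with _lock:
            _running += 1
            peak_running = max(peak_running, _running)
        time.sleep(duration)  # stand-in for the actual analysis
        with _lock:
            _running -= 1


def run_all(n_jobs):
    """Start n_jobs threads and return the peak concurrency observed."""
    threads = [threading.Thread(target=run_job, args=(i,)) for i in range(n_jobs)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return peak_running
```

Extra jobs simply queue on the semaphore, so the limit holds no matter how many are submitted at once.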

### 61. Data Scanning Console Feedback [predomicsapp#5](https://github.com/predomics/predomicsapp/issues/5)
**Priority:** MEDIUM | **Effort:** Low | **Status:** Open

Show scanning progress during dataset load: feature count, class distribution, prevalence stats, warnings.

### 62. Optuna Hyperparameter Optimization [gpredomics#77](https://github.com/predomics/gpredomics/issues/77)
**Priority:** MEDIUM | **Effort:** Medium | **Status:** Open

Bayesian hyperparameter tuning via Optuna in gpredomicspy. Search space for `k_penalty`, `population_size`, algorithm choice, etc.

### 63. Multiclass Classification (OVO/OVA) [gpredomics#52](https://github.com/predomics/gpredomics/issues/52)
**Priority:** HIGH | **Effort:** High | **Status:** Open

One-vs-All and One-vs-One strategies for multi-class problems. Orchestrate the binary gpredomics runs: K for One-vs-All, K(K-1)/2 for One-vs-One.
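
To make the two decompositions concrete, the sketch below builds the binary label vectors each strategy would hand to a binary run. It only prepares the tasks; how the runs are launched and how their outputs are combined is not specified in the roadmap, and the function names are hypothetical.

```python
from itertools import combinations


def one_vs_all_tasks(labels):
    """One binary task per class: that class (1) against all others (0)."""
    classes = sorted(set(labels))
    return {c: [1 if y == c else 0 for y in labels] for c in classes}


def one_vs_one_tasks(labels):
    """One binary task per class pair, restricted to samples of those two classes.

    Each value is (sample indices, binary labels), where the second class
    of the pair is coded as 1.
    """
    classes = sorted(set(labels))
    tasks = {}
    for a, b in combinations(classes, 2):
        idx = [i for i, y in enumerate(labels) if y in (a, b)]
        tasks[(a, b)] = (idx, [1 if labels[i] == b else 0 for i in idx])
    return tasks
```

With K classes this yields K One-vs-All tasks and K(K-1)/2 One-vs-One tasks; the latter are smaller because each one drops the samples of the other classes.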

### 64. Clinical Data Integration [gpredomics#51](https://github.com/predomics/gpredomics/issues/51)
**Priority:** MEDIUM | **Effort:** Medium | **Status:** Open

Stacking, calibration, and stratification of omics scores with clinical variables.

### 65. MCMC SBS Fix [gpredomics#81](https://github.com/predomics/gpredomics/issues/81)
**Priority:** MEDIUM | **Effort:** Medium | **Status:** Open

SBS Bayesian evaluation produces inverted AUC after optimization changes.
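
The roadmap does not describe the cause of the bug. One common reading of "inverted AUC" is a score whose sign or class orientation is flipped, which turns an AUC of a into 1 - a. The sketch below, with a hypothetical `auc` helper based on pairwise comparisons, shows that symptom; it is a diagnostic illustration, not the fix.

```python
def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


scores = [0.1, 0.4, 0.35, 0.8]
labels = [0, 0, 1, 1]
auc_normal = auc(scores, labels)
auc_flipped = auc([-s for s in scores], labels)  # sign flipped
```

An AUC well below 0.5 on data where the model is known to work is therefore a hint to check score sign and positive-class coding before suspecting the model itself.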

### 66. Auto-discover Feature Weights [gpredomics#78](https://github.com/predomics/gpredomics/issues/78)
**Priority:** LOW | **Effort:** Medium | **Status:** Open

Compare feature distributions between folds to auto-discover sampling weights.

### 67. More Heuristics [gpredomics#63](https://github.com/predomics/gpredomics/issues/63), [#64](https://github.com/predomics/gpredomics/issues/64), [#66](https://github.com/predomics/gpredomics/issues/66)
**Priority:** LOW | **Effort:** Medium | **Status:** Open

PSO (Particle Swarm), EDA/UMDA, Bayesian Optimization.

### 68. Island Model GA [gpredomics#33](https://github.com/predomics/gpredomics/issues/33)
**Priority:** LOW | **Effort:** High | **Status:** Open

Separate environments with periodic migration for diversity preservation.
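
The migration step of an island model can be sketched in a few lines. This assumes a ring topology and a best-replaces-worst policy, neither of which is stated in the roadmap; `migrate_ring` and its parameters are hypothetical.

```python
def migrate_ring(islands, fitness, n_migrants=1):
    """Copy each island's best individuals to the next island in a ring.

    Incoming migrants replace the receiving island's worst individuals,
    so every population keeps its size. Returns new island lists.
    """
    # Pick emigrants from the populations as they were before migration.
    best = [sorted(pop, key=fitness, reverse=True)[:n_migrants]
            for pop in islands]
    new_islands = []
    for i, pop in enumerate(islands):
        incoming = best[(i - 1) % len(islands)]
        survivors = sorted(pop, key=fitness, reverse=True)[:len(pop) - n_migrants]
        new_islands.append(survivors + incoming)
    return new_islands
```

Between migrations each island evolves independently; calling this only every few generations is what lets the populations diverge and preserves diversity.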
