@@ -636,3 +636,108 @@ Curated, searchable database of published biomarker signatures for cross-study c
636636- Compare mode: Plotly performance bars, Jaccard overlap matrix, common features, feature presence chart
637637- Route ` /signature-zoo ` in router.js, navbar link in App.vue
638638- 35 i18n keys in EN/FR
639+
640+ ---
641+
642+ ## Phase 6: gpredomics v1.0.0 Integration & Regression
643+
644+ _ Updated: 2026-03-31_
645+
646+ ### 50. Regression Support ✅ [ gpredomics #15 ] ( https://github.com/predomics/gpredomics/issues/15 ) , [ gpredomics #79 ] ( https://github.com/predomics/gpredomics/issues/79 )
647+ ** Priority:** HIGH | ** Effort:** High | ** Status:** Done
648+
649+ Data.y changed from Vec<u8 > to Vec<f64 >. 4 regression fitness functions (spearman, pearson, rmse, mutual_information). Regression-aware display, test metrics, CV fold splitting, voting skip.
650+
651+ ### 51. 4 New Optimization Algorithms ✅ [ gpredomics #61 ] ( https://github.com/predomics/gpredomics/issues/61 ) , [ #62 ] ( https://github.com/predomics/gpredomics/issues/62 ) , [ #65 ] ( https://github.com/predomics/gpredomics/issues/65 ) , [ #54 ] ( https://github.com/predomics/gpredomics/issues/54 )
652+ ** Priority:** HIGH | ** Effort:** High | ** Status:** Done
653+
654+ SA (Simulated Annealing), ILS (Iterated Local Search), LASSO/Elastic Net, ACO (Ant Colony Optimization). All integrated in web app.
655+
656+ ### 52. MCMC Gibbs Variable Selection ✅ [ gpredomics #73 ] ( https://github.com/predomics/gpredomics/issues/73 ) , [ #70 ] ( https://github.com/predomics/gpredomics/issues/70 )
657+ ** Priority:** HIGH | ** Effort:** High | ** Status:** Done
658+
659+ Joint feature+coefficient sampling replacing SBS. LASSO prescreen, parallel chains, golden section optimizer. MCMC from OOM/minutes to 5-11 seconds.
660+
661+ ### 53. Code Audit & Fixes ✅ [ gpredomics #75 ] ( https://github.com/predomics/gpredomics/issues/75 )
662+ ** Priority:** HIGH | ** Effort:** Medium | ** Status:** Done
663+
664+ 12 critical + 14 medium issues fixed. HashMap→BTreeMap for determinism. ClassificationMetrics refactor. -280 lines dead code removed.
665+
666+ ### 54. Metadata Upload & Variable Selection ✅ [ predomicsapp #1 ] ( https://github.com/predomics/predomicsapp/issues/1 )
667+ ** Priority:** HIGH | ** Effort:** Medium | ** Status:** Done
668+
669+ Upload metadata TSV, select numeric column as regression y, auto-switch to regression mode. Backend APIs for metadata column parsing and y extraction.
670+
671+ ### 55. Feature Selection in UI ✅
672+ ** Priority:** MEDIUM | ** Effort:** Low | ** Status:** Done
673+
674+ Exposed prevalence, adj_pvalue, selection method in Parameters tab. Documented adaptive FDR relaxation.
675+
676+ ### 56. Adaptive BH-FDR ✅
677+ ** Priority:** MEDIUM | ** Effort:** Low | ** Status:** Done
678+
679+ When strict FDR selects < 10 features, alpha relaxes progressively (0.05→0.1→0.2→0.5) with warnings. Fallback to top 10 by raw p-value.
680+
681+ ### 57. Wetlab Protocol Dataset ✅ [ gpredomics #60 ] ( https://github.com/predomics/gpredomics/issues/60 )
682+ ** Priority:** MEDIUM | ** Effort:** Low | ** Status:** Done
683+
684+ Paired study (459 subjects, 2 extraction protocols). 1,981 MSPs. Subject-level train/test split. Metadata with age, sex, BMI, Gram+/- counts, gene_count.
685+
686+ ### 58. Samples/Data Separation ✅
687+ ** Priority:** MEDIUM | ** Effort:** Low | ** Status:** Done
688+
689+ Bundled demos in samples/ (baked in image). User workspace in data/ (persistent volume for K8s).
690+
691+ ### 59. Documentation ✅
692+ ** Priority:** MEDIUM | ** Effort:** Medium | ** Status:** Done
693+
694+ 17-page PDF documentation. Vignette tutorial. All 7 algorithm docs with references. Fully documented param.yaml.
695+
696+ ---
697+
698+ ## Phase 7: Upcoming
699+
700+ ### 60. Multi-user Workspace Management [ predomicsapp #4 ] ( https://github.com/predomics/predomicsapp/issues/4 )
701+ ** Priority:** HIGH | ** Effort:** High | ** Status:** Open
702+
703+ Job concurrency limits (semaphore), per-user disk quotas, job timeouts, dataset deduplication, admin dashboard.
704+
705+ ### 61. Data Scanning Console Feedback [ predomicsapp #5 ] ( https://github.com/predomics/predomicsapp/issues/5 )
706+ ** Priority:** MEDIUM | ** Effort:** Low | ** Status:** Open
707+
708+ Show scanning progress during dataset load: feature count, class distribution, prevalence stats, warnings.
709+
710+ ### 62. Optuna Hyperparameter Optimization [ gpredomics #77 ] ( https://github.com/predomics/gpredomics/issues/77 )
711+ ** Priority:** MEDIUM | ** Effort:** Medium | ** Status:** Open
712+
713+ Bayesian hyperparameter tuning via Optuna in gpredomicspy. Search space for k_penalty, population_size, algorithm choice, etc.
714+
715+ ### 63. Multiclass Classification (OVO/OVA) [ gpredomics #52 ] ( https://github.com/predomics/gpredomics/issues/52 )
716+ ** Priority:** HIGH | ** Effort:** High | ** Status:** Open
717+
718+ One-vs-All and One-vs-One strategies for multi-class problems. Orchestrate K binary gpredomics runs.
719+
720+ ### 64. Clinical Data Integration [ gpredomics #51 ] ( https://github.com/predomics/gpredomics/issues/51 )
721+ ** Priority:** MEDIUM | ** Effort:** Medium | ** Status:** Open
722+
723+ Stacking, calibration, stratification of omics scores with clinical variables.
724+
725+ ### 65. MCMC SBS Fix [ gpredomics #81 ] ( https://github.com/predomics/gpredomics/issues/81 )
726+ ** Priority:** MEDIUM | ** Effort:** Medium | ** Status:** Open
727+
728+ SBS Bayesian evaluation produces inverted AUC after optimization changes.
729+
730+ ### 66. Auto-discover Feature Weights [ gpredomics #78 ] ( https://github.com/predomics/gpredomics/issues/78 )
731+ ** Priority:** LOW | ** Effort:** Medium | ** Status:** Open
732+
733+ Compare feature distributions between folds to auto-discover sampling weights.
734+
735+ ### 67. More Heuristics [ gpredomics #63 ] ( https://github.com/predomics/gpredomics/issues/63 ) , [ #64 ] ( https://github.com/predomics/gpredomics/issues/64 ) , [ #66 ] ( https://github.com/predomics/gpredomics/issues/66 )
736+ ** Priority:** LOW | ** Effort:** Medium | ** Status:** Open
737+
738+ PSO (Particle Swarm), EDA/UMDA, Bayesian Optimization.
739+
740+ ### 68. Island Model GA [ gpredomics #33 ] ( https://github.com/predomics/gpredomics/issues/33 )
741+ ** Priority:** LOW | ** Effort:** High | ** Status:** Open
742+
743+ Separate environments with periodic migration for diversity preservation.
0 commit comments