Fix shuffle misalignment by basakbahcivanci · Pull Request #58 · IBM/AutoXAI4Omics

basakbahcivanci · 2025-09-08T12:31:44Z

This pull request fixes an index misalignment bug in which the target column in transformed_model_target_data.csv could fall out of sync with condition_binary (the original target) in the metadata after shuffling and splitting.
Changes:

autoxai4omics/utils/ml/data_split.py
Reset indices after train/test split to ensure targets remain correctly aligned with features and metadata.
autoxai4omics/utils/ml/preprocessing.py
Updated to preserve index consistency.
autoxai4omics/utils/ml/class_balancing.py
Replaced the direct imports (from numpy import ndarray, from pandas.core.frame import DataFrame) with import numpy as np and import pandas as pd because of an error. Improved error messages to show the types actually received, for easier debugging. Allowed y_train to be a pd.Series (for labels).
autoxai4omics/omics/tabular.py
Allowed y to be kept as a Series with a SampleID index.
autoxai4omics/utils/save.py
Updated to preserve consistent indices.
autoxai4omics/models/tabauto/keras_model.py, autoxai4omics/models/tabauto/lgbm_model.py, autoxai4omics/models/tabauto/xgboost_model.py
Updated the model wrappers to correctly handle input/target arrays with consistent indices, preventing downstream errors and misaligned labels.
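
To illustrate the kind of misalignment described above, here is a minimal sketch using only pandas and NumPy. It is not the code from this pull request: the toy data, the fixed shuffle order, and all variable names are assumptions made for illustration. It shows how pairing shuffled features with a target by position can mislabel samples, and how selecting the target by its SampleID index before resetting both indices together keeps them aligned.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; the values and names are illustrative only.
X = pd.DataFrame(
    {"feature": [0.1, 0.2, 0.3, 0.4]},
    index=pd.Index(["s1", "s2", "s3", "s4"], name="SampleID"),
)
y = pd.Series([0, 1, 0, 1], index=X.index, name="condition_binary")

# Shuffle the features with a fixed order so the result is reproducible.
order = ["s3", "s1", "s4", "s2"]
X_shuffled = X.loc[order]

# Bug pattern: a target left in its original order is paired by position
# with the shuffled features, so some rows receive the wrong label.
y_positional = y.to_numpy()

# Safer pattern: look the target up by SampleID so it follows the features.
y_aligned = y.loc[X_shuffled.index]

# True here: the positional pairing disagrees with the label-based one.
misaligned = not np.array_equal(y_positional, y_aligned.to_numpy())

# Only after both objects share the same order is it safe to reset
# the indices, and it should be done on both together.
X_out = X_shuffled.reset_index(drop=True)
y_out = y_aligned.reset_index(drop=True)
```

The design point is that a positional index is only trustworthy once features and target have been reordered by the same labels; resetting one of them earlier discards the information needed to realign them.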

basakbahcivanci added 2 commits September 8, 2025 12:24

fix: ensures targets and metadata remain aligned across preprocessing…

90c54bf

… and splitting Signed-off-by: Basak Bahcivanci <basakbahcivanci@gmail.com>

fix: ensures target and metadata indices remain aligned after preproc…

a86c668

…essing and splitting Signed-off-by: Basak Bahcivanci <basakbahcivanci@gmail.com>

basakbahcivanci wants to merge 2 commits into IBM:main from basakbahcivanci:fix-shuffle-misalignment
