Abstract: Machine learning (ML) is transforming medical research by improving diagnostic accuracy and personalizing treatments. General ML models trained on large datasets identify broad patterns across populations, but their effectiveness is often limited by the diversity of human biology. This limitation has led to interest in subject-specific models that use individual data for more precise predictions; however, such models are costly and challenging to develop. To address this, we propose a novel validation approach that uses a general ML model to ensure reproducible performance and robust feature importance analysis at both the group and subject-specific levels. We tested a single Random Forest (RF) model on nine datasets varying in domain, sample size, and demographics, applying different validation techniques to evaluate accuracy and feature importance consistency. To introduce variability, we performed up to 400 trials per subject, randomly seeding the ML algorithm for each trial. This generated up to 400 feature sets per subject, from which we identified the top subject-specific features. A group-specific feature importance set was then derived from all subject-specific results. We compared our approach with conventional validation methods in terms of performance and feature importance consistency. Our repeated-trials approach, with random seed variation, consistently identified key features at the subject level and improved group-level feature importance analysis using a single general model. The technique thus provides consistent feature importance and improved accuracy within a general ML model, offering a practical and explainable alternative to resource-intensive subject-specific models for clinical research.
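The repeated-trials procedure described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the top features are selected from the per-trial feature sets, so the sketch assumes a simple rule: count how often each feature appears among the top k of a trial and keep the most frequent ones. The group-level step is likewise assumed to be a count across subjects. All function names are hypothetical, and `make_toy_importance_fn` is a stand-in for fitting a Random Forest with a given seed (for example, reading `feature_importances_` from scikit-learn's `RandomForestClassifier(random_state=seed)`).

```python
import random
from collections import Counter


def top_k_features(importances, k):
    """Return the names of the k features with the largest importance."""
    return sorted(importances, key=importances.get, reverse=True)[:k]


def stable_subject_features(importance_fn, n_trials=400, k=3, base_seed=0):
    """Run seeded trials and keep the k features most often ranked in the top k.

    Assumption: selection by top-k frequency; the abstract does not specify the rule.
    """
    counts = Counter()
    for trial in range(n_trials):
        # Each trial uses a different seed, mimicking a re-seeded ML algorithm.
        importances = importance_fn(base_seed + trial)
        counts.update(top_k_features(importances, k))
    # Sort by descending count, breaking ties alphabetically for determinism.
    ranked = sorted(counts, key=lambda name: (-counts[name], name))
    return ranked[:k]


def group_features(subject_feature_sets, k=3):
    """Derive a group-level set from subject-level sets (assumed: count across subjects)."""
    counts = Counter(name for feats in subject_feature_sets for name in feats)
    ranked = sorted(counts, key=lambda name: (-counts[name], name))
    return ranked[:k]


def make_toy_importance_fn(signal, noise=0.05):
    """Hypothetical stand-in for a seeded model fit: fixed signal plus seed-dependent noise."""
    def importance_fn(seed):
        rng = random.Random(seed)
        return {name: value + rng.uniform(-noise, noise)
                for name, value in signal.items()}
    return importance_fn


# Two toy subjects with different underlying feature profiles.
subject_a = make_toy_importance_fn({"f1": 0.5, "f2": 0.3, "f3": 0.1, "f4": 0.1})
subject_b = make_toy_importance_fn({"f1": 0.4, "f2": 0.1, "f3": 0.4, "f4": 0.1})

per_subject = [stable_subject_features(fn, n_trials=400, k=2)
               for fn in (subject_a, subject_b)]
group = group_features(per_subject, k=1)
print(per_subject, group)
```

In this toy setup the seed-dependent noise is smaller than the gaps between feature importances, so each subject's top features are the same in every trial, and the one feature shared by both subjects is the one that survives at the group level. With a real model the counts would vary across trials, which is where the frequency threshold becomes informative.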

Stabilizing Machine Learning for Reproducible and Explainable Results: A Novel Validation Approach to Subject-Specific Insights

A Novel Bagging Approach for Variable Ranking and Selection Via a Mixed Importance Measure

Extended comparisons of best subset selection, forward stepwise selection, and the lasso

Stable Feature Selection with Applications to MALDI Imaging Mass Spectrometry Data

Forward stability and model path selection