Discussion of ‘Stability Selection’, by Nicolai Meinshausen and Peter Bühlmann

John Shawe-Taylor,Shiliang Sun
2010-01-01
Abstract:We congratulate the authors on a paper with an exciting mix of novel theoretical insights and practical experimental testing and verification of the ideas. We provide a personal view of the developments introduced by the paper, mentioning some areas where further work might be usefully undertaken, before presenting some results assessing the generalisation performance of stability selection on a medical dataset.The paper introduces a general method for assessing the reliability of including component features in a model. They independently follow a similar line to that proposed by Bach (2008), in which the author proposes to run the Lasso algorithm using boostrap samples and only include features that occur in all of the models thus created. Meinshausen and Bühlmann refine this idea by assessing the probability that a feature is included in models created with random subsets of Ln/2C training examples. Features are included if this probability exceeds a threshold πthr.
What problem does this paper attempt to address?