Abstract:In machine learning applications for online product offerings and marketing strategies, there are often hundreds or thousands of features available to build such models. Feature selection is one essential method in such applications for multiple objectives: improving the prediction accuracy by eliminating irrelevant features, accelerating the model training and prediction speed, reducing the monitoring and maintenance workload for feature data pipeline, and providing better model interpretation and diagnosis capability. However, selecting an optimal feature subset from a large feature space is considered as an NP-complete problem. The mRMR (Minimum Redundancy and Maximum Relevance) feature selection framework solves this problem by selecting the relevant features while controlling for the redundancy within the selected features. This paper describes the approach to extend, evaluate, and implement the mRMR feature selection methods for classification problem in a marketing machine learning platform at Uber that automates creation and deployment of targeting and personalization models at scale. This study first extends the existing mRMR methods by introducing a non-linear feature redundancy measure and a model-based feature relevance measure. Then an extensive empirical evaluation is performed for eight different feature selection methods, using one synthetic dataset and three real-world marketing datasets at Uber to cover different use cases. Based on the empirical results, the selected mRMR method is implemented in production for the marketing machine learning platform. A description of the production implementation is provided and an online experiment deployed through the platform is discussed.

The Minimum Redundancy-Maximum Relevance Approach to Building Sparse Support Vector Machines

Feature Selection and Parameter Optimization for Support Vector Machines: A New Approach Based on Genetic Algorithm with Feature Chromosomes.

M-estimator Based Support Vector Machine and Its Application

MVMR-FS : Non-parametric feature selection algorithm based on Maximum inter-class Variation and Minimum Redundancy

Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction

A new improved maximal relevance and minimal redundancy method based on feature subset

Sparse Least Squares Support Vector Machine for Function Estimation

Sparse Least Absolute Deviation Support Vector Machine

MRM-Lasso: A Sparse Multiview Feature Selection Method Via Low-Rank Analysis.

Probabilistic Feature Selection and Classification Vector Machine

Modeling method of least squares support vector regression based on vector base learning

Discovering the Representative Subset with Low Redundancy for Hyperspectral Feature Selection

Maximum Relevance and Minimum Redundancy Feature Selection Methods for a Marketing Machine Learning Platform

Sparse Least Squares Low Rank Kernel Machines

A simple and reliable instance selection for fast training support vector machine: Valid Border Recognition

Semi-supervised feature selection by minimum neighborhood redundancy and maximum neighborhood relevancy

The mRMR variable selection method: a comparative study for functional data

Least Squares Support Vector Machine with Self-Organizing Multiple Kernel Learning and Sparsity.

Fast Pruning Superfluous Support Vectors in SVMs

Accelerated multi-kernel sparse stochastic optimization classifier algorithm for explainable prediction

2D Feature Selection by Sparse Matrix Regression