Forced to Learn: Discovering Disentangled Representations Without Exhaustive Labels

Alexey Romanov, Anna Rumshisky
DOI: https://doi.org/10.48550/arXiv.1705.00574
IF: 5.414
2017-05-01
Machine Learning
Abstract: Learning a better representation with neural networks is a challenging problem that has been tackled extensively from different perspectives in the past few years. In this work, we focus on learning a representation that can be used for a clustering task and introduce two novel loss components that substantially improve the quality of the produced clusters, are simple to apply to an arbitrary model and cost function, and do not require a complicated training procedure. We evaluate them on the two most common types of models, Recurrent Neural Networks and Convolutional Neural Networks, showing that the proposed approach consistently improves the quality of KMeans clustering in terms of the Adjusted Mutual Information score and outperforms previously proposed methods.
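The evaluation protocol named in the abstract (KMeans clustering scored by Adjusted Mutual Information) can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy are available and uses synthetic 2-D points as a stand-in for learned representations; the helper name `clustering_ami`, the cluster centers, and the noise level are hypothetical choices for illustration. It does not implement the paper's loss components.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import adjusted_mutual_info_score


def clustering_ami(features, labels, n_clusters, seed=0):
    """Cluster `features` with KMeans and score the result against `labels` using AMI."""
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    predicted = km.fit_predict(features)
    return adjusted_mutual_info_score(labels, predicted)


# Synthetic stand-in for a learned representation (assumption: three
# well-separated classes in 2-D; real representations would come from a model).
centers = np.array([[0.0, 0.0], [10.0, 10.0], [-10.0, 10.0]])
clean, labels = make_blobs(
    n_samples=300, centers=centers, cluster_std=1.0, random_state=0
)

# A degraded copy of the same points, mimicking a representation in which
# the classes are poorly separated.
rng = np.random.default_rng(0)
noisy = clean + rng.normal(scale=15.0, size=clean.shape)

ami_clean = clustering_ami(clean, labels, n_clusters=3)
ami_noisy = clustering_ami(noisy, labels, n_clusters=3)
print(f"AMI, well-separated features: {ami_clean:.3f}")
print(f"AMI, noisy features:          {ami_noisy:.3f}")
```

A representation that separates the classes more cleanly should score a higher AMI under this protocol, which is the sense in which the abstract speaks of improved cluster quality.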