Anime Style Space Exploration Using Metric Learning and Generative Adversarial Networks

Sitao Xiang,Hao Li
DOI: https://doi.org/10.48550/arXiv.1805.07997
2018-05-21
Abstract:Deep learning-based style transfer between images has recently become a popular area of research. A common way of encoding "style" is through a feature representation based on the Gram matrix of features extracted by some pre-trained neural network or some other form of feature statistics. Such a definition is based on an arbitrary human decision and may not best capture what a style really is. In trying to gain a better understanding of "style", we propose a metric learning-based method to explicitly encode the style of an artwork. In particular, our definition of style captures the differences between artists, as shown by classification performances, and such that the style representation can be interpreted, manipulated and visualized through style-conditioned image generation through a Generative Adversarial Network. We employ this method to explore the style space of anime portrait illustrations.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?