Unsupervised Text-to-image Synthesis

Yanlong Dong,Ying Zhang,Lin Ma,Zhi Wang,Jiebo Luo
DOI: https://doi.org/10.1016/j.patcog.2020.107573
IF: 8
2020-01-01
Pattern Recognition
Abstract:•We make the first attempt to train one text-to-image synthesis model in an unsupervised manner.•A novel visual concept discrimination loss is proposed to train both generator and discriminator, which not only encourages the generated image expressing the local visual concepts but also ensures the noisy visual concepts contained in the pseudo sentence being suppressed.•One global semantic consistency loss is used to ensure that the generated image semantically corresponds to the input real sentence.•Our proposed model can generate pleasant image for one given sentence, with no reliance on any image-text pair data, which even outperforms some text-to-image synthesis models trained in the supervised manner.
What problem does this paper attempt to address?