Augmenting Character Designers Creativity Using Generative Adversarial Networks

Mohammad Lataifeh, Xavier Carrasco, Ashraf Elnagar, Naveed Ahmed
2023-05-28
Abstract: Recent advances in Generative Adversarial Networks (GANs) continue to attract the attention of researchers in different fields due to the wide range of applications devised to take advantage of their key features. Most recent GANs are focused on realism; however, generating hyper-realistic output is not a priority for some domains, as in the case of this work. The generated outcomes are used here as cognitive components to augment character designers' creativity while conceptualizing new characters for different multimedia projects. To select the best-suited GANs for such a creative context, we first present a comparison between different GAN architectures and their performance when trained from scratch on a new visual characters dataset using a single Graphics Processing Unit. We also explore alternative techniques, such as transfer learning and data augmentation, to overcome computational resource limitations, a challenge faced by many researchers in the domain. Additionally, mixed methods are used to evaluate the cognitive value of the generated visuals on character designers' agency in conceptualizing new characters. The results discussed proved highly effective for this context, as demonstrated by early adaptations to the character design process. As an extension of this work, the presented approach will be further evaluated as a novel co-design process between humans and machines to investigate where and how the generated concepts interact with and influence the design process outcome.
Human-Computer Interaction, Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
The paper addresses the problem of enhancing character designers' creativity through Generative Adversarial Networks (GANs). Specifically, the authors use GAN-generated character images as cognitive components that help designers find creative inspiration when conceptualizing new characters. The paper focuses on how to select and optimize suitable GAN architectures for this creative task under limited computational resources, and on how to evaluate the impact of the generated images on the designers' creative process.

### Main Issues:

1. **Enhancing character designers' creativity**: Providing designers with creative inspiration by generating new character images.
2. **Selecting suitable GAN architectures**: Comparing the performance of different GAN architectures in generating character images, especially under limited computational resources.
3. **Evaluating the creative value of generated images**: Assessing the quality and creative value of generated images through human expert reviews and quantitative metrics (such as FID scores).
4. **Exploring the human-machine co-creation design process**: Investigating how generated images interact with the designers' creative process and influence the final design outcomes.

### Solutions:

- **Dataset construction**: Constructed a new character image dataset, including character outlines and colored images.
- **Model selection and training**: Selected and trained various GAN architectures (such as DCGAN, WGAN, WGAN-GP, BigGAN-deep, StyleGAN2-ADA) and compared their performance.
- **Transfer learning**: Utilized pre-trained models (such as StyleGAN2-ADA) for transfer learning to reduce training time and improve generation quality.
- **Evaluation methods**: Evaluated the quality and creative value of generated images through FID scores and human expert reviews.
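
To make the FID metric mentioned above more concrete, here is a minimal sketch of the Fréchet distance between two Gaussians fitted to feature sets, which is the quantity FID is built on: the squared distance between the means plus Tr(Σ_r + Σ_g − 2(Σ_r Σ_g)^{1/2}). It assumes the image features have already been extracted (standard FID uses Inception-v3 activations, which this sketch does not compute). The function name `frechet_distance` and the array arguments are illustrative and not taken from the paper.

```python
import numpy as np


def frechet_distance(feats_real, feats_gen):
    """Fréchet distance between Gaussians fitted to two feature sets.

    Each argument is assumed to be an array of shape (n_samples, n_features)
    holding features that were extracted beforehand (hypothetical inputs).
    """
    mu_r = feats_real.mean(axis=0)
    mu_g = feats_gen.mean(axis=0)
    # atleast_2d keeps the covariance a matrix even for a single feature.
    cov_r = np.atleast_2d(np.cov(feats_real, rowvar=False))
    cov_g = np.atleast_2d(np.cov(feats_gen, rowvar=False))

    # Tr((cov_r cov_g)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of cov_r @ cov_g, which are real and non-negative for
    # covariance matrices; clip guards against tiny negative round-off.
    eigvals = np.linalg.eigvals(cov_r @ cov_g)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    mean_term = np.sum((mu_r - mu_g) ** 2)
    return float(mean_term + np.trace(cov_r) + np.trace(cov_g) - 2.0 * trace_sqrt)
```

Lower values indicate that the generated feature distribution sits closer to the real one. Two identical feature sets give a distance of approximately zero, and shifting every sample by a constant vector adds the squared length of that shift.
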
### Conclusions:

- **Performance comparison**: Early GAN models (such as DCGAN, WGAN, WGAN-GP) performed poorly under limited computational resources, while BigGAN-deep and StyleGAN2-ADA with transfer learning performed better.
- **Creative value**: The generated images received positive feedback in human expert reviews, indicating that they can effectively stimulate designers' creativity.
- **Future directions**: Further research is planned on the specific impact of generated images in the design process and on how designers with different experience levels interact with these images.

Through these methods, the paper not only demonstrates the potential application of GANs in the field of creative design but also proposes a novel human-machine co-creation design process.