
Towards Enhancing LightWeight GAN for Text-Guided Generation of Animated Character Faces
Author(s) -
Sameh Zarif,
Abdalfatah Najja,
Khalid Amin,
Abdullah Alharbi,
Wail S. Elkilani,
Mahmoud A. Shawky,
Marian Wagdy
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3595928
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Traditional text-guided image generation methods primarily focus on modifying existing images or altering specific elements, which limits their applicability. This paper introduces a significant enhancement to the LightWeight-GAN model, originally designed for image generation from random noise, by transforming it into a text-guided generative system. The proposed approach enables the generation of high-quality animated character faces directly from textual descriptions. To achieve this, we incorporate a mapping network that refines textual inputs before feeding them into the generator, ensuring more precise feature representation. Additionally, we integrate contrastive language-image pretraining (CLIP) to verify the generated images and enforce stronger alignment between textual prompts and visual outputs. The model is trained on a specialized facial dataset, demonstrating its ability to generate semantically accurate and visually compelling character faces. Extensive experiments using the CartoonSet dataset validate the effectiveness of our approach, achieving an FID score of 29.8 across 10,000 generated images. These improvements significantly outperform existing text-to-image generation models, making our system a promising tool for applications in game development, animation, and virtual reality.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom