Towards Enhancing LightWeight GAN for Text-Guided Generation of Animated Character Faces | Zendy

Sameh Zarif | Zendy; Abdalfatah Najja | Zendy; Khalid Amin | Zendy; Abdullah Alharbi | Zendy; Wail S. Elkilani | Zendy; Mahmoud A. Shawky | Zendy; Marian Wagdy | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Towards Enhancing LightWeight GAN for Text-Guided Generation of Animated Character Faces

Author(s) -

Sameh Zarif,

Abdalfatah Najja,

Khalid Amin,

Abdullah Alharbi,

Wail S. Elkilani,

Mahmoud A. Shawky,

Marian Wagdy

Publication year - 2025

Publication title -

ieee access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3595928

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

Traditional text-guided image generation methods primarily focus on modifying existing images or altering specific elements, which limits their applicability. This paper introduces a significant enhancement to the LightWeight-GAN model, originally designed for image generation from random noise, by transforming it into a text-guided generative system. The proposed approach enables the generation of high-quality animated character faces directly from textual descriptions. To achieve this, we incorporate a mapping network that refines textual inputs before feeding them into the generator, ensuring more precise feature representation. Additionally, we integrate contrastive language-image pretraining (CLIP) to verify the generated images and enforce stronger alignment between textual prompts and visual outputs. The model is trained on a specialized facial dataset, demonstrating its ability to generate semantically accurate and visually compelling character faces. Extensive experiments using the CartoonSet dataset validate the effectiveness of our approach, achieving an FID score of 29.8 across 10,000 generated images. These improvements significantly outperform existing text-to-image generation models, making our system a promising tool for applications in game development, animation, and virtual reality.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research