Visual Similarity Assessment for Product Aesthetic Properties Using Single Reference Training | Zendy

Seok Young Hwang | Zendy; Juseong Kim | Zendy; Kicheol Pak | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Visual Similarity Assessment for Product Aesthetic Properties Using Single Reference Training

Author(s) -

Seok Young Hwang,

Juseong Kim,

Kicheol Pak

Publication year - 2025

Publication title -

ieee access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3594058

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

Traditional product aesthetic evaluation requires extensive preference labeling for each design, limiting scalability and increasing costs. This study presents a novel single reference training approach that enables aesthetic preference assessment through visual similarity to a single highly preferred product. Using 30 home audio speakers, we collected aesthetic preference ratings from 44 participants on a 7-point Likert scale. Four deep learning approaches—Pre-trained CNN, Auto-encoder, Siamese Network, and Triplet Network—measured visual similarity between themost preferred reference products and remaining samples. Results demonstrated significant correlations between visual similarity and aesthetic preference: the Triplet Network achieved Pearson correlation coefficient r = 0.448 ( p = 0.013), while Pre-trained CNN approach yielded r = 0.478 ( p = 0.008). After filtering high-variance products, correlations substantially improved to r = 0.738 ( p < 0.001). Principal component analysis of embedding vectors revealed interpretable aesthetic dimensions, with specific components significantly correlating with novelty ( r = 0.51 − 0.61), harmony ( r = 0.36 − 0.37), dynamics ( r = 0.55 − 0.65), and complexity ( r = 0.42 − 0.56). Interaction models of multiple principal components increased explanatory power ( R 2 = 0.4065), demonstrating that aesthetic preferences emerge from complex relationships among visual features. The findings prove that single reference training effectively extracts interpretable aesthetic properties from embedding spaces without explicit labeling. Our results suggest that highly preferred products contain multiple aesthetic preference-related properties, and deep learning models can successfully extract features corresponding to these properties. This approach provides an alternative aesthetic assessment from labor-intensive individual evaluation to efficient similarity-based inference, offering a methodology with reduced data requirements for product design evaluation and enabling evaluation of new designs.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research