z-logo
open-access-imgOpen Access
Visual Similarity Assessment for Product Aesthetic Properties Using Single Reference Training
Author(s) -
Seok Young Hwang,
Juseong Kim,
Kicheol Pak
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3594058
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Traditional product aesthetic evaluation requires extensive preference labeling for each design, limiting scalability and increasing costs. This study presents a novel single reference training approach that enables aesthetic preference assessment through visual similarity to a single highly preferred product. Using 30 home audio speakers, we collected aesthetic preference ratings from 44 participants on a 7-point Likert scale. Four deep learning approaches—Pre-trained CNN, Auto-encoder, Siamese Network, and Triplet Network—measured visual similarity between themost preferred reference products and remaining samples. Results demonstrated significant correlations between visual similarity and aesthetic preference: the Triplet Network achieved Pearson correlation coefficient r = 0.448 ( p = 0.013), while Pre-trained CNN approach yielded r = 0.478 ( p = 0.008). After filtering high-variance products, correlations substantially improved to r = 0.738 ( p < 0.001). Principal component analysis of embedding vectors revealed interpretable aesthetic dimensions, with specific components significantly correlating with novelty ( r = 0.51 − 0.61), harmony ( r = 0.36 − 0.37), dynamics ( r = 0.55 − 0.65), and complexity ( r = 0.42 − 0.56). Interaction models of multiple principal components increased explanatory power ( R 2 = 0.4065), demonstrating that aesthetic preferences emerge from complex relationships among visual features. The findings prove that single reference training effectively extracts interpretable aesthetic properties from embedding spaces without explicit labeling. Our results suggest that highly preferred products contain multiple aesthetic preference-related properties, and deep learning models can successfully extract features corresponding to these properties. This approach provides an alternative aesthetic assessment from labor-intensive individual evaluation to efficient similarity-based inference, offering a methodology with reduced data requirements for product design evaluation and enabling evaluation of new designs.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom