
Hyperparameter Optimization for Problem-Based Custom CNN Architectures Using a Smart Grid Search Method
Author(s) -
H. Aktas
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3594212
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Recent advancements in deep learning architectures have made it possible to solve a wide range of classification problems present in industrial applications. However, many CNN architectures are complex, with large model sizes and high inference times, making them less suitable for deployment in resource-constrained environments. This study proposes an improved grid search based hyperparameter optimization method to design lightweight CNN architectures for task-specific classification problems. In order to test the performance of the proposed algorithm, a dataset of ripe and unripe pistachios was used. To classify the ripe and unripe pistachios with a small-sized and high test accuracy model , a two-layer CNN architecture’s hyperparameters were optimized with the proposed algorithm. The implementation of the proposed algorithm took 14 hours and 17 minutes, and the algorithm calculated 173 different CNN architectures with different parameters. Among these 173 CNN architectures, the model with higher test accuracy than the literature and the smallest size was determined to have a size of 4.74 MB and achieve 98.70% test accuracy. When compared to the literature, this model shows 0.26% improvement in test accuracy and 36.6 times smaller than the AlexNet model. The proposed method was also compared with Bayesian optimization using a similar search space. While Bayesian optimization achieved the same highest test accuracy (99.22%), it produced a model that was 3.7 times larger. Moreover, when small-sized CNN architectures with identical accuracies (98.19%) were compared, the proposed method produced a model that was 2.5 times smaller than the Bayesian result in terms of model size. These results demonstrate the success of the method in efficiently discovering lightweight, high-accuracy CNN architectures.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom