z-logo
open-access-imgOpen Access
Quantum Vision Theory in Deep Learning for Object Recognition
Author(s) -
Cem Direkoglu,
Melike Sah
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3592037
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
We introduce a new perspective and a theory, called quantum vision (QV) theory in deep learning, for object recognition. The proposed theory is based on particle-wave duality of quantum physics. In quantum-scale, an object appears as a wave until it is observed, but after observation the object collapses into a solid object, called particle. In quantum world, every object has a ’wave function’ that contains all the information about the object permitted by the uncertainty principle. Quantum-scale world looks different from our human-scale world. Attempts to relate the microscopic quantum world to our macroscopic world led to philosophical issues and questions. But what if the objects in human-scale world such as cats, dogs and bicycles have wave functions as well?And what will happen if we feed waves of objects to Deep Neural Networks (DNN) instead of collapsed still images of objects captured by cameras? This is the main contribution our work. Inspired from quantum physics, we introduce a new perspective and theory, called Quantum Vision (QV) theory in deep learning that is a completely new perspective for object recognition. The proposed QV theory takes captured still images of objects, and converts them to information wave functions using a deep learning block that is called QV block. The proposed QV block is integrated into sequential CNNs, vision transformers and convolutional vision transformer to generate QV model variants for object classification. Extensive experiments are carried out on several datasets, and results demonstrate that QV model variants perform consistently better than standalone versions.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom