z-logo
open-access-imgOpen Access
Integrating Deep Learning with Correlation-based Multimedia Semantic Concept Detection
Author(s) -
Hsin-Yu Ha
Publication year - 2015
Language(s) - English
Resource type - Dissertations/theses
DOI - 10.25148/etd.fidc000162
Subject(s) - computer science , convolutional neural network , feature (linguistics) , deep learning , process (computing) , semantic feature , artificial intelligence , machine learning , layer (electronics) , data mining , multimedia , information retrieval , philosophy , linguistics , chemistry , organic chemistry , operating system
OF THE DISSERTATION INTEGRATING DEEP LEARNING WITH CORRELATION-BASED MULTIMEDIA SEMANTIC CONCEPT DETECTION by Hsin-Yu Ha Florida International University, 2015 Miami, Florida Professor Shu-Ching Chen, Major Professor The rapid advances in technologies make the explosive growth of multimedia data possible and available to the public. Multimedia data can be defined as data collection, which is composed of various data types and different representations. Due to the fact that multimedia data carries knowledgeable information, it has been widely adopted to different genera, like surveillance event detection, medical abnormality detection, and many others. To fulfill various requirements for different applications, it is important to effectively classify multimedia data into semantic concepts across multiple domains. In this dissertation, a correlation-based multimedia semantic concept detection framework is seamlessly integrated with the deep learning technique. The framework aims to explore implicit and explicit correlations among features and concepts while adopting different Convolutional Neural Network (CNN) architectures accordingly. First, the Feature Correlation Maximum Spanning Tree (FC-MST) is proposed to remove the redundant and irrelevant features based on the correlations between the features and positive concepts. FC-MST identifies the effective features and decides the initial layer’s dimension in CNNs. Second, the Negative-based Sampling method is proposed to alleviate the data imbalance issue by keeping only the representative negative instances in the training process. To adjust different sizes of training data, the number of iterations for the CNN

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom