Malware Detection Using CNN via Word Embedding in Cloud Computing Infrastructure | Zendy

Rong Wang | Zendy; Cong Tian | Zendy; Yan Lin | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Malware Detection Using CNN via Word Embedding in Cloud Computing Infrastructure

Author(s) -

Rong Wang,

Cong Tian,

Yan Lin

Publication year - 2021

Publication title -

scientific programming

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.269

H-Index - 36

eISSN - 1875-919X

pISSN - 1058-9244

DOI - 10.1155/2021/8381550

Subject(s) - computer science , malware , cloud computing , softmax function , byte , word embedding , edge computing , convolutional neural network , embedding , word (group theory) , feature (linguistics) , feature extraction , computer security , data mining , artificial intelligence , operating system , linguistics , philosophy

The Internet of Things (IoT), cloud, and fog computing paradigms provide a powerful large-scale computing infrastructure for a variety of data and computation-intensive applications. These cutting-edge computing infrastructures, however, are nevertheless vulnerable to serious security and privacy risks. One of the most important countermeasures against cybersecurity threats is intrusion detection and prevention systems, which monitor devices, networks, and systems for malicious activity and policy violations. The detection and prevention systems range from antivirus software to hierarchical systems that monitor the traffic of whole backbone networks. At the moment, the primary defensive solutions are based on malware feature extraction. Most known feature extraction algorithms use byte N-gram patterns or binary strings to represent log files or other static information. The information taken from program files is expressed using word embedding (GloVe) and a new feature extraction method proposed in this article. As a result, the relevant vector space model (VSM) will incorporate more information about unknown programs. We utilize convolutional neural network (CNN) to analyze the feature maps represented by word embedding and apply Softmax to fit the probability of a malicious program. Eventually, we consider a program to be malicious if the probability is greater than 0.5; otherwise, it is a benign program. Experimental result shows that our approach achieves a level of accuracy higher than 98%.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research