z-logo
open-access-imgOpen Access
Deep learning model for metagenome fragment classification using spaced k-mers feature extraction
Author(s) -
Nur Choiriyati,
Yandra Arkeman,
Wisnu Ananta Kusuma
Publication year - 2020
Publication title -
jurnal teknologi dan sistem komputer
Language(s) - English
Resource type - Journals
eISSN - 2620-4002
pISSN - 2338-0403
DOI - 10.14710/jtsiskom.2020.13407
Subject(s) - convolutional neural network , artificial intelligence , pattern recognition (psychology) , classifier (uml) , feature extraction , metagenomics , computer science , deep learning , artificial neural network , machine learning , biology , biochemistry , gene
An open challenge in bioinformatics is the analysis of the sequenced metagenomes from the various environments. Several studies demonstrated bacteria classification at the genus level using k-mers as feature extraction where the highest value of k gives better accuracy but it is costly in terms of computational resources and computational time. Spaced k-mers method was used to extract the feature of the sequence using 111 1111 10001 where 1 was a match and 0 was the condition that could be a match or did not match. Currently, deep learning provides the best solutions to many problems in image recognition, speech recognition, and natural language processing. In this research, two different deep learning architectures, namely Deep Neural Network (DNN) and Convolutional Neural Network (CNN), trained to approach the taxonomic classification of metagenome data and spaced k-mers method for feature extraction. The result showed the DNN classifier reached 90.89 % and the CNN classifier reached 88.89 % accuracy at the genus level taxonomy.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here