z-logo
open-access-imgOpen Access
Discovering <i>Escherichia coli</i> K-12 Promoter Features Using Convolutional Neural Network
Author(s) -
Mengmeng Zhang,
Lu Wang,
Ping Wan
Publication year - 2020
Publication title -
computational biology and bioinformatics
Language(s) - English
Resource type - Journals
eISSN - 2330-8281
pISSN - 2330-8265
DOI - 10.11648/j.cbb.20200801.13
Subject(s) - promoter , convolutional neural network , gene , computational biology , genome , genetics , biology , computer science , artificial intelligence , gene expression
The mechanism of prokaryotic gene expression remains incompletely understood. Promoters are regions in genome that locating upstream to genes and regulate of gene expressions. Despite more and more E. coli K-12 promoter sequences have been obtained experimentally, and some regions such as -10 region and -30 region have been described, the features in promoter sequences are far from explicitly characterized. Here, we address this challenge using an approach based on the deep convolutional neural network (CNN). We collected six classes of E. coli K-12 promoter sequences which are all annotated as with strong evidence and belong to only one promoter class in RegulonDB database. Then, we applied the CNN model to recognize the six classes of promoters. The CNN model achieved an accuracy of above 97% for all six classes of promoters. Next, we extracted the weight matrix of the last convolution layer in CNN with the Grad-Cam algorithm, and convert the weight matrix to an information content matrix. Finally, we visualized the information content matrix as promoter logos using the logomaker tool and discover the promoter features in six classes of promoters. Our approach could not only find the previous described promoter feature regions, but could also discover promoter features with better sensitivity and accuracy. We provide a novel computational approach to discover features in biological sequences.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom