Using a 3‐Layer Artificial Neural Network to Predict S‐Nitrosylation | Zendy

Anand Vijay | Zendy; Liu Ziping | Zendy; Gow Andrew | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Using a 3‐Layer Artificial Neural Network to Predict S‐Nitrosylation

Author(s) -

Anand Vijay,

Liu Ziping,

Gow Andrew

Publication year - 2021

Publication title -

the faseb journal

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.709

H-Index - 277

eISSN - 1530-6860

pISSN - 0892-6638

DOI - 10.1096/fasebj.2021.35.s1.05160

Subject(s) - computer science , artificial intelligence , s nitrosylation , artificial neural network , machine learning , data mining , chemistry , biochemistry , cysteine , enzyme

S‐nitrosylation is a post‐translational modification (PTM) in proteins that is critical for many biochemical processes in cells. Though experimental procedures exist for studying and determining S‐nitrosylation, computer modeling presents a more convenient and cost‐effective method to determine potential sites of nitrosylation. Previous studies have explored the use of different Machine Learning models to predict S‐nitrosylation in proteins; however, the type of Machine Learning methods and the selection of input features that will yield the best predictive performance are still being studied. The goal of this project was to optimize the performance of a 3‐layer Artificial Neural Network (ANN) using S‐nitrosylation primary protein structure data. Primary structure data was taken from the dbSNO database, with each protein entry processed into instances of cysteines that were and were not S‐nitrosylated. This entire dataset contained 4150 S‐nitrosylation instances and 18506 non‐S‐nitrosylation instances, which was then split into a training set and a testing set at a ratio of 70:30, respectively. The performance of the 3‐layer ANN was assessed via the Mathew Correlation Coefficient (MCC), Area Under the Receiver Operating Characteristic (AROC) curve, specificity, recall, accuracy, and precision values for the testing set. Mini‐batch sizes, iterations per run, number of hidden neurons per layer, and other hyper‐parameters were modified between runs to gauge improvements in the ANN performance. The 3‐layer ANN was able to achieve an average MCC, AROC, specificity, recall, accuracy, and precision of 0.265, 0.685, 0.784, 0.467, 0.625, and 0.687, respectively with 50 neurons at both hidden layers, 150 iterations, learning rate of 0.05, 64‐size mini‐batch, and window size of 40. As the ANN's performance cannot be significantly improved beyond the metrics presented above, the addition of secondary structure sequence data from Protein Data Bank (PDB) files may improve the predictive performance of the 3‐layer ANN and will be tested. The data presented above demonstrates the 3‐layer ANN's predictive performance and suggests that the use of additional data, such as secondary structure, could allow for improvement.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore