A combined bioinformatics and LC-MS-based approach for the development and benchmarking of a comprehensive database of Lymnaea CNS proteins | Zendy

Sarah K. Wooller | Zendy; Aikaterini Anagnostopoulou | Zendy; Benno Kuropka | Zendy; Michael Crossley | Zendy; Paul R. Benjamin | Zendy; Frances M. G. Pearl | Zendy; Ildikó Kemenes | Zendy; György Kemenes | Zendy; Murat Eravci | Zendy

Open Access

A combined bioinformatics and LC-MS-based approach for the development and benchmarking of a comprehensive database of Lymnaea CNS proteins

Author(s) -

Sarah K. Wooller,

Aikaterini Anagnostopoulou,

Benno Kuropka,

Michael Crossley,

Paul R. Benjamin,

Frances M. G. Pearl,

Ildikó Kemenes,

György Kemenes,

Murat Eravci

Publication year - 2022

Publication title -

journal of experimental biology

Language(s) - English

Resource type - Journals

eISSN - 1477-9145

pISSN - 0022-0949

DOI - 10.1242/jeb.243753

Subject(s) - lymnaea stagnalis , proteomics , uniprot , proteome , biology , lymnaea , computational biology , database , identification (biology) , sequence database , protein data bank (rcsb pdb) , bioinformatics , model organism , computer science , genetics , gene , ecology , snail , biochemistry

Applications of key technologies in biomedical research, such as qRT-PCR or LC-MS-based proteomics, are generating large biological (-omics) datasets which are useful for the identification and quantification of biomarkers in any research area of interest. Genome, transcriptome and proteome databases are already available for a number of model organisms including vertebrates and invertebrates. However, there is insufficient information available for protein sequences of certain invertebrates, such as the great pond snail Lymnaea stagnalis, a model organism that has been used highly successfully in elucidating evolutionarily conserved mechanisms of memory function and dysfunction. Here, we used a bioinformatics approach to designing and benchmarking a comprehensive central nervous system (CNS) proteomics database (LymCNS-PDB) for the identification of proteins from the CNS of Lymnaea by LC-MS-based proteomics. LymCNS-PDB was created by using the Trinity TransDecoder bioinformatics tool to translate amino acid sequences from mRNA transcript assemblies obtained from a published Lymnaea transcriptomics database. The blast-style MMSeq2 software was used to match all translated sequences to UniProtKB sequences for molluscan proteins, including those from Lymnaea and other molluscs. LymCNS-PDB contains 9628 identified matched proteins that were benchmarked by performing LC-MS-based proteomics analysis with proteins isolated from the Lymnaea CNS. MS/MS analysis using the LymCNS-PDB database led to the identification of 3810 proteins. Only 982 proteins were identified by using a non-specific molluscan database. LymCNS-PDB provides a valuable tool that will enable us to perform quantitative proteomics analysis of protein interactomes involved in several CNS functions in Lymnaea, including learning and memory and age-related memory decline.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research