RNANet: an automatically built dual-source dataset integrating homologous sequences and RNA structures | Zendy

Louis Becquey | Zendy; Éric Angel | Zendy; Fariza Tahi | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

RNANet: an automatically built dual-source dataset integrating homologous sequences and RNA structures

Author(s) -

Louis Becquey,

Éric Angel,

Fariza Tahi

Publication year - 2020

Publication title -

bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btaa944

Subject(s) - computer science , software , data mining , pipeline (software) , classifier (uml) , annotation , source code , artificial intelligence , information retrieval , machine learning , programming language , operating system

Applied research in machine learning progresses faster when a clean dataset is available and ready to use. Several datasets have been proposed and released over the years for specific tasks such as image classification, speech-recognition and more recently for protein structure prediction. However, for the fundamental problem of RNA structure prediction, information is spread between several databases depending on the level we are interested in: sequence, secondary structure, 3D structure or interactions with other macromolecules. In order to speed-up advances in machine-learning based approaches for RNA secondary and/or 3D structure prediction, a dataset integrating all this information is required, to avoid spending time on data gathering and cleaning.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research