Practical selection of representative sets of RNA-seq samples using a hierarchical approach

Open Access

Author(s) - Laura H. Tung, Carl Kingsford

Publication year - 2021

Publication title - Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btab315

Subject(s) - selection (genetic algorithm), computer science, RNA-seq, data mining, computational biology, artificial intelligence, biology, genetics, gene, transcriptome, gene expression

Although large databases hold numerous RNA-seq samples, most RNA-seq analysis tools are evaluated on only a limited number of them. This drives a need for methods that select a representative subset from all available RNA-seq samples to facilitate comprehensive, unbiased evaluation of bioinformatics tools. Sequence-based approaches to representative set selection (e.g. a k-mer counting approach that selects a subset based on k-mer similarities between RNA-seq samples) must contend with both the large number of available samples and the large number of k-mers/sequences in each sample. Computing the full similarity matrix from k-mers/sequences for the entire set of RNA-seq samples in a large database (e.g. the SRA) therefore poses memory and runtime challenges, which makes direct representative set selection infeasible with limited computing resources.
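To make the direct approach concrete, the following is a minimal Python sketch of k-mer-based representative selection on toy data. It is not the authors' method or their hierarchical approach: the function names (`kmer_set`, `jaccard`, `similarity_matrix`, `select_representatives`), the use of Jaccard similarity over distinct k-mers, and the greedy farthest-first selection rule are all assumptions chosen for illustration. The sketch builds the full pairwise similarity matrix, which is the step that becomes costly at database scale, since n samples require n(n-1)/2 pairwise comparisons.

```python
from itertools import combinations


def kmer_set(reads, k):
    """Collect the distinct k-mers found in a list of read sequences."""
    kmers = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmers.add(read[i:i + k])
    return kmers


def jaccard(a, b):
    """Jaccard similarity of two k-mer sets (an assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similarity_matrix(samples, k):
    """Full pairwise similarity matrix as a nested dict keyed by sample name.

    This is the quadratic step: every pair of samples is compared.
    """
    names = sorted(samples)
    sets = {name: kmer_set(samples[name], k) for name in names}
    sim = {name: {name: 1.0} for name in names}
    for x, y in combinations(names, 2):
        s = jaccard(sets[x], sets[y])
        sim[x][y] = s
        sim[y][x] = s
    return sim


def select_representatives(sim, n_select):
    """Greedy farthest-first selection (illustrative, not the paper's rule).

    Start from the sample with the highest total similarity to all samples,
    then repeatedly add the sample whose closest already-chosen sample is
    least similar to it.
    """
    names = sorted(sim)
    first = max(names, key=lambda n: sum(sim[n].values()))
    chosen = [first]
    while len(chosen) < min(n_select, len(names)):
        remaining = [n for n in names if n not in chosen]
        nxt = min(remaining, key=lambda n: max(sim[n][c] for c in chosen))
        chosen.append(nxt)
    return chosen


# Hypothetical toy samples: each sample is a list of read sequences.
toy_samples = {
    "A": ["ACGTACGT"],
    "B": ["ACGTACGA"],
    "C": ["TTTTGGGG"],
}
toy_sim = similarity_matrix(toy_samples, k=4)
toy_reps = select_representatives(toy_sim, n_select=2)
```

On the toy data, samples A and B share most of their 4-mers while C shares none with either, so picking two representatives yields one of the near-duplicates plus C. With real RNA-seq samples, both the per-sample k-mer sets and the number of sample pairs grow far beyond what this in-memory sketch can hold, which is the limitation the abstract describes.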
