The Ensembl Analysis Pipeline | Zendy

Simon Potter | Zendy; Laura Clarke | Zendy; Val Curwen | Zendy; Stephen Keenan | Zendy; Emmanuel Mongin | Zendy; Stephen M. J. Searle | Zendy; Arne Stabenau | Zendy; Roy Storey | Zendy; Michèle Clamp | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

The Ensembl Analysis Pipeline

Author(s) -

Simon Potter,

Laura Clarke,

Val Curwen,

Stephen Keenan,

Emmanuel Mongin,

Stephen M. J. Searle,

Arne Stabenau,

Roy Storey,

Michèle Clamp

Publication year - 2004

Publication title -

genome research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.556

H-Index - 297

eISSN - 1549-5469

pISSN - 1088-9051

DOI - 10.1101/gr.1859804

Subject(s) - ensembl , biology , pipeline (software) , computational biology , data science , genetics , computer science , programming language , genomics , gene , genome

The Ensembl pipeline is an extension to the Ensembl system which allows automated annotation of genomic sequence. The software comprises two parts. First, there is a set of Perl modules ("Runnables" and "RunnableDBs") which are 'wrappers' for a variety of commonly used analysis tools. These retrieve sequence data from a relational database, run the analysis, and write the results back to the database. They inherit from a common interface, which simplifies the writing of new wrapper modules. On top of this sits a job submission system (the "RuleManager") which allows efficient and reliable submission of large numbers of jobs to a compute farm. Here we describe the fundamental software components of the pipeline, and we also highlight some features of the Sanger installation which were necessary to enable the pipeline to scale to whole-genome analysis.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research