Automatic Discovery and Inferencing of Complex Bioinformatics Web Interfaces

Open Access

Author(s) - Anne H. H. Ngu, Daniel Rocco, Terence Critchlow, David Buttler

Publication year - 2005

Publication title - World Wide Web

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.464

H-Index - 42

eISSN - 1573-1413

pISSN - 1386-145X

DOI - 10.1007/s11280-005-0509-5

Subject(s) - computer science, web service, world wide web, interface (matter), genomics, information retrieval, maximum bubble pressure method, genome, parallel computing, biochemistry, chemistry, bubble, gene

The World Wide Web provides a vast resource to genomics researchers, with Web-based access to distributed data sources such as BLAST sequence homology search interfaces. However, finding the desired scientific information can still be tedious and frustrating. While several well-known genomic data servers (e.g., GenBank, EMBL, NCBI) are shared and accessed frequently, new data sources are created each day in laboratories all over the world. Sharing these new genomics results is hindered by the lack of a common interface or data exchange mechanism. Moreover, the number of autonomous genomics sources and their rate of change outpace the speed at which they can be manually identified, so the available data is not being used to its full potential. An automated system that can find, classify, describe, and wrap new sources, without tedious low-level coding of source-specific wrappers, is needed to help scientists access hundreds of dynamically changing bioinformatics Web data sources through a single interface. A correct classification of any kind of Web data source must address both the capability of the source and the conversation/interaction semantics inherent in its design. We propose the service class description (SCD), a metadata approach for classifying Web data sources that takes into account both the capability and the conversational semantics of the source. The ability to discover the interaction pattern of a Web source leads to increased accuracy in the classification process. Our results show that an SCD-based approach successfully classifies two-thirds of BLAST sites with 100% accuracy and two-thirds of bioinformatics keyword search sites with around 80% precision.
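The abstract does not give the SCD format, so the following is only a minimal sketch of the general idea: classify a source by checking both its capability (what inputs its form accepts) and its conversation pattern (what sequence of pages it returns). The class `ServiceClassDescription`, the function `classify`, the input-kind labels, and the regular expressions are all hypothetical names chosen for illustration, not the authors' implementation.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceClassDescription:
    """Hypothetical, simplified SCD: a capability plus a conversation pattern."""
    name: str
    required_inputs: frozenset  # input kinds the form must offer (capability)
    conversation: tuple         # regexes that successive response pages must match, in order


def classify(form_inputs, response_pages, scd):
    """Return True if a probed source matches the service class.

    form_inputs: set of input kinds found on the source's form.
    response_pages: text of the pages seen after a probe query, in order.
    """
    # Capability check: the form must expose every required input kind.
    if not scd.required_inputs <= set(form_inputs):
        return False
    # Conversation check: each expected step must match some later page, in order.
    # Sharing one iterator tolerates extra intermediate pages (e.g. "please wait").
    pages = iter(response_pages)
    for pattern in scd.conversation:
        if not any(re.search(pattern, page, re.IGNORECASE) for page in pages):
            return False
    return True


# Illustrative description of a two-step sequence-search service (assumed, not from the paper).
blast_like = ServiceClassDescription(
    name="sequence-search (illustrative)",
    required_inputs=frozenset({"sequence_textarea", "submit"}),
    conversation=(r"request id|queued", r"alignment|score"),
)

matches = classify(
    {"sequence_textarea", "submit", "database_select"},
    ["Your request ID is 42; job queued", "Please wait", "Alignment score: 97"],
    blast_like,
)
keyword_only = classify({"keyword_text", "submit"}, ["10 results found"], blast_like)
```

In this sketch a source that offers the right form but never shows the expected intermediate page is rejected, which is one simple way to picture how knowledge of the interaction pattern could sharpen classification.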
