A novel method to quantify gene set functional association based on gene ontology
Author(s) -
Sali Lv,
Yan Li,
Qianghu Wang,
Shangwei Ning,
Teng Huang,
Peng Wang,
Jie Sun,
Yan Zheng,
Weisha Liu,
Jing Ai,
Xia Li
Publication year - 2011
Publication title -
journal of the royal society interface
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.655
H-Index - 139
eISSN - 1742-5689
pISSN - 1742-5662
DOI - 10.1098/rsif.2011.0551
Subject(s) - gene ontology , computational biology , semantic similarity , similarity (geometry) , context (archaeology) , gene , gene annotation , set (abstract data type) , annotation , pairwise comparison , function (biology) , biology , bioinformatics , computer science , data mining , genetics , information retrieval , artificial intelligence , gene expression , genome , paleontology , image (mathematics) , programming language
Numerous gene sets have been used as molecular signatures for exploring the genetic basis of complex disorders. These gene sets are distinct but related to each other in many cases; therefore, efforts have been made to compare gene sets for studies such as those evaluating the reproducibility of different experiments. Comparison in terms of biological function has been demonstrated to be helpful to biologists. We improved the measurement of semantic similarity to quantify the functional association between gene sets in the context of gene ontology and developed a web toolkit named Gene Set Functional Similarity (GSFS; http://bioinfo.hrbmu.edu.cn/GSFS). Validation based on protein complexes for which the functional associations are known demonstrated that the GSFS scores tend to be correlated with sequence similarity scores and that complexes with high GSFS scores tend to be involved in the same functional catalogue. Compared with the pairwise method and the annotation method, the GSFS shows better discrimination and more accurately reflects the known functional catalogues shared between complexes. Case studies comparing differentially expressed genes of prostate tumour samples from different microarray platforms and identifying coronary heart disease susceptibility pathways revealed that the method could contribute to future studies exploring the molecular basis of complex disorders.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom