z-logo
open-access-imgOpen Access
SCIA: A Novel Gene Set Analysis Applicable to Data With Different Characteristics
Author(s) -
Yiqun Li,
Ying Wu,
Xiaohan Zhang,
Yunfan Bai,
Luqman Muhammad Akthar,
Xin Lü,
Ming Shi,
Jianxiang Zhao,
Qinghua Jiang,
Yu Li
Publication year - 2019
Publication title -
frontiers in genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.413
H-Index - 81
ISSN - 1664-8021
DOI - 10.3389/fgene.2019.00598
Subject(s) - computer science , permutation (music) , data mining , false discovery rate , set (abstract data type) , a priori and a posteriori , scope (computer science) , data set , artificial intelligence , gene , biology , genetics , philosophy , physics , epistemology , acoustics , programming language
Gene set analysis is commonly used in functional enrichment and molecular pathway analyses. Most of the present methods are based on the competitive testing methods which assume each gene is independent of the others. However, the false discovery rates of competitive methods are amplified when they are applied to datasets with high inter-gene correlations. The self-contained testing methods could solve this problem, but there are other restrictions on data characteristics. Therefore, a statistically rigorous testing method applicable to different datasets with various complex characteristics is needed to obtain unbiased and comparable results. We propose a self-contained and competitive incorporated analysis (SCIA) to alleviate the bias caused by the limited application scope of existing gene set analysis methods. This is accomplished through a novel permutation strategy using a priori biological networks to selectively permute gene labels with different probabilities. In simulation studies, SCIA was compared with four representative analysis methods (GSEA, CAMERA, ROAST, and NES), and produced the best performance in both false discovery rate and sensitivity under most conditions with different parameter settings. Further, the KEGG pathway analysis on two real datasets of lung cancer showed that the results found by SCIA in both of the two datasets are much more than that of GSEA and most of them could be supported by literature. Overall, SCIA promisingly offers researchers more reliable and comparable results with different datasets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom