CCor: A whole genome network‐based similarity measure between two genes
Author(s) -
Hu Yiming,
Zhao Hongyu
Publication year - 2016
Publication title -
Biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/biom.12508
Subject(s) - measure (data warehouse) , computational biology , similarity (geometry) , genome , similarity measure , gene , computer science , biology , genetics , artificial intelligence , data mining , image (mathematics)
Summary Measuring the similarity between genes is often the starting point for building gene regulatory networks. Most similarity measures used in practice consider only pairwise information, and only a few also consider network structure. Although the theoretical properties of pairwise measures are well understood in the statistics literature, little is known about the statistical properties of similarity measures based on network structure. In this article, we consider a new whole genome network‐based similarity measure, called CCor, that makes use of information from all the genes in the network. We derive a concentration inequality for CCor and compare CCor with the commonly used Pearson correlation coefficient for inferring network modules. Both theoretical analysis and a real data example demonstrate the advantages of CCor over existing measures for inferring gene modules.
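
The summary does not state the formula for CCor, so the following Python sketch is only an illustration of the general idea of a similarity measure that uses all genes rather than a single pair. It assumes one plausible construction: compare two genes by correlating their Pearson correlation profiles against every other gene. This construction, the function names `pearson_matrix` and `profile_similarity`, and the simulated data are all assumptions made for illustration and should not be read as the article's definition of CCor.

```python
import numpy as np


def pearson_matrix(X):
    """Gene-by-gene Pearson correlation matrix; X has shape (n_samples, n_genes)."""
    return np.corrcoef(X, rowvar=False)


def profile_similarity(R, i, j):
    """Correlate the correlation profiles of genes i and j over all other genes.

    Assumed construction for illustration only, not the article's formula.
    """
    mask = np.ones(R.shape[0], dtype=bool)
    mask[[i, j]] = False  # drop the two genes being compared
    return np.corrcoef(R[i, mask], R[j, mask])[0, 1]


# Hypothetical data: genes 0-9 share a latent factor (one "module"),
# genes 10-29 are independent noise.
rng = np.random.default_rng(0)
n_samples, n_genes = 200, 30
factor = rng.normal(size=(n_samples, 1))
X = rng.normal(size=(n_samples, n_genes))
X[:, :10] += factor

R = pearson_matrix(X)
same_module = profile_similarity(R, 0, 1)   # both genes inside the module
different = profile_similarity(R, 0, 20)    # one gene inside, one outside

print("pairwise Pearson, same module:     ", R[0, 1])
print("profile similarity, same module:   ", same_module)
print("profile similarity, different:     ", different)
```

In this simulated setting, two genes in the same module have similar correlation profiles across the whole network, so a profile-based score can separate module members from non-members using information that a single pairwise correlation does not see.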