ID-GBA: Subgraph extension with Information Distance Guilt By Association in complex networks
Author(s) -
Predrag Obradovic,
Vladimir Kovacevic,
Aleksandar Milosavljevic,
Varduhi Petrosyan
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3622038
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
We introduce the ID-GBA (Information Distance Guilt By Association) method to expand highly connected sets of nodes by deploying a novel algorithm for subgraph extension based on the guilt-by-association principle and information distance. In this study, ID-GBA was utilized to expand disease clusters, and identify novel disease genes. We first validate its ability to expand related disease sets from disease/disease graphs built using Open Targets’ gene association scores. We then analyze disease/control gene expression networks and show that ID-GBA recaptures known disease genes in nine disease/control graphs. Compared to existing methods such as RandomWalk with Restarts and Personalized PageRank, ID-GBA achieves significantly higher Normalized Discounted Cumulative Gain scores, which indicates superior predictive performance in capturing known disease genes. Additionally, unlike other approaches that require users to specify either a threshold parameter or a fixed number of nodes to include in the extended subgraph, ID-GBA includes a built-in, automated, and data-driven thresholding mechanism. These results establish ID-GBA as a novel open-source tool to uncover hidden relationships in gene/gene, disease/disease, and other complex networks.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom