Matrix correlations for high-dimensional data: the modified RV-coefficient
Author(s) -
Age K. Smilde,
Henk A. L. Kiers,
Sabina Bijlsma,
Carina M. Rubingh,
Marjan J. van Erk
Publication year - 2008
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btn634
Subject(s) - computer science , curse of dimensionality , matrix (chemical analysis) , functional genomics , data mining , pearson product moment correlation coefficient , matlab , dimensionality reduction , correlation coefficient , coefficient matrix , genomics , artificial intelligence , machine learning , mathematics , physics , statistics , biology , genome , eigenvalues and eigenvectors , chemistry , biochemistry , gene , operating system , chromatography , quantum mechanics
Modern functional genomics generates high-dimensional datasets. It is often convenient to have a single simple number characterizing the relationship between pairs of such high-dimensional datasets in a comprehensive way. Matrix correlations are such numbers and are appealing since they can be interpreted in the same way as Pearson's correlations familiar to biologists. The high-dimensionality of functional genomics data is, however, problematic for existing matrix correlations. The motivation of this article is 2-fold: (i) we introduce the idea of matrix correlations to the bioinformatics community and (ii) we give an improvement of the most promising matrix correlation coefficient (the RV-coefficient) circumventing the problems of high-dimensional data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom