z-logo
open-access-imgOpen Access
Quantitative Analysis of a Weak Correlation between Complicated Data on the Basis of Principal Component Analysis
Author(s) -
Tao Pang,
Haitao Zhang,
Liliang Wen,
Jun Tang,
Bing Zhou,
Qianxu Yang,
Yong Li,
Jiajun Wang,
Aiming Chen,
Zhongda Zeng
Publication year - 2021
Publication title -
journal of analytical methods in chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.407
H-Index - 25
eISSN - 2090-8865
pISSN - 2090-8873
DOI - 10.1155/2021/8874827
Subject(s) - principal component analysis , ellipse , basis (linear algebra) , correlation , similarity (geometry) , mathematics , pattern recognition (psychology) , covariance matrix , data mining , matrix (chemical analysis) , computer science , statistics , artificial intelligence , geometry , image (mathematics) , materials science , composite material
The mining of weak correlation information between two data matrices with high complexity is a very challenging task. A new method named principal component analysis-based multiconfidence ellipse analysis (PCA/MCEA) was proposed in this study, which first applied a confidence ellipse to describe the difference and correlation of such information among different categories of objects/samples on the basis of PCA operation of a single targeted data. This helps to find the number of objects contained in the overlapping and nonoverlapping areas of ellipses obtained from PCA runs. Then, a quantitative evaluation index of correlation between data matrices was defined by comparing the PCA results of more than one data matrix. The similarity and difference between data matrices was further quantified through comprehensively analyzing the outcomes. Complicated data of tobacco agriculture were used as an example to illustrate the strategy of the proposed method, which includes rich features of climate, altitude, and chemical compositions of tobacco leaves. The number of objects of these data reached 171,516 with 14, 4, and 5 descriptors of climate, altitude, and chemicals, respectively. On the basis of the new method, the complex but weak relationship between these independent and dependent variables were interestingly studied. Three widely used but conventional methods were applied for comparison in this work. The results showed the power of the new method to discover the weak correlation between complicated data.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom