
The symptom-syndrome analysis of multivariate categorical data based on Zhegalkin polynomials
Author(s) -
Н. П. Алексеева
Publication year - 2021
Publication title -
vestnik sankt-peterburgskogo universiteta. matematika. mehanika. astronomiâ/vestnik sankt-peterburgskogo universiteta. seriâ 1, matematika, mehanika, astronomiâ
Language(s) - English
Resource type - Journals
eISSN - 2587-5884
pISSN - 1025-3106
DOI - 10.21638/spbu01.2021.302
Subject(s) - majorization , mathematics , categorical variable , multistability , parameterized complexity , combinatorics , pure mathematics , discrete mathematics , statistics , physics , nonlinear system , quantum mechanics
In this article, we study the distribution, entropy and other informational properties of finite projective subspaces (syndromes) parameterized by impulse sequences with basic elements in the form of symptoms - polynomials over the field F2 which are known as Zhegalkin polynomials. It has been proven that the super syndrome, which is a linear syndrome with basic elements in the form of a multiplicative syndrome, is closed. If in the multiplication of two symptoms one is neutral, then we are talking about its majorization. The ordered by majorization symptoms form a majorized syndrome. Is proved that the majorized syndrome is closed and coincides with the super syndrome. The statements formulated in the first part of the paper are used to justify the convergence of the iterative procedure (PI), in which the most informative symptoms selected from partial super syndromes are again used in the next step. The stationary state of PI is obtained if all elements of the input set belong to either the same partial super syndrome or to the majorized syndrome. Thanks IP it is possible to quickly find the optimal syndrome from a large set of variables. An example from phthisiology shows how the specificity of classification can be improved using symptom analysis.