z-logo
Premium
Tests for mean vectors in high dimension
Author(s) -
MaboudouTchao Edgard M.,
Silva Ivair
Publication year - 2013
Publication title -
statistical analysis and data mining: the asa data science journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.381
H-Index - 33
eISSN - 1932-1872
pISSN - 1932-1864
DOI - 10.1002/sam.11209
Subject(s) - monte carlo method , test statistic , mathematics , statistic , dimension (graph theory) , multivariate normal distribution , covariance , independent and identically distributed random variables , sample size determination , statistical hypothesis testing , statistics , multivariate statistics , algorithm , combinatorics , random variable
Traditional multivariate tests, Hotelling's \documentclass{article}\usepackage{amsmath}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{amsfonts}\pagestyle{empty}\begin{document}$T^2$ \end{document} or Wilks \documentclass{article}\usepackage{amsmath}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{amsfonts}\pagestyle{empty}\begin{document}$\Lambda$ \end{document} , are designed for a test of the mean vector under the condition that the number of observations is larger than the number of variables. For high‐dimensional data, where the number of features is nearly as large as or larger than the number of observations, the existing tests do not provide a satisfactory solution because of the singularity of the estimated covariance matrix. In this article, we consider a test for the mean vector of independent and identically distributed multivariate normal random vectors where the dimension is larger than or equal to the number of observations. To solve this problem, we propose a modified Hotelling statistic. Simulation results show that the proposed test is superior to other tests available in the literature. However, because we do not know the theoretical distribution of this modified statistic, Monte Carlo methods were used to reach this conclusion. Instead of using conventional Monte Carlo methods, which perform a fixed‐number of simulations, we suggest using the sequential Monte Carlo test in order to decrease the number of simulations needed to reach a decision. Simulation results show that the sequential Monte Carlo test is preferable to a fixed‐sample test, especially when using computationally intensive statistical methods. © 2013 Wiley Periodicals, Inc. Statistical Analysis and Data Mining, 2013

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here