SOCIOECONOMIC STATUS MEASUREMENT WITH DISCRETE PROXY VARIABLES: IS PRINCIPAL COMPONENT ANALYSIS A RELIABLE ANSWER?
Author(s) - Kolenikov Stanislav, Angeles Gustavo
Publication year - 2009
Publication title - Review of Income and Wealth
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.024
H-Index - 57
eISSN - 1475-4991
pISSN - 0034-6586
DOI - 10.1111/j.1475-4991.2008.00309.x
Subject(s) - polychoric correlation, principal component analysis, econometrics, proxy (statistics), mathematics, statistics, ordinal data, data set, correlation, geometry
The last several years have seen a growth in the number of publications in economics that use principal component analysis (PCA) in the area of welfare studies. This paper explores the ways discrete data can be incorporated into PCA. The effects of discreteness of the observed variables on the PCA are reviewed. The statistical properties of the popular Filmer and Pritchett (2001) procedure are analyzed. The concepts of polychoric and polyserial correlations are introduced with appropriate references to the existing literature demonstrating their statistical properties. A large simulation study is carried out to compare various implementations of discrete data PCA. The simulation results show that the currently used method of running PCA on a set of dummy variables as proposed by Filmer and Pritchett (2001) can be improved upon by using procedures appropriate for discrete data, such as retaining the ordinal variables without breaking them into a set of dummy variables or using polychoric correlations. An empirical example using Bangladesh 2000 Demographic and Health Survey data helps in explaining the differences between procedures.
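As a rough illustration of two of the procedures the abstract contrasts, the following Python sketch computes a first-principal-component index once from ordinal codes kept as they are and once from a full set of dummy variables. It is not the authors' simulation design: the single latent "wealth" factor, the loading of 0.7, the cut points, the sample size, and the helper names `first_pc_scores`, `to_dummies`, and `simulate` are all assumptions made for this example. Polychoric correlations, the third option mentioned in the abstract, are not implemented here.

```python
import numpy as np


def first_pc_scores(X):
    """Scores on the first principal component of the correlation matrix of X."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    corr = np.corrcoef(Z, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)  # eigenvalues in ascending order
    weights = eigvecs[:, -1]                 # eigenvector of the largest eigenvalue
    return Z @ weights


def to_dummies(X_ord):
    """One 0/1 column per observed category of each ordinal variable."""
    cols = []
    for j in range(X_ord.shape[1]):
        for level in np.unique(X_ord[:, j]):
            cols.append((X_ord[:, j] == level).astype(float))
    return np.column_stack(cols)


def simulate(n=2000, p=6, loading=0.7, seed=0):
    """Hypothetical data: p ordinal proxies driven by one latent wealth factor."""
    rng = np.random.default_rng(seed)
    wealth = rng.standard_normal(n)
    noise = rng.standard_normal((n, p))
    latent = loading * wealth[:, None] + np.sqrt(1 - loading**2) * noise
    cuts = [-0.5, 0.5, 1.2]                  # assumed thresholds, 4 categories
    X_ord = np.digitize(latent, cuts)        # integer codes 0..3
    return wealth, X_ord


wealth, X_ord = simulate()

# Index from ordinal codes used directly as numeric variables.
score_ord = first_pc_scores(X_ord.astype(float))
# Index from dummy variables, one per category of each proxy.
score_dum = first_pc_scores(to_dummies(X_ord))

# The sign of an eigenvector is arbitrary, so compare absolute correlations.
r_ord = abs(np.corrcoef(wealth, score_ord)[0, 1])
r_dum = abs(np.corrcoef(wealth, score_dum)[0, 1])
print(f"|corr| with latent wealth, ordinal codes: {r_ord:.3f}")
print(f"|corr| with latent wealth, dummy variables: {r_dum:.3f}")
```

Because the latent factor is known in simulated data, the two printed correlations give a simple way to see how closely each index tracks it under these assumed settings. Results from one seed and one design should not be read as reproducing the paper's findings.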