z-logo
open-access-imgOpen Access
A Greedy Algorithm for Representative Sampling:repsampleinStata
Author(s) -
Evangelos Kontopantelis
Publication year - 2013
Publication title -
journal of statistical software
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 7.636
H-Index - 145
ISSN - 1548-7660
DOI - 10.18637/jss.v055.c01
Subject(s) - representativeness heuristic , sample (material) , population , selection bias , sampling (signal processing) , statistics , computer science , selection (genetic algorithm) , sampling bias , sample size determination , econometrics , variable (mathematics) , external validity , data mining , mathematics , artificial intelligence , demography , mathematical analysis , chemistry , filter (signal processing) , chromatography , sociology , computer vision
Quantitative empirical analyses of a population of interest usually aim to estimate the causal effect of one or more independent variables on a dependent variable. However, only in rare instances is the whole population available for analysis. Researchers tend to estimate causal effects on a selected sample and generalize their conclusions to the whole population. The validity of this approach rests on the assumption that the sample is representative of the population on certain key characteristics. A study using a non-representative sample is lacking in external validity by failing to minimize population choice bias. When the sample is large and non-response bias is not an issue, a random selection process is adequate to ensure external validity. If that is not the case, however, researchers could follow a more deterministic approach to ensure representativeness on the selected characteristics, provided these are known, or can be estimated, in the parent population. Although such approaches exist for matched sampling designs, research on representative sampling and the similarity between the sample and the parent population seems to be lacking. In this article we propose a greedy algorithm for obtaining a representative sample and quantifying representativeness in Stata.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom