Premium
Assessing the health of plants: simulation helps us understand observer disagreements
Author(s) -
Hutchinson T. P.
Publication year - 2000
Publication title -
environmetrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.68
H-Index - 58
eISSN - 1099-095X
pISSN - 1180-4009
DOI - 10.1002/(sici)1099-095x(200005/06)11:3<305::aid-env410>3.0.co;2-4
Subject(s) - statistics , bivariate analysis , joint probability distribution , multivariate normal distribution , mathematics , distribution (mathematics) , marginal distribution , normal distribution , econometrics , stability (learning theory) , quadratic equation , logarithm , computer science , multivariate statistics , machine learning , mathematical analysis , random variable , geometry
A dataset on the health of plants, as judged by two raters, appears to show more disagreement about the relatively healthy plants than about the less healthy. The bivariate normal distribution is shown to be a poor description of the data, and a new bivariate distribution is developed that gives a good fit to the data. The chief features of the distribution are that it is a model with latent variables in common (true score plus error for each rater), and that the scatter of error is greater when the true score is high than when it is low. As the distribution is fitted to the data by simulation, an explicit expression for the joint distribution of the two observed scores is not required. The software used has ranking and recoding commands of one line each, so it is easy to ensure the fitted marginal distributions exactly match the data, and it is unnecessary to estimate parameters representing the boundaries between the grades of rating. The method of obtaining the distribution is very flexible. In one illustration of this, the relation between true score and the logarithm of the scatter of error is quadratic (not linear); in another, the true score and the errors have non‐normal distributions. Copyright © 2000 John Wiley & Sons, Ltd.