Premium
A geostatistical method to account for the number of aliquots in composite samples for normal and lognormal random variables
Author(s) -
Orton T. G.,
Pringle M. J.,
Allen D. E.,
Dalal R. C.,
Bishop T. F. A.
Publication year - 2015
Publication title -
european journal of soil science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.244
H-Index - 111
eISSN - 1365-2389
pISSN - 1351-0754
DOI - 10.1111/ejss.12297
Subject(s) - log normal distribution , statistics , sample size determination , logarithm , sample (material) , mathematics , sampling (signal processing) , variable (mathematics) , composite number , variance (accounting) , point (geometry) , computer science , algorithm , mathematical analysis , chemistry , geometry , accounting , filter (signal processing) , chromatography , business , computer vision
Summary Geostatistical methods can be used to calculate predictions of soil variables at unsampled locations, but the methodology is typically based on samples collected on identical sample supports. In this paper, we provide and test theory that allows the inclusion of data from mixed sample supports in a single analysis. In particular, we consider composite sample supports that are defined by the number of aliquots used to form a single composite sample, n i , and the set of locations, x i , from which the aliquots were collected. We allow both n i and x i to vary between samples ( x i can vary in the extent and geometry of the aliquot locations), and thereby show how point data (a special case of composite data, defined by n i = 1 and x i as the known sample point) can be included in the same geostatistical analysis as composite data. A further complication arises when data are not normally distributed, rather their logarithm is. When composite sampling is used for such lognormal data, the sample support affects not only the variance but also the mean. We give the theory for normally distributed variables, and also derive an approximation that can be used when the point‐support variable is lognormal. We focus on this latter case, and test the approach with a series of simulation experiments. Finally, we illustrate the approach on a dataset of soil organic carbon ( SOC ) values from a grazing property in Queensland, Australia, where soil information from two measurement phases was obtained on different supports.