Stochastic Temporal Data Upscaling Using the Generalized k-Nearest Neighbor Algorithm
Author(s) -
John Mashford
Publication year - 2018
Publication title -
international journal of stochastic analysis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.19
H-Index - 28
eISSN - 2090-3340
pISSN - 2090-3332
DOI - 10.1155/2018/2487947
Subject(s) - mathematics , k nearest neighbors algorithm , series (stratigraphy) , variance (accounting) , yield (engineering) , set (abstract data type) , basis (linear algebra) , term (time) , algorithm , data set , time series , statistics , computer science , artificial intelligence , geometry , materials science , paleontology , accounting , physics , quantum mechanics , metallurgy , business , biology , programming language
Three methods of temporal data upscaling, which may collectively be called the generalized k-nearest neighbor (GkNN) method, are considered. The accuracy of the GkNN simulation of month by month yield is considered (where the term yield denotes the dependent variable). The notion of an eventually well-distributed time series is introduced and on the basis of this assumption some properties of the average annual yield and its variance for a GkNN simulation are computed. The total yield over a planning period is determined and a general framework for considering the GkNN algorithm based on the notion of stochastically dependent time series is described and it is shown that for a sufficiently large training set the GkNN simulation has the same statistical properties as the training data. An example of the application of the methodology is given in the problem of simulating yield of a rainwater tank given monthly climatic data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom