Towards a Bayesian Statistical Model for the Classification of the Causes of Data Loss
Author(s) -
Phillip M. Dickens,
Jeffery Peden
Publication year - 2005
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-29031-1
DOI - 10.1007/11557654_86
Subject(s) - computer science , packet loss , domain (mathematical analysis) , data mining , bayesian network , path (computing) , artificial intelligence , machine learning , data loss , naive bayes classifier , network packet , computer network , mathematical analysis , mathematics , support vector machine
Given the critical nature of communications in computational Grids it is important to develop efficient, intelligent, and adaptive communication mechanisms. An important milestone on this path is the development of classification mechanisms that can distinguish between the various causes of data loss in cluster and Grid environments. The idea is to use the classification mechanism to determine if data loss is caused by contention within the network or if the cause lies outside of the network domain. If it is outside of the network domain, then it is not necessary to trigger aggressive congestion-control mechanisms. Thus the goal is to operate the data transfer at the highest possible rate by only backing off aggressively when the data loss is classified as being network related. In this paper, we investigate one promising approach to developing such classification mechanisms based on the analysis of the patterns of packet loss and the application of Bayesian statistics.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom