Premium
A new version of PRT software for sibling groups reconstruction with comments regarding several issues in the sibling reconstruction problem
Author(s) -
ALMUDEVAR ANTHONY,
ANDERSON ERIC C.
Publication year - 2012
Publication title -
molecular ecology resources
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.96
H-Index - 136
eISSN - 1755-0998
pISSN - 1755-098X
DOI - 10.1111/j.1755-0998.2011.03061.x
Subject(s) - sibling , biology , set (abstract data type) , biometrics , software , statistics , computer science , machine learning , artificial intelligence , mathematics , developmental psychology , psychology , programming language
Pedigree reconstruction using genotypic markers has become an important tool for the study of natural populations. The nonstandard nature of the underlying statistical problems has led to the necessity of developing specialized statistical and computational methods. In this article, a new version of pedigree reconstruction tools (PRT 2.0) is presented. The software implements algorithms proposed in Almudevar & Field (Journal of Agricultural Biological and Environmental Statistics, 4, 1999, 136) and Almudevar (Biometrics, 57, 2001a, 757) for the reconstruction of single generation sibling groups (SG). A wider range of enumeration algorithms is included, permitting improved computational performance. In particular, an iterative version of the algorithm designed for larger samples is included in a fully automated form. The new version also includes expanded simulation utilities, as well as extensive reporting, including half-sibling compatibility, parental genotype estimates and flagging of potential genotype errors. A number of alternative algorithms are described and demonstrated. A comparative discussion of the underlying methodologies is presented. Although important aspects of this problem remain open, we argue that a number of methodologies including maximum likelihood estimation (COLONY 1.2 and 2.0) and the set cover formulation (KINALYZER) exhibit undesirable properties in the sibling reconstruction problem. There is considerable evidence that large sets of individuals not genetically excluded as siblings can be inferred to be a true sibling group, but it is also true that unrelated individuals may be genetically compatible with a true sibling group by chance. Such individuals may be identified on a statistical basis. PRT 2.0, based on these sound statistical principles, is able to efficiently match or exceed the highest reported accuracy rates, particularly for larger SG. The new version is available at http://www.urmc.rochester.edu/biostat/people/faculty/almudevar.cfm.