Premium
UNITCELL : a nonlinear least‐squares program for cell‐parameter refinement and implementing regression and deletion diagnostics
Author(s) -
Holland T. J. B.,
Redfern S. A. T.
Publication year - 1997
Publication title -
journal of applied crystallography
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.429
H-Index - 162
ISSN - 1600-5767
DOI - 10.1107/s0021889896011673
Subject(s) - nonlinear regression , regression , non linear least squares , nonlinear system , least squares function approximation , computer science , regression analysis , statistics , algorithm , mathematics , explained sum of squares , physics , quantum mechanics , estimator
Computer Program Abstracts The category Computer Program Abstracts provides a rapid means of communicating up-to-date information concerning both new programs or systems and significant updates to existing ones. Following normal submission, a Computer Program Abstract will be reviewed by one or two members of the IUCr Commission on Crystallographic Computing. It should not exceed 500 words in length and should follow the standard format given on page 189 of the June 1985 issue of the Journal lJ. Appl. Cryst. (1985). 18, 189-190]. J. Appl. Cryst. (1997). 30, 84 UNITCELL: a nonlinear least-squares program for cell-parameter refinement implementing regression and deletion diagnostics T. J. B. HOLLAND AND S. A. T. REDFERN Department of Earth Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EQ, England. E-mail: tjbh @ esc.cam.ac.uk (Received 25 July 1996; accepted 4 October 1996) The crystallographic problem: During the refinement of unit-cell parameters from the observed positions of Bragg reflections in powder diffraction patterns, errors in the indexing and measurement of reflections are usually identified by their associated large residuals (e.g. 20ob s -20calc ). A more robust approach to the identification of such outliers in a data set is to employ regression diagnostics (Belsey, Kuh & Welsh, 1980; Powell, 1985; Holland & Redfern, 1997). The program UN/TCELL refines cell parameters by minimizing residuals in the experimentally determined Bragg position (in terms of 20hk I, d spacing dhk I or energy Ebkl) using a nonlinear leastsquares method. Regression diagnostics are provided that give information on the sensitivity of the refinement result to individual reflection positions (the leverage of each datum), as well as deletion diagnostics that indicate how deleterious each observed position is to the overall fit and automatically compute the effect of removing individual potentially deleterious observations from the data set. Hence, outliers may be identified, remedial action taken, and the overall refinement improved upon. Method of solution: From an indexed list of Bragg positions (in Qhkl, 20hkl, dhkl or Ehk I provided in the input data file), UNITCELL refines the unit-cell parameters by first obtaining an initial estimate of the cell parameters from minimization of residuals in Qhkl by linear least squares [given the crystal system and following Kelsey (1964)] and then conducting nonlinear least-squares refinement, minimiz© 1997 International Union of Crystallography Printed in Great Britain all rights reserved ing the residuals in the measured (or other user-specified) variable following the approach of Marquardt [as described by Bevington & Robinson (1992)]. This two-step approach therefore requires no initial guess for the cell parameters. The program then computes regression diagnostics for each reflection, determining its Hat (leverage or influence on the calculated parameters), the change in standard error of the residuals obtained by removing it from the data set (how harmful it is to the fit), the magnitude of its residual corrected for leverage, the influence it has on its own calculated position (which is large if it is an outlier), and the change expected in each calculated cell parameter should the reflection be removed from the data set. Used together, these diagnostics are particularly useful in assessing any individual sources of error in a cell-parameter refinement. The user has the option of outputting diagnostics for every datum, or for a limited number of data points that lie beyond statistically defined cutoff values that indicate that they may be either influential or deleterious to the fit. Software environment: UNITCELL will run under both PC Windows and Apple Macintosh system 6 onwards. The Macintosh version was developed using Symantec Think Pascal and the PC Windows version was implemented using Borland's Turbo Pascal for Windows. The Macintosh version is designed to be interactive with a user-friendly graphical user interface. The crystal system, data type ( Qhkl, 20 hkl, dhkl or Ehkl), wavelength or 20 angle, quantity to be minimized in the refinement (residuals in Qhkl, 2ehkl, dhkl or Ehkl) and input and output files are specified via an interactive input screen, with menu options. The PC Windows version is less elaborate, these specifications being incorporated within the first few lines of the input data file. The output results are displayed in their own text window, as well as being saved to a user-specified output file. The output file for the PC Windows version is plain text; the output file for the Macintosh version can be written in one of seven user-specified editor formats (MS Word, EDIT II, Vantage/McSink, BBEdit, QEDM, Generic Text or TeachText). Hardware environment: UNITCELL will run on any Apple Macintosh with or without FPU (both versions available; PPC users should employ the non-FPU version). The Windows version has been run on PC-compatible computers from 386 upwards. The executable PC Windows version is 80 kbytes. The source code comprises some 1991 lines of Pas-