Non‐inferiority trials: the ‘at least as good as’ criterion | Zendy

Laster Larry L. | Zendy; Johnson Mary F. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Non‐inferiority trials: the ‘at least as good as’ criterion

Author(s) -

Laster Larry L.,

Johnson Mary F.

Publication year - 2002

Publication title -

statistics in medicine

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.996

H-Index - 183

eISSN - 1097-0258

pISSN - 0277-6715

DOI - 10.1002/sim.1137

Subject(s) - mathematics , confidence interval , null hypothesis , statistics , equivalence (formal languages) , statistical hypothesis testing , statistical significance , dimensionless quantity , medicine , discrete mathematics , physics , mechanics

To demonstrate in a clinical trial that a new or experimental therapy (et) is ‘at least as good as’ a standard therapy (st), a statistical test or confidence interval procedure must rule out clinical inferiority with a high probability. The term ‘at least as good as’ implies equivalent but not necessarily superior efficacy. As it is statistically impossible to demonstrate equivalence (that is, prove the null hypothesis of no difference), Blackwelder proposed a one‐sided significance test to reject the null hypothesis that standard therapy is better than experimental therapy by a clinically acceptable amount, δ BW . In this paper, Blackwelder's approach is redefined in terms of the ratio of two means ( R True =μ et /μ st ) based on a continuous variate with higher values denoting greater improvement. The ratio‐based equivalents to Blackwelder's hypotheses will be shown. The ratio parameter has the benefit of being available as a dimensionless percentage, not tied to a specified difference in means. Thus, a study can be sized to assure, with high probability, that the experimental therapy is ‘at least’ ( R LB ×100) per cent ‘as effective as’ the standard therapy, where R LB is the selected lower bound on the percentage effectiveness. A practical rationale is given for defining non‐inferiority as a high fraction or percentage of the standard drug's efficacy, both in terms of statistical efficiency and medical relevance. For most typical ‘at least as good as’ applications (when R LB < R True ⩽1), the ratio formatted test of H 0 : R True ⩽ R LB is shown to be more efficient than Blackwelder's test of H 0 :μ st ‐ μ et ⩾ δ BW , thereby requiring smaller sample sizes to detect the directionally based non‐null alternatives contained in H 1 :μ et /μ st > R LB or, equivalently, μ st ‐ μ et <δ BW . Further, when R True = 1.0, tests of Blackwelder's hypotheses, their ratio‐based equivalents and conventional superiority can be evaluated for comparative efficiency. Testing H 0 : R True ⩽R LB with single‐sided critical region of size α, versus H 1 : R True > R LB , is shown to be more efficient than excluding R LB from the lower limit of a 100(1–2α) per cent two‐sided symmetric confidence interval centred by R̂ . Relevant examples will be presented. Copyright © 2003 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research