z-logo
Premium
An empirical comparison of three methods for multiple cutoff diagnostic test meta‐analysis of the Patient Health Questionnaire‐9 ( PHQ ‐9) depression screening tool using published data vs individual level data
Author(s) -
Benedetti Andrea,
Levis Brooke,
Rücker Gerta,
Jones Hayley E.,
Schumacher Martin,
Ioannidis John P. A.,
Thombs Brett
Publication year - 2020
Publication title -
research synthesis methods
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.376
H-Index - 35
eISSN - 1759-2887
pISSN - 1759-2879
DOI - 10.1002/jrsm.1443
Subject(s) - patient health questionnaire , meta analysis , cutoff , depression (economics) , test (biology) , psychology , medicine , clinical psychology , statistics , computer science , applied psychology , psychiatry , depressive symptoms , mathematics , physics , quantum mechanics , anxiety , paleontology , biology , economics , macroeconomics
Selective cutoff reporting in primary diagnostic accuracy studies with continuous or ordinal data may result in biased estimates when meta‐analyzing studies. Collecting individual participant data (IPD) and estimating accuracy across all relevant cutoffs for all studies can overcome such bias but is labour intensive. We meta‐analyzed the diagnostic accuracy of the Patient Health Questionnaire‐9 (PHQ‐9) depression screening tool. We compared results for two statistical methods proposed by Steinhauser and by Jones to account for missing cutoffs, with results from a series of bivariate random effects models (BRM) estimated separately at each cutoff. We applied the methods to a dataset that contained information only on cutoffs that were reported in the primary publications and to the full IPD dataset that contained information for all cutoffs for every study. For each method, we estimated pooled sensitivity and specificity and associated 95% confidence intervals for each cutoff and area under the curve (AUC). The full IPD dataset comprised data from 45 studies, 15 020 subjects, and 1972 cases of major depression and included information on every possible cutoff. When using data available in publications, using statistical approaches outperformed the BRM applied to the same data. AUC was similar for all approaches when using the full IPD dataset, though pooled estimates were slightly different. Overall, using statistical methods to fill in missing cutoff data recovered the receiver operating characteristic (ROC) curve from the full IPD dataset well when using only the published subset. All methods performed similarly when applied to the full IPD dataset.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here