A statistical note on Karl Pearson’s 1904 meta-analysis

Open Access

Author(s) - Harry S. Shan

Publication year - 2016

Publication title - Journal of the Royal Society of Medicine

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.38

H-Index - 81

eISSN - 1758-1095

pISSN - 0141-0768

DOI - 10.1177/0141076816659003

Subject(s) - statistical analysis, meta-analysis, computer science, statistics, data science, information retrieval, natural language processing, medicine, mathematics, pathology

Karl Pearson’s 1904 report on ‘Certain enteric fever inoculation statistics’ is seen as a key paper in the history of meta-analysis. In it, Pearson raised several important methodological issues arising from his correlations between typhoid and mortality and the inoculation status of soldiers serving in various parts of the British Empire. First, he noted the ‘significance’ of the individual correlations. For this he used the magnitude of the correlations in relation to their ‘probable errors’. Second, he pointed out the ‘extreme irregularity’ of the correlation values – what we would now call heterogeneity – and sought to explain why they differed. Third, he commented on the ‘lowness’ of the values, arguing that they were too low to convince him that the inoculation had been proven worthwhile. He felt that a better vaccine was needed.
Pearson also commented on how the data had been obtained. He was concerned that self-selection into the inoculated group by volunteers who were ‘more cautious and careful’ could have produced spurious estimates of effectiveness. This and his concerns about the weakness of the correlations led him to recommend that an ‘experiment’ be done. He did not propose a randomised controlled trial – he was writing before Fisher developed the theoretical reasons for random allocation – but Pearson clearly understood the need for comparability of groups. His solution was to call for volunteers, register them all, and only inoculate every second one.
The data available to Pearson were presented in 2 × 2 tables. To create a measure of effect, he computed for each table the tetrachoric correlation, which he had described a few years earlier. The approach assumes the data come from a bivariate normal distribution and derives the correlation based on that distribution. Today we would use the data in the tables to find other measures, for example, the relative odds (odds ratios).
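As a rough illustration of how relative odds are read off a single 2 × 2 table, the Python sketch below uses entirely hypothetical counts; they are not Pearson’s data. The cell layout (a, b, c, d), the function names, and the definition of relative risk as the attack risk in the non-inoculated divided by that in the inoculated are all assumptions made for this example.

```python
def odds_ratio(a, b, c, d):
    """Odds ratio for a 2 x 2 table laid out (by assumption) as:

                        escaped   attacked
        inoculated         a         b
        not inoculated     c         d

    Values above 1 mean the inoculated had higher odds of escaping.
    """
    return (a * d) / (b * c)


def relative_risk_of_attack(a, b, c, d):
    """Attack risk in the non-inoculated divided by that in the inoculated."""
    risk_not_inoculated = d / (c + d)
    risk_inoculated = b / (a + b)
    return risk_not_inoculated / risk_inoculated


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
a, b, c, d = 900, 100, 500, 500

or_escape = odds_ratio(a, b, c, d)
print("odds ratio (escaping):", or_escape)
print("relative risk of attack:", relative_risk_of_attack(a, b, c, d))

# The same association expressed in the other direction, where values
# below 1 indicate a benefit of inoculation.
print("inverse odds ratio:", 1 / or_escape)
```

With these made-up counts the odds ratio is 9 while the relative risk is 5, which shows how the two measures diverge when the outcome is common.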
Table 1 shows Pearson’s values for the correlations, along with estimates of the relative odds. Following Pearson, the results are presented separately for the relation between inoculation and escaping typhoid (enteric) fever, and the relation between inoculation and case survival. The rank orders of the correlations and odds ratios are the same for the first set of tables and almost identical for the second. What is striking is that, even when the odds ratio reached 7.9 (the relative risk for this table was 6.9), the correlation was only 0.445, which fell in the range (0.25–0.5) that Pearson labelled ‘moderate’. (Pearson used as outcomes ‘escaping’ disease and ‘survival given disease’, so that protection resulting from inoculation is reflected in positive correlations and in odds ratios greater than 1. We are used to seeing the odds ratios presented so that values below 1 show a benefit of treatment. In this case, the inverse of 7.9 is 0.13.)
A formal test of heterogeneity for the odds ratios in the first set of tables confirms Pearson’s observation (Breslow-Day χ² = 90.6 on 4 df, p < 0.001). However, this is not so for the second set, for which the test is not conventionally statistically significant: χ² = 6.9 on 5 df, p = 0.23. Given this, it is legitimate to compute a pooled odds ratio: the Mantel-Haenszel estimate is 1.77 (95% CI 1.5–2.1).
A final point: Pearson considered the effectiveness of inoculation in two steps – whether it prevented soldiers from acquiring typhoid fever and whether it reduced mortality in those who had developed the disease. For four of the groups, it is possible to explore directly the relationship between inoculation and mortality from the disease. The odds ratios are in the range of 2.2–6.8. They are not significantly different from each other – χ² = 5.2 on 3 df, p = 0.16. The pooled estimate is 4.5 (95% CI 3.1–6.6). At face value, it is a strong effect (the inverse is 0.22, 95% CI 0.15–0.32) by current criteria.
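The reported p-values and inverses can be checked with a few lines of Python, and the Mantel-Haenszel pooling step can be sketched in the same way. This is a minimal sketch under stated assumptions: the heterogeneity statistics are treated as chi-square distributed, the function names are invented for this example, and the strata passed to the pooling function are hypothetical because the underlying tables are not reproduced here.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df >= 1."""
    z = x / 2.0
    # Closed-form starting values: df = 2 (even case) or df = 1 (odd case).
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))
    # Recurrence Q(a + 1, z) = Q(a, z) + z**a * exp(-z) / Gamma(a + 1).
    while 2 * a < df:
        q += z ** a * math.exp(-z) / math.gamma(a + 1)
        a += 1.0
    return q


def mantel_haenszel_or(tables):
    """Pooled odds ratio over 2 x 2 tables given as (a, b, c, d) tuples."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den


# Heterogeneity statistics quoted in the text: (chi-square, df).
for stat, df in [(90.6, 4), (6.9, 5), (5.2, 3)]:
    print(f"chi-square {stat} on {df} df: p = {chi2_sf(stat, df):.3g}")

# Inverses quoted in the text (odds ratios re-expressed so that < 1 is benefit).
print(round(1 / 7.9, 2))                                      # 0.13
print(round(1 / 4.5, 2), round(1 / 6.6, 2), round(1 / 3.1, 2))  # 0.22 0.15 0.32

# Hypothetical strata, NOT Pearson's data, only to show the pooling step.
hypothetical_tables = [(90, 10, 50, 50), (80, 20, 60, 40)]
print("pooled OR (hypothetical):", mantel_haenszel_or(hypothetical_tables))
```

Rounded to two decimals, the second and third p-values come out at 0.23 and 0.16, in agreement with the figures quoted above, and the first is far below 0.001. The confidence limits swap places when the odds ratio is inverted, so 1/6.6 gives the lower limit and 1/3.1 the upper.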
Even so, I suspect that Pearson would still not have been convinced of the value of vaccination but would have continued to insist that further work was needed, including a proper controlled trial.
