In the following example, a researcher attempts to determine if a disease is associated with the presence of a particular gene. Individuals without the disease are controls and individuals with the disease are cases. Within the cases and controls, individuals with the hypothesized disease gene are marked as positive for the presence of the gene and individuals without the gene are marked as negative.
          | Controls + | Controls - | total
Cases +   | 101        | 59         | 160
Cases -   | 121        | 33         | 154
totals    | 222        | 92         | 314
The cells can be represented by the letters a, b, c and d. The totals across rows and columns are the marginal totals, and the grand total is represented by n:
          | Controls + | Controls - | total
Cases +   | a          | b          | a + b
Cases -   | c          | d          | c + d
totals    | a + c      | b + d      | n
Marginal homogeneity occurs when each row total is equal to the corresponding column total. In the two equations below, a and d cancel, leaving b equal to c:
(a + b) = (a + c)
(c + d) = (b + d)
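As a quick check of the cancellation above, here is a minimal Python sketch using the counts from the example table. The helper name is_marginally_homogeneous is just an illustrative choice, not something from a library.

```python
# Cell counts from the example table: a, b in the first row; c, d in the second.
a, b, c, d = 101, 59, 121, 33

row_totals = (a + b, c + d)
col_totals = (a + c, b + d)

def is_marginally_homogeneous(a, b, c, d):
    """Return True when each row total equals the matching column total."""
    return (a + b) == (a + c) and (c + d) == (b + d)

# Both equalities reduce to b == c, so the two checks always agree.
assert is_marginally_homogeneous(a, b, c, d) == (b == c)

print(row_totals, col_totals, is_marginally_homogeneous(a, b, c, d))
# prints: (160, 154) (222, 92) False
```

In the example the margins differ (160 vs 222, and 154 vs 92), which is the same as saying b = 59 and c = 121 are not equal.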
The McNemar statistic is shown below:
χ² = (b − c)² / (b + c)
χ² is a chi-squared statistic with 1 degree of freedom (df = 1). The formula may be rewritten to include a continuity correction:
χ² = (|b − c| − 1)² / (b + c)
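To make the two formulas concrete, the sketch below applies them to the example counts b = 59 and c = 121 using only the Python standard library. The function names mcnemar_chi2 and chi2_sf_df1 are illustrative assumptions. The p-value relies on the identity that, for 1 degree of freedom, the upper tail of the chi-squared distribution equals erfc(sqrt(x / 2)).

```python
import math

def mcnemar_chi2(b, c, continuity_correction=False):
    """McNemar chi-squared statistic from the two discordant cell counts."""
    diff = abs(b - c)
    if continuity_correction:
        diff -= 1  # the "- 1" term in the corrected formula
    return diff ** 2 / (b + c)

def chi2_sf_df1(x):
    """Upper-tail probability of a chi-squared variable with df = 1."""
    return math.erfc(math.sqrt(x / 2.0))

b, c = 59, 121  # discordant cells from the example table

plain = mcnemar_chi2(b, c)
corrected = mcnemar_chi2(b, c, continuity_correction=True)

print(f"{plain:.2f}")      # prints: 21.36
print(f"{corrected:.2f}")  # prints: 20.67
print(chi2_sf_df1(plain) < 0.05, chi2_sf_df1(corrected) < 0.05)
```

Both versions give a statistic far above 3.84, the 5% critical value for df = 1, so the correction does not change the conclusion for these counts.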
The marginal frequencies are not homogeneous if the χ² result is significant (p < 0.05). If b and/or c are small, such that (b + c) < 10, χ² is not well approximated by the chi-squared distribution, and an exact test (for paired data, an exact binomial test on b and c) should be used instead.
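For small discordant counts, one common exact approach for paired data is a two-sided binomial test on b and c with success probability 0.5. The following is a minimal sketch assuming that approach. The function name mcnemar_exact_p and the counts b = 2, c = 6 are hypothetical and chosen only for illustration; they do not come from the example table.

```python
from math import comb

def mcnemar_exact_p(b, c):
    """Two-sided exact binomial p-value for the discordant counts b and c."""
    n = b + c
    k = min(b, c)
    # P(X <= k) for X ~ Binomial(n, 0.5), doubled for a two-sided test.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

# Hypothetical small-sample counts with b + c < 10.
print(mcnemar_exact_p(2, 6))  # prints: 0.2890625
```

With only eight discordant pairs, a 2 to 6 split is not significant at the 0.05 level under this exact test.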