Rates

HDNo HDRow Total
Smoker381496215000
Non-Smoker448495685000
Column Total8299918100000
  • Marginal Rate
    • % of population that satisfies the condition
    • Rate(Smoker)
  • Conditional Rate
    • A rate based on part of the population that satisfies a certain condition
    • % of people out of population that satisfies “given” condition
    • Rate(HD | Smoker) → Rate of HD given Smoker
    • Rate(HD | Smoker)
  • Joint Rate
    • % of population that satisfies both conditions
    • Rate(HD & Smoker)

Symmetry Rule

If rate(A | B) [op] rate(A | not B), then rate(B | A) [op] rate(B | not A). Where [op] is

Basic Rule of Rates

  1. Given subgroups A, B and C, the overall rate(A) is always between rate(A | B) and rate(A | C)
  2. From 1, when C = “not B”, rate(A) is always between rate(A | B) and rate(A | not B)
  3. The closer rate(B) gets to 100%, the closer rate(A) gets to rate(A | B)
  4. Rate(A) is exactly between rate(A | B) and rate(A | not B) if rate(B) =

Association

BNot BRow Total
A
Not A
Col. Total
  • Categorical variables A and B are associated to each other if rate(A | B) rate(A | not B)
  • A and B are positively associated if rate(A | B) rate(A | not B)
  • Negatively associated if rate(A | B) rate(A | not B)
  • Comparing rate(A | B) vs rate(A | not B) is exactly the same as comparing rate(B | A) vs rate(B | not A)
  • Association does not establish causation
  • Whether the association can be generalised from sample to population is based on Generalisability

Confounder

A confounder is a third variable, associated with both the dependent and independent variables It causes any association determined between the dependent and independent variable to be unreliable

Find Confounders

Male, HDMale, No HDFemale, HDFemale, No HDRow Total
Smoker25958213538015000
Non Smoker3034954145000285000
Column Total55445362755382100000

To determine if sex is a confounding variable, check for association between sex vs smoking and sex vs HD e.g. since rate(Smoker | Male) rate(Smoker | Female) and rate(HD | Male) rate(HD | Female), sex is a confounder

Control for Confounders

  • Split the data
    • If there is an association observed in both groups, then there is an association

Simpson’s Paradox

  • When a trend appears in more than half of the groups of data, but disappears or reverses when the groups are combined
    • Relationship between rates in subgroups disappear/reverse when the groups are combined
  • Simpson’s paradox there is a confounding variable (confounder Simpson’s paradox)
MajorNo. of Applications (Male)No. of Successful Applications (Male)Rate (Male)No. of Applications (Female)No. of Successful Applications (Female)Rate (F)
A200080040%10220%
B10880%100060060%
C10880%100060060%
Overall202081640.3%2010120259%

Paradox: In a majority of majors, rate of successful male applications is higher than female, but overall, rate of successful male applications is lower