Definitions
Random Process / Probability Experiment
- Must be repeatable
- Allows for exact listing of all possible outcomes
- e.g. rolling a die and observing the top face
Sample Space
- The collection of all possible outcomes of a random process
- Denoted
- e.g. for rolling a die,
Event
- A subset of the sample space
- e.g. for rolling a die, the event “die shows an even number” is
Mutually Exclusive
Events and are mutually exclusive when they cannot occur at the same time
Independent
Events and are independent when the occurrence of one event does not influence the likelihood of the other event happening Conditional independence, where and are conditionally independent given an event with :
Types of Random Variable
- Discrete Random Variable
- Where the sample space forms a discrete numerical variable
- e.g. Rolling a die
- Continuous Random Variable
- Where the sample space forms a continuous numerical variable
- e.g. Normally distributed IQ scores with mean 100 and variance 25
Uniform probability
- Where every outcome in the sample space has the same probability
Rules of Probabilities
- The probability of each event , denoted is between 0 and 1 (inclusive)
- For a sample space ,
- If and are Mutually Exclusive events,
Conditional Probability
- Probability that the outcome , if we already know that the outcome
- Denoted “Probability of , given ”
Properties of a test
- Sensitivity
- True positive rate
- P(Test +ve | Actually +ve)
- Specificity
- True negative rate
- P(Test -ve | Actually -ve)
- These are not very useful, we would want to know P(Actually +ve | Test +ve)
- To find this, we need the base rate, or P(Actually +ve)
- To find this, draw the table with columns Test +ve, Test -ve, Row total and rows Actually +ve, Actually -ve, Column total. Then start with any number for the total population, and fill in the other blanks to determine the rate (and associated probability)
Comparison with Rates
| Probability | Rates |
|---|---|
| Probability experiment | Random sampling |
| Sample space | Sampling frame |
| An event of the sample space | A subgroup of of the sampling frame |
| rate() | |
| Independent events | Not associated |
| This comparison can be made using the random process of selecting one unit from the population with [[#uniform-probability | Uniform probability]]. Then, all the laws for probabilities and rates can be applied to each other. |
| e.g. rate(Heart Disease | Smoker) = P(Heart Disease |
Probability Laws
Note: ,
Complement Rule
Addition Rule
If and are mutually exclusive
Multiplication Rule
If and are independent
Law of Total Probability
If , and are events from the sample space such that
- and are mutually exclusive
- Then
Fallacies
Prosecutor’s Fallacy
When thinking that
Base Rate Fallacy
When ignoring the rate of occurrence of some trait (base rate) when answering the question e.g. in the case of a covid test P(infected | +ve test), base rate is P(infected). For countries with different P(infected), P(infected | +ve) will be different
Conjunction Fallacy
Thinking that the chance of two things happening together the chance of one of these things happening alone e.g. Someone is loitering with a knife. Thinking that P(He is a thief and is jobless) > P(He is a thief)