Академический Документы
Профессиональный Документы
Культура Документы
Causal Concepts
Inductive reasoning: The process of making
generalized inferences about ‘causation’ based
on repeated observations.
Deductive reasoning: The process of infer-
ring that a general ‘law of nature’ exists and
has application in a specific, or local, instance.
Cause: Any factor that produces a change in
severity or frequency of the outcome.
Necessary cause: One without which the
disease cannot occur.
Sufficient cause: Produces the disease if the
factor is present.
Component-cause: One of a number of
factors that, in combination, con
Target Population: The population to
which it might be possible to extrapolate re-
sults from a study.
Source Population: The population from
which the study subjects are drawn.
Study Sample/Group: Consists of the indi- Outcomes and data analysis
viduals (animals or groups of animals) that Continuous // dichotomous // nominal // count // time to event
end up in the study. Animals
Internal validity: The study results are valid Herds causal
for members of the source population. Areas inferences
External validity: The study results are valid Direct
for the source population, target population, Causal-web model: Consists of multi- cause
and beyond. ple indirect and direct causes. The fol- Indirect Outcome
lowing is an example of a Causal-web Cause
model. Direct
Cause
Sampling (Exposure)
Non-probability sampling: individual’s Stratified random sample: Prior to sampling, Sampling frame: List of all sampling
probability of selection is not determined the population is divided into mutually exclu- units in the source population
(Judgment, Convenience, Purposive) sive strata based on factors likely to affect the
Type I (α) error: Concluding that the
Probability sampling: every element has outcome.
outcomes in the groups being compared
a known non-zero probability of being Cluster sampling: Every study subject within are different (association exists) when they
included in the sample the cluster (collection of subjects with 1 or are not.
Simple random sample: Every study more common characteristics) is included in the
Type II (β) error: Concluding that the
subject in the source population has an sample and the primary sampling unit is larger
outcomes are not different (no associa-
equal probability of being included. than the unit of concern.
tion) when they are
Systematic random sample: A complete Multistage sampling: After the primary sam-
Power: Probability that you will find a
list of the population to be sampled is not pling unit is chosen, then a sample of secondary
statistically significant difference when it
required provided an estimate of the total sampling units is selected.
exists and is of a certain magnitude (i.e.
number of animals is available and all the Targeted (risk-based) sampling: Animals are power = 1-β)
animals are sequentially available. assigned point values based on the probability
of them having the disease of interest and sam- Created by Keila Perez
pling is proportional to that estimate of risk. kviol@tamu.edu
Types of Error:
Sampling Equations True state of nature
Conclusion of stat.
analysis Effect present Effect absent
1) n= total sample size Effect present (reject Type I (α)error
To estimate a sample proportion with a null) Correct (power) (p-value)
desired precision: Effect absent (accept Type II (β) er-
null) ror Correct
For adjusting the sample size (n) for clus- For continuous and binary covariates, new
To estimate a sample mean with a desired tering, the size of new n(n’) depends on n (n’) (VIF= Variance Inflation Factor):
precision: intra-cluster correlation (ρ) and number of
individuals sampled per cluster (m):
6) General formula for the width of CI of
2) (n=sample size per group)
To compare 2 proportions: a parameter
4) Sampling to confirm disease absence Parameter ± Z*SE(parameter), where for
Where p=(p1+p2)/2 and q=1-p
From finite population <1000: - Estimating a mean in a single sample
To compare 2 means: From a large (infinite) population: - Comparisons of means from 2 samples
If sampling from a finite population in 5) Adjustment of sample size (n) in multi- - If expected interaction between two di-
descriptive studies, the required sample variable studies: chotomous variables
size (n’) can be adjusted using FPC formu- For k continuous covariates, new n (n’)
la: ρce =average correlation between expo-
sure and confounders
Questionnaires
Questionnaire: A data-collection tool that Focus Groups: Normally a group of 6-12 Quantitative: ’Structured’ questionnaires
can be used in a wide variety of clinical people that provide opportunity for a designed to capture information about
and epidemiological research settings. structured form of consultation with mem- study subjects and their environment
bers of the intended study population, the
Survey: An observational study designed Open Question: There are no re-
end users and/or the interviewers.
to collect descriptive information about an strictions on the types of responses ex-
animal population (such as prevalence of Qualitative: ‘Explorative’ questionnaires pected.
disease, level of production etc.) consisting mainly of open questions.
Closed Question: The response has to
be selected from a pre-set list of answers.
Interpretation of Risk ratio (RR), Rate ratio (IR), and Odds ratio (OR): <1 exposure is protective, =1 no effect, and >1 exposure is
positively associated with disease
Interpretation of Risk difference (RD) and Incidence difference (IR): <0 exposure is protective, =0 no effect, and >0 exposure is
positively associated with disease
The range for AFe: Values from 0 (risks equal regardless of exposure) to 1 (no disease in non-exposedà i.e. all disease is due to ex-
posure). Vaccine efficacy is a form of AFe.
Diagnostic Tests
Accuracy: Average is close to true value Multiple Tests Interpretation: True prevalence: The true state of nature.
Precision: The amount of variability - Series: Result is considered positive only Apparent prevalence: The result in the
among test results. if both tests are positive study due to imperfections in the diagnos-
Coefficient of variation (CV): Standard - Parallel: result is considered positive if tic tests.
Devation/Mean (for repeat runs on same either test is positive Predictive Values: The probability that
sample) the animal has or does not have the dis-
Sensitivity (Se): proportion of diseased
Pearson correlation coefficient (PCC): animals that test positive (TP): p(T+|D+) ease, given the test result.
Ignores the scales of the 2 sets of results PV(+) = p(D+|T+)
Specificity (Sp): proportion of non- PV(-) = p(D-|T-)
Concordance correlation coefficient diseased animals that test negative (TN):
(CCC): Takes into account data position p(T-|D-) Define cutoff: Sp increases, Se decreases.
from equality line. See graph below.
Bias
Selection bias: Composition of the study Confounding: Due to effects of factors Interaction: Stratum specific measures
group(s) differs from that in the source other than the exposure of interest on the different (based on the homogeneity test)
population (and target population). observed measure of association. providing a more detailed description of
the relationship between exposure and
Information bias: Incorrectly measured/ Confounding control at the study de-
disease. Needs to be measured on either
classified subject’s exposure, outcome, sign stage includes:
the additive or multiplicative scale.
extraneous factors - Exclusion (Restricted sampling)
- Matching: Involves making distribution No interaction on an Additive scale:
Misclassification: Rearrangement of
of the extraneous factor(s) in the groups (RR11 ‑ 1) = (RR10 ‑ 1) + (RR01 ‑ 1)
study individuals into incorrect categories
because of errors in classifying exposure, being compared the same. Prevents con- No interaction on a Multiplicative scale:
outcome or both founding and may increase power of the RR11 = RR10 * RR01
study.
Non-differential Misclassification: If R11 = Pr(D|A1B1)
misclassification of the exposure and the Confounding control during analysis:
R10 = Pr(D|A1B0)
outcome “disease” are independent. Will The Mantel-Haenszel (MH) estimator for
R01 = Pr(D|A0B1)
bias the measures of association toward Categorical data with dichotomous expo-
R00 = Pr(D|A0B0)
the null. sure. Will need: 1) to stratify data accord-
RR11 =R11/R00,
ing to the combination of levels of the
Differential Misclassification: If the RR10 =R10/R00,
confounding variables; 2) examine stratum
errors in exposure classification are related RR01 =R01/R00
specific measures; 3) assure that stratum
to the status of the outcome under study. specific measures are equal using a homo-
Resulting bias in the measure of associa- geneity test; 4)calculate a pooled weighted
tion might be in any direction (adjusted) estimate of association
Measurement error: Errors in measuring
quantitative factors can lead to biased
measures of association.
AN OVERVIEW OF MEASUREMENTS IN EPIDEMIOLOGY [VER 3, 2007]
?
EXPOSURES OUTCOME
Epidemiology is about identifying associations between exposures and outcomes. To identify any association, exposures and outcomes must first be measured in
a quantitative manner. Then rates of occurrence of events are computed. These measures are called “measures of disease frequency.” Once measured, the
association between exposures and outcomes are then evaluated by calculating “measures of association or effect.” Finally, the impact of removal of an exposure
on the outcome is evaluated by computing “measures of potential impact.” In general, measures of disease frequency are needed to generate measures of
association, and both these are needed to get measures of impact. There is some overlap between these measures, and terminology is poorly standardized.
The superscript numbers refer to the formulae used to compute those measures (formulae shown separately in the following pages) 1
Madhukar Pai, McGill University [madhukar.pai@mcgill.ca], Kristian Filion, McGill University [kristian.filion@mail.mcgill.ca]
AN OVERVIEW OF MEASUREMENTS IN EPIDEMIOLOGY [VERSION 3, 2007]
The following formulae are based on this typical epi 2 x 2 table with standard notation:
Outcome (Disease)
Yes No
Exposure Yes a b a+b
No c d c+d
a+c b+d
Other notation used:
2
Number Needed to Harm (NNH) = 1 / RD
3
Risk Ratio (RR, CIR) = a/(a + b) = Ie / Io
c/(c + d)
4
Rate Ratio (RR, IDR) = see end of this handout
7
Attributable Risk (AR) = Same formula as Risk Difference
8
Attributable Risk Percent (AR%) = Ie – Io * 100 = AR * 100
Ie Ie
= a/(a + b) – c/(c + d)
a/(a + b)
9
Population Attributable Risk (PAR) = It – Io
10
Population Attributable Risk Percent = It – Io * 100
(PAR%) It
4
Rate Ratio (RR, IDR) = a/N1
b/N2
This formula for Rate Ratio is based on the following 2 x 2 table format:
In some situations (such as a clinical drug trial or a vaccine efficacy study), the exposure
is protective. Therefore, incidence of disease in the exposed/intervention group (Ie) will
usually be lower than incidence in the unexposed/control group (Io). Hence, measures
such as RR and OR will be < 1.0 [i.e. protective effect].
In such situations, some of the above formulae will have to be computed and interpreted
differently. Also, the names will change.