23 Novl CA - Co Studies
Doke Professor, Department of Preventive and Social Medicine MGM Medical College, Kamothe, Navi Mumbai
M.D., DNB., Ph.D., FIPHA
The case-control study is an analytic epidemiologic research design in which the study population consists of groups who either have (cases) or do not have (controls) a particular health problem or outcome. The investigator looks back in time to measure the exposure of the study subjects. The exposure is then compared among cases and controls to determine whether it could account for the health condition of the cases.
Synonyms: Case-Referent study, Case-Compeer study
Retrospective ?
Observational / Non-experimental
Occasionally Exploratory
Explanatory (Analytical)
Retrospective
Effect to Cause
Both Exposure & Disease have already occurred
Uses Comparison Group
Cohort Study: Waiting years for accrual of cases
Case-Control Study: Compresses time
Case-Control Studies are hence suitable when there is a long induction period between the exposure and the disease
RCT: Methodological Standard of Excellence
However, the Case-Control study is not only SIMPLE to perform but sometimes the ONLY approach to solve a problem.
Philosophically, no design is the Gold Standard.
Understand the strengths and weaknesses of each design.
Select the appropriate study design to address your Research Question.
1. Directionality: Outcome to exposure
2. Timing: Retrospective for exposure, but case ascertainment can be either retrospective or concurrent
3. Sampling: Almost always on outcome, with matching of controls to cases
[Figure: design of a case-control study. CASES (Disease) and CONTROLS (No Disease) are each classified as Exposed or Not Exposed.]
Diagnostic Criteria
Risk of Disease Misclassification
Continuous / Discrete Outcome Variable
Relatively simple & straightforward: Children with cleft palates (physical examination)
Sometimes difficult: Hypertension
Criteria Specific
Operational versus Rigid
Standard Definition (WHO, CDC, etc.)
Reference (growth references: NCHS, CDC, New WHO)
Eligibility Criteria
Women who are not sexually active or who have had a tubal ligation are not likely to have recently used any contraceptive method including IUDs
Conceptual definition
Obesity defined as body fat percentage > 33%
Operational definition
Body Mass Index > 30
A severe case definition may exclude people who have been cured or who died of the disease before the condition was severe enough to be labelled as a case.
Standard/consensus definitions, if available, must be used.
Lack of agreement over the definition may introduce variability in estimates of effect.
For example, rheumatoid arthritis: Rome criteria, NY criteria, 1987 ACR criteria
The issues of severity, diagnostic criteria and subjectivity of criteria all lead to potential problems of misclassification of cases.
The researcher can choose between more restrictive and more inclusive definitions.
Think in terms of the sensitivity and specificity of the definition and its effect on validity, sample size, precision and power.
It is observed that:
A restrictive definition (less sensitive) leads to a lack of precision and power by reducing the sample size.
Broad criteria (less specific) produce misclassification, leading to a biased measure of effect.
So, weigh validity first: prefer specificity over sensitivity (a restrictive definition over an inclusive one).
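The trade-off between sensitivity and specificity of the case definition can be illustrated with a small sketch. The population sizes and risks below are hypothetical, chosen only for illustration, and the same sensitivity and specificity are assumed to apply to exposed and unexposed people (non-differential misclassification).

```python
def observed_odds_ratio(n_exposed, n_unexposed, risk_exposed, risk_unexposed,
                        sensitivity, specificity):
    """Odds ratio seen after applying an imperfect case definition.

    The same sensitivity and specificity are applied to exposed and
    unexposed people (non-differential misclassification).
    """
    def classify(n, risk):
        true_cases = n * risk
        true_noncases = n - true_cases
        # Labelled "case": detected true cases plus false positives.
        labelled_cases = sensitivity * true_cases + (1 - specificity) * true_noncases
        return labelled_cases, n - labelled_cases

    a, b = classify(n_exposed, risk_exposed)      # exposed: cases, non-cases
    c, d = classify(n_unexposed, risk_unexposed)  # unexposed: cases, non-cases
    return (a * d) / (b * c)


# Hypothetical population: 10,000 exposed (risk 2%) and 10,000 unexposed (risk 1%).
args = (10_000, 10_000, 0.02, 0.01)
true_or = observed_odds_ratio(*args, sensitivity=1.0, specificity=1.0)
less_sensitive = observed_odds_ratio(*args, sensitivity=0.9, specificity=1.0)
less_specific = observed_odds_ratio(*args, sensitivity=1.0, specificity=0.9)

print(f"perfect definition: OR = {true_or:.2f}")
print(f"sensitivity 0.9:    OR = {less_sensitive:.2f}")
print(f"specificity 0.9:    OR = {less_specific:.2f}")
```

With these assumed numbers, losing sensitivity leaves the odds ratio near 2.02, while losing specificity pulls it down to about 1.09, which is why specificity is usually given more weight when the disease is rare.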
Industrial Population
The goal is to ensure that all true cases have an equal probability of entering the study and that no false cases enter.
Example: Conceptual definition of HIV
Factors affecting the decision to test, access to the test, and the sensitivity (Sn) and specificity (Sp) of the test decide who eventually becomes a case under the operational definition.
Selection bias ??
Selection bias
Berkson's bias
Unequal chance of getting into the study
Variable rate of hospitalization affecting case selection
Incident case vs. prevalent case
Due to closer medical attention, detection of endometrial cancer was more in a group using estrogen
1. Representativeness
Ideally, cases should be a random sample of all cases of interest in the source population (e.g. from vital data, registry data). More commonly they are a selection of available cases from a medical care facility (e.g. from hospitals, clinics).
2. Method of Selection
Selection may be from incident or prevalent cases. Incident cases are those derived from ongoing ascertainment of cases over time. Prevalent cases are derived from a cross-sectional survey.
Who is the best control? What universe should controls come from?
If cases are a random sample of cases in the population, then controls should be a random sample of all non-cases in the population, sampled at the same time.
Similar misclassification errors in cases & controls
Same potential for recall bias in cases & controls
Hospital or clinic control
Dead control
Controls with similar diseases
Peer or case-nominated (friend/neighbor) control
Population controls
Readily available, hence commonly used
Main reasons to use hospital controls are:
To select controls whose referral pattern is similar to that of the cases
To obtain a similar quality of examination
For convenience
Might use dead controls for dead cases
In some situations, this might lead to the use of a surrogate informant
The problem is that the dead control is not representative of the living population
McLaughlin compared dead controls with living controls and noticed that the dead controls smoked more cigarettes and consumed more alcohol than living controls
Appropriateness depends on the exposure being studied
Reasons
To minimize recall bias
To minimize interviewer bias
To examine the specificity of an exposure for a particular type of cancer
For practical but unspecified reasons
Problem ??
Search starts from house of the case and door-to-door search conducted for eligible controls in a standardized pattern
Randomly drawn from the population
Truly representative of the population
Ideal way of selecting controls
Practically, very difficult to carry out
Study base ???
Weigh the pros and cons
Analyze the situation for any bias being introduced
If possible, select different sources of controls and compare them with each other
Compare the inferences drawn
Statistical consideration
When the number of subjects available in one group (cases) is limited, increasing the size of the other group (controls) increases the study power.
The gain in power continues up to a control-to-case ratio of about 4:1; thereafter the gain is not substantial but the cost increases.
When the power of the study with equal allocation is already as high as 0.9 or as low as 0.1, additional controls fail to increase the power appreciably.
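The diminishing return beyond about four controls per case can be seen with a commonly quoted approximation, which is an assumption here and not taken from these slides: near the null, r controls per case give roughly r / (r + 1) of the efficiency of an unlimited number of controls. A minimal sketch:

```python
def relative_efficiency(controls_per_case):
    """Approximate efficiency of r controls per case, relative to an
    unlimited number of controls, using the r / (r + 1) rule of thumb."""
    r = controls_per_case
    return r / (r + 1)


# Efficiency for 1 to 6 controls per case.
efficiencies = {r: relative_efficiency(r) for r in range(1, 7)}

for r, eff in efficiencies.items():
    print(f"{r} control(s) per case: relative efficiency {eff:.3f}")
```

Under this rule the efficiency rises from 0.5 at 1:1 to 0.8 at 4:1, and each further control adds less than 0.05.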
Validity of inferences
Even when there is no statistical need, more than one control may be recruited per case.
Enrolling two or more types of controls is a way of checking for biases introduced by the choice of control group.
If the measure of effect is similar when comparing cases with each control group: probably no bias (but no certainty).
If the measure of effect differs: bias is present, and the researcher can try to understand it.
MATCHING
Purpose: To adjust for the effects of relevant confounders
Matching in the design must be accounted for in the analysis
Misconception: The goal is to make the case and control groups similar in all respects, except for disease status
An Optimal Matching Scheme involves only those variables which improve statistical efficiency or eliminate bias from the effect of interest
MATCHING
Which variables are appropriate for matching?
Risk factors from prior work may be identified for matching
Matching by interviewer or hospital may be used to balance out the effects of interviewer and observer errors
It is best to limit matching to basic descriptors (age, sex, socio-economic status, etc.)
Non-modifiable risk factors
Use few matching factors
MATCHING
Overzealous matching may have adverse effects:
Matching on a strong correlate of the exposure which is not an independent risk factor for the outcome (overmatching) may lead to an underestimate of the OR
Matching may lead to a false sense of security that a particular variable is adequately controlled
1. Control selection is usually through matching.
2. Matching variables (e.g. age) and matching criteria (e.g. within the same 5-year age group) must be set up in advance.
3. Avoid over-matching; match only on factors KNOWN to be a cause of the disease.
4. Obtain POWER by matching MORE THAN ONE CONTROL per case. In general, N of
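As an illustration of matching criteria fixed in advance, the sketch below matches controls to cases on sex and 5-year age band, with up to two controls per case. The function name, record layout and sample records are hypothetical, and the greedy first-come selection is a simplification, not a recommended procedure.

```python
def match_controls(cases, pool, controls_per_case=2, age_band=5):
    """Greedy matching on sex and age band; each control is used at most once."""
    def key(person):
        # Matching criteria: same sex and same age band (e.g. 50-54 years).
        return (person["sex"], person["age"] // age_band)

    # Group the available controls by their matching key.
    available = {}
    for control in pool:
        available.setdefault(key(control), []).append(control)

    matched = {}
    for case in cases:
        candidates = available.get(key(case), [])
        n_take = min(controls_per_case, len(candidates))
        # Remove chosen controls so they cannot be matched to another case.
        matched[case["id"]] = [candidates.pop(0)["id"] for _ in range(n_take)]
    return matched


# Hypothetical records for illustration only.
cases = [{"id": "C1", "sex": "F", "age": 52},
         {"id": "C2", "sex": "M", "age": 67}]
pool = [{"id": "K1", "sex": "F", "age": 50},
        {"id": "K2", "sex": "F", "age": 54},
        {"id": "K3", "sex": "F", "age": 58},
        {"id": "K4", "sex": "M", "age": 66},
        {"id": "K5", "sex": "M", "age": 71}]

matched = match_controls(cases, pool)
print(matched)
```

Here case C1 receives two controls from the 50-54 band, while C2 finds only one eligible control, which shows how strict matching criteria can leave some cases short of controls.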
Questionnaire
Type of respondent
Administration of questionnaire
Salience of exposure
Way in which information is retrieved
Ways in which responses are formulated and recorded
Records
Abstraction of data from records
Quality control measures are important:
Careful design and testing of the abstraction form
Training and supervision of abstractors
A priori definition of terms
Specification of rules for handling conflicting or missing data
FIRST: Select

                       CASES             CONTROLS
                       (With Disease)    (Without Disease)
Exposed                a                 b
Not exposed            c                 d
Total                  a+c               b+d
Proportions Exposed    a/(a+c)           b/(b+d)
        Case    Control    Risk
E+      a       b          a/(a+b)
E-      c       d          c/(c+d)

Odds Ratio = (a/c) / (b/d) = ad / bc
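A minimal sketch of these quantities, using the cell labels a, b, c, d as laid out above (a, c = exposed and unexposed cases; b, d = exposed and unexposed controls). The counts are hypothetical and serve only to show the arithmetic.

```python
def exposure_summary(a, b, c, d):
    """a, c = exposed / unexposed cases; b, d = exposed / unexposed controls."""
    return {
        "prop_exposed_cases": a / (a + c),
        "prop_exposed_controls": b / (b + d),
        # Cross-product ratio: (a/c) / (b/d) = ad / bc.
        "odds_ratio": (a * d) / (b * c),
    }


# Hypothetical counts for illustration only.
summary = exposure_summary(a=40, b=20, c=60, d=80)

for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

In this made-up table 40% of cases and 20% of controls were exposed, giving an odds ratio of about 2.67.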
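Matched pairs:

                       Controls
                       Exposed          Unexposed
Cases    Exposed       Both exposed     Mixed
         Unexposed     Mixed            Neither exposed

McNemar chi-square = (t - s)^2 / (t + s), where s and t are the numbers of the two kinds of mixed (discordant) pairs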
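A short sketch of the matched-pairs analysis. It assumes, for illustration, that s counts pairs in which only the case was exposed and t counts pairs in which only the control was exposed; the slides do not say which is which, and the counts are hypothetical. The chi-square is the same either way, but the matched odds ratio depends on this assignment.

```python
def mcnemar(s, t):
    """s = pairs with case exposed and control unexposed (assumed labelling);
    t = pairs with case unexposed and control exposed."""
    # Only discordant pairs carry information about the association.
    chi_square = (s - t) ** 2 / (s + t)
    matched_or = s / t
    return chi_square, matched_or


# Hypothetical discordant-pair counts for illustration only.
chi_square, matched_or = mcnemar(s=25, t=10)

print(f"McNemar chi-square (1 df): {chi_square:.2f}")
print(f"Matched-pairs odds ratio:  {matched_or:.2f}")
```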
                 Stroke    Control    Total
Exposed          30        70         100
Not exposed      10        90         100
Total            40        160        200

Odds ratio = (30 x 90) / (70 x 10) = 3.86
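The stroke example can be worked through in a few lines. The sketch assumes the four cell counts are 30 and 70 in one row and 10 and 90 in the other; which margin is exposure and which is disease does not change the odds ratio or the chi-square. The Woolf (logit) confidence interval is an addition for illustration and is not part of the slides.

```python
import math


def analyse_2x2(a, b, c, d):
    """Odds ratio, Woolf 95% CI and Pearson chi-square for a 2x2 table
    with rows (a, b) and (c, d)."""
    n = a + b + c + d
    odds_ratio = (a * d) / (b * c)
    # Woolf (logit) 95% confidence interval for the odds ratio.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - 1.96 * se_log_or)
    upper = math.exp(math.log(odds_ratio) + 1.96 * se_log_or)
    # Pearson chi-square without continuity correction, 1 degree of freedom.
    chi_square = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    return odds_ratio, (lower, upper), chi_square


odds_ratio, ci, chi_square = analyse_2x2(a=30, b=70, c=10, d=90)

print(f"Odds ratio: {odds_ratio:.2f}")
print(f"95% CI:     {ci[0]:.2f} to {ci[1]:.2f}")
print(f"Chi-square: {chi_square:.2f}")
```

The odds ratio comes out at about 3.86 with a chi-square of 12.5 on 1 degree of freedom, and the interval (roughly 1.8 to 8.4) excludes 1.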
Advantages:
1. Only realistic study design for uncovering etiology in rare diseases
2. Important in understanding new diseases
3. Commonly used in outbreak investigations
4. Useful if the induction period is long
5. Relatively inexpensive
Disadvantages:
1. Susceptible to bias if not carefully designed
2. Especially susceptible to exposure misclassification
3. Especially susceptible to recall bias
4. Restricted to a single outcome
5. Incidence rates usually cannot be calculated
6. Cannot assess the effects of matching variables
Doll's 1952 study of smoking and lung cancer: the problem was that the control population (lung disease) was biased in relation to the exposure.
MacMahon's 1981 study of coffee and pancreatic cancer: the problem was that some of the controls may have been biased in relation to the exposure, because diseases related to coffee were excluded from the control series.
The odds ratio is a good estimate of the relative risk when the disease is rare (prevalence <20%)
Can be extended to N>1 controls
Statistical testing is by simple chi-square (unmatched analysis) or by McNemar's chi-square (matched-pairs analysis)
Can be extended to multiple strata (Mantel-Haenszel chi-square)
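For the extension to multiple strata, the sketch below computes the Mantel-Haenszel pooled odds ratio, the summary estimate that usually accompanies the Mantel-Haenszel chi-square. The strata are hypothetical (for example, two age groups), and each table is given as (a, b, c, d) with a, c = exposed and unexposed cases and b, d = exposed and unexposed controls.

```python
def mantel_haenszel_or(strata):
    """Pooled odds ratio across strata.

    strata: list of (a, b, c, d) tables, one per level of the confounder.
    """
    # Each stratum contributes ad/n to the numerator and bc/n to the denominator.
    numerator = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    denominator = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return numerator / denominator


# Hypothetical strata for illustration only.
strata = [(10, 20, 5, 40), (30, 15, 20, 30)]
stratum_ors = [(a * d) / (b * c) for a, b, c, d in strata]
pooled = mantel_haenszel_or(strata)

print("Stratum-specific ORs:", [round(x, 2) for x in stratum_ors])
print(f"Mantel-Haenszel pooled OR: {pooled:.2f}")
```

The pooled estimate (about 3.30 here) lies between the stratum-specific odds ratios of 4.0 and 3.0, weighted toward the larger stratum.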
Case-control studies should be viewed as efficient sampling schemes of the disease experience of the underlying open or closed cohorts.
The exposure odds ratio derived from case-control studies equals or closely matches the relative risk derived from cohort studies.
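How closely the odds ratio tracks the relative risk depends on how common the disease is. The sketch below holds the relative risk at 2 and varies the baseline risk; the risks are hypothetical values chosen for illustration.

```python
def rr_and_or(risk_exposed, risk_unexposed):
    """Relative risk and odds ratio for the same pair of risks."""
    relative_risk = risk_exposed / risk_unexposed
    odds_exposed = risk_exposed / (1 - risk_exposed)
    odds_unexposed = risk_unexposed / (1 - risk_unexposed)
    return relative_risk, odds_exposed / odds_unexposed


# Hypothetical baseline risks; the exposed risk is always twice the baseline.
results = {baseline: rr_and_or(2 * baseline, baseline)
           for baseline in (0.001, 0.01, 0.1, 0.3)}

for baseline, (rr, odds_ratio) in results.items():
    print(f"baseline risk {baseline}: RR = {rr:.2f}, OR = {odds_ratio:.2f}")
```

With a baseline risk of 0.1% or 1% the odds ratio stays within about 0.02 of the relative risk of 2, while at 10% it is 2.25 and at 30% it reaches 3.5.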
Thank you