Midterm Review STA216: Generalized Linear Models: I I I I I I
Midterm Review STA216: Generalized Linear Models: I I I I I I
Midterm Review STA216: Generalized Linear Models: I I I I I I
1. GLM Basics
yi θi − b(θi )
f (yi ; θi , φ) = exp + c(yi , φ) ,
a(φ)
ηi = g(µi ) = x0i β,
where ηi is the linear predictor, g(·) is the link function, and xi are
predictors
1
2. Basics of Frequentist Inference
approximation at MLE.
(d) For variable selection, stepwise procedures are often used - forward,
2
3. Basics of Bayesian Inference in GLMs
priors.
3
4. Latent Variable Models for Binary Data
4
5. Using Hierarchical GLMs for Correlated Data
rameters.
otherwise.
5
6. Survival Analysis
standard relationships.
(d) Cure rate models - definition and data augmentation trick for pos-
terior computation
6
7. Missing Data
tifiability issues.
7
8. Discrete Time Survival Analysis
8
9. Alternatives to Hierarchical Models for Multivariate Binary
outcomes.
pretations.
9
10. Interval Censored Data
(b) How to analyze data of this type using a discrete time Bayesian
survival analysis.
the analysis.
ple)?
10
11. Bayesian Variable Selection and Order Restricted Inference
dictors.
(c) Define a mixture prior for addressing this problem and describe pos-
(d) How does this approach relate to variable selection via SSVS algo-
rithms?
11
12. Poisson Log-Linear and Logistic Regression Cases
(a) Now suppose data are Poisson distributed counts following a log-
linear model.
(e) Modify this prior for variable selection and inferences on ordered
categorical covariates.
12
13. Bayesian Generalized Additive Models with Constraints
(a) Focusing again on the normal or underlying normal data cases, con-
(b) Define a generalized additive model, and place a prior on the un-
straints.
13
Example Exam Problem Set
Question 1:
Suppose that 2500 pregnant women are enrolled in a study and the outcome
clude age of the woman, smoking, socioeconomic status, body mass index,
bleeding during pregnancy, serum level of dde, and several dietary factors.
the GLM, including the link function and distribution (in exponential family
form). Describe (briefly) how estimation and inference could proceed via a
frequentist approach.
14
Possible Solution:
yi ∼ Bernoulli(πi )
where
πi
θi = log , b(θi ) = log(1 + eθi ), a(φ) = φ = 1,
1 − πi
and c(yi , φ) = 0.
Link function:
Any mapping from < → [0, 1]. A convenient choice is the canonical link,
πi
ηi = θi = log ,
1 − πi
which is the logit. The probit and complementary log-log are alternatives.
Frequentist Estimation:
Frequentist Inference:
One can select the important predictors to be included in the model by step-
Alternatively, one can just fit the model with all the predictors and then do
inferences based on the MLEs and asymptotic standard errors. For example,
details) to test for significant differences in fit between the nested models
16
Question 2:
Women are enrolled in a study when they go off of contraception with the
to achieve a pregnancy, whether or not they smoke cigarettes, and their age
statistical model within a Bayesian framework and outline the details of model
fitting and inference (including the form of the posterior density, an outline
of the algorithm for posterior computation, and the approach for addressing
17
Discrete time survival analysis:
350 T rij
1−y
x0i β)yij {1 x0i β)} ij
Y Y
Φ(αj + − Φ(αj +
i=1 j=1
18
We complete a Bayesian specification of the model with prior densities for
α = (α1 , . . . , αT )0 and β,
T
Y
1(α1 > α2 > . . . > αT ) N (αj ; α0j , σα2 j ) N (β; β 0 , Σβ ),
j=1
where the order constraint models the selection process where more fertile
derlying normal variables. In particular, let yij = 1(zij > 0), where zij ∼
N (αj + x0i β, 1) are independent. Posterior computation can then proceed via
2. Sample zij , for all i, j : rij = 1, from its full conditional density, which is
5. Repeat steps 2-4 until apparent convergence and calculate posterior sum-
19
Our primary goal is to address the question: Is cigarette smoking related to
time to pregnancy.
Based on the Gibbs iterates, we can estimate Pr(β1 < 0 | data). If this poste-
rior probability is high (say, greater than 95%) than we have strong evidence
that the hazard of conception is lower for smokers than non-smokers (at least
in the population).
This implies that smoking may be associated with an increased time to preg-
nancy.
20
Question 3:
different birth defects. A physician working with you on the study notes
that certain children have several birth defects, possibly due to defects in im-
latent variable model for analyzing these data and outline (briefly) the details
of the algorithm for posterior computation, and the approach for addressing
21
Multiple binary outcomes:
yi = (yi1 , . . . , yi5 )0 , where yij = 1 if child i has jth defect and 0 otherwse
22
Note that we don’t need to assume probit, we could use other link functions.
To assess the effect of alcohol intake on the different defects, estimate the
For defects having high values of Pr(βj1 > 0 | data), we have evidence of a
23
Question 4:
of the sickness of each pup in each litter, with 1 = healthy, 2 = low birth
weight but otherwise healthy, 3 = malformed, and 4 = dead. The goal of the
study is to see if dose is associated with health of the pup. Describe a model
24
Let yij ∈ {1, 2, 3, 4} be the outcome for the jth pup in the ith litter, and let
A possible model for relating the ordinal ranking of pup health to dose while
Strategy:
sion model.
3. Proceed with posterior computation via Albert and Chib (1993) - provide
some details.
25
The parameter of primary interest is the slope parameter, β, which is inter-
increase in dose.
One assumption being made is that one parameter can be used to characterize
Potentially the shape of the distribution may be completely different and one
Another assumption is that we have non-informative cluster size - that is, the
26