Assume an already propensity score matched cohort.
One of the variables included in the propensity score model is sex.
The main cox model investigates the comparative effect of two drugs for cancer remission.
This is the main effect I was looking for:
coxph(Surv(time, status) ~ drug, data = data)
Now I want to investigate a possible moderation (by sex).
OPTION 1:
I could use the original cohort from above investigating a possible moderation:
coxph(Surv(time, status) ~ drug:sex, data = data)
Thus, I have two models: One for the main effect estimate and one for the moderation.
OPTION 2:
According to Green & Stuart (2014), estimating separate propensity score models could show the best balance.
Thus, I created two datasets (for men and women) and calculated the propensity score for matching separately.
Next, I combined both matched datasets and calculated:
coxph(Surv(time, status) ~ drug:sex, data = data)
As I used 1:1 matching using a caliper, the original data and the combined stratified approach do have different sample sizes due to the fact that not everyone got a matched reference as well as different matched reference persons.
Questions:
For OPTION 1, is it legit to calculate one model for the main effect
drug
and one model for possible direct moderation usingdrug:sex
? Or do I need to include both in one model asdrug*sex
?For OPTION 2, do I need to calculate the main effect in the combined sample? Or do I use the combined data just to investigate the moderation and use my original dataset for the main effect?
Assume I want to investigate another moderation by age categories (<= 40 and > 40 years). Thus, I would need to calculate separate propensity score models, match them and combine both separate datasets. But the main effect will of course be different between the combined data that has previously been stratified by sex and the one that has been stratified by age category.In general, how do I derive separate hazard ratios and CIs for both, men and women?