Original article
microscopy and mycobacterial culture during treatment to have low sensitivity and modest specificity in predicting relapse.11 Since 2009, several high-quality randomised controlled trials (RCTs) have been published that evaluate the WHO standard regimen and report standardised outcomes, including relapse.12–19 Individual patient data from three of these studies12–14 were published on the Critical Path TB Clinical Trial Data-Sharing Platform,20 providing an opportunity for a meta-analysis of individual patient data from phase III trials in which at least one group received the WHO standard regimen. The primary purpose of our study was to determine clinical and microbiological factors associated with relapse in patients treated with the WHO standard regimen. Our secondary aim was to evaluate the accuracy of each factor at predicting an outcome of relapse.
Methods
Search strategy and selection criteria
The studies considered for this individual patient-level data (IPD) meta-analysis were identified from a systematic review evaluating the efficacy of dosing schedules in first-line pulmonary TB therapy.21 Study selection for that systematic review has been described elsewhere in detail.21 In brief, studies were restricted to high-quality RCTs with treatment regimens that used rifampin for 6 months or longer. For this analysis, studies were also restricted to a 20-year period from 1 March 1996 to 1 March 2016 given the limitations in trial data availability. To be consistent with current WHO recommendations, only studies reporting a trial arm using the WHO standard regimen were included in our analysis.
Data collection
Corresponding authors of all identified studies were contacted and invited to share trial data. Study data were included if authors agreed to submit individual patient data from published studies or if the individual patient data were available on the Critical Path TB Clinical Trial Data-Sharing Platform. We excluded from our analysis studies that did not provide individual patient data. We only used data from participants assigned to the control arms in each study, that is, we excluded participants from the various intervention arms. Participants were included if they: (1) had successfully completed adequate treatment with the WHO standard regimen for newly diagnosed, microbiologically confirmed, pulmonary TB and (2) were classified as either having treatment success or relapse at the end of the study follow-up period.
The individual patient data obtained included patient demographics, clinical markers of disease severity, treatment regimen doses and treatment outcomes. We checked all data for internal consistency and compared them with the trial protocol and published reports. Any inconsistencies were checked with the data provider. Variables from each original database were extracted, their meaning and coding verified and then mapped to a common set of variables for all patients. Missing data were treated as such (ie, imputation was not performed).
Statistical analysis
Our primary outcome of interest was relapse, defined as positive smears and/or cultures requiring therapy after successful treatment completion, according to specific study protocols. Participants were not categorised as relapsed if genotyping demonstrated reinfection. Participants re-treated without full microbiological confirmation were counted as relapsed if they […] primary analysis was restricted to participants included in the original studies’ per-protocol analyses. A sensitivity analysis with all participants included in the original studies’ modified intention-to-treat analyses was also performed. Additional analyses also examined relapse determined at 12 months post-treatment completion and the combined outcome of relapse and reinfection for all included studies.
We considered two types of patient-level factors in our meta-analysis: (1) patient characteristics and (2) combinations of characteristics. Patient-level factors were chosen a priori and based on clinical relevance as well as clinical expertise and experience. We performed a one-step IPD meta-analysis, using random intercept regression models to estimate the pooled ORs and 95% CIs of relapse for each type of patient-level factor.22 We performed both crude and multipredictor analyses. For crude analysis, an unadjusted mixed logistic regression model was used to estimate ORs for each prespecified marker. For multipredictor analysis, mixed logistic regression was performed including covariates chosen a priori to account for potential confounding, with the study as a random effect.
Using the multipredictor regression models, we generated receiver operating characteristic (ROC) curves to express the predictive accuracy of each model in its ability to distinguish between patients who will relapse and those who will experience treatment success. We calculated the area under the curve
Figure 1 Flowchart of study and participant selection.
Table 1 Characteristics of studies meeting inclusion criteria
| Study (year) | Population | Treatment frequency* | Duration of follow-up | DST based exclusion | MIRU-VNTR confirmed relapse | Total assigned to receive WHO standard regimen | Total eligible for analysis† | Total relapsed | Risk of relapse (%) |
|---|---|---|---|---|---|---|---|---|---|
| Oflotub1412 (2014) | Adults with newly diagnosed, microbiologically confirmed, pulmonary tuberculosis | Daily | 24 months | RIF | No‡ | 846 | 577 | 44 | 7.6 |
| REMoxTB1213 (2014) | Adults with newly diagnosed, microbiologically confirmed, pulmonary tuberculosis | Daily | 18 months | RIF, FQN | Yes | 639 | 452 | 18 | 4.0 |
| Rifaquin1314 (2014) | Adults with newly diagnosed, microbiologically confirmed, pulmonary tuberculosis | Daily | 18 months | RIF, INH, FQN, ETH | Yes | 275 | 160 | 5 | 3.1 |
*Daily defined as 5 or more days per week.
†Classified as either treatment success or relapse in per-protocol analysis set.
‡Composite outcome of relapse or reinfection.
DST, drug susceptibility testing; ETH, ethambutol; FQN, fluoroquinolone; INH, isoniazid; RIF, rifampin.
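The risk of relapse column in table 1 is the number relapsed divided by the number eligible for analysis. The short Python sketch below simply re-derives those percentages and the pooled figure from the counts in the table; the variable and function names are illustrative and not part of the original analysis.

```python
# Counts transcribed from table 1: study -> (eligible for analysis, relapsed).
studies = {
    "Oflotub": (577, 44),
    "REMoxTB": (452, 18),
    "Rifaquin": (160, 5),
}


def risk_percent(relapsed: int, eligible: int) -> float:
    """Relapse risk as a percentage, rounded to one decimal place."""
    return round(100 * relapsed / eligible, 1)


# Per-study risk of relapse (%).
per_study = {name: risk_percent(r, n) for name, (n, r) in studies.items()}

# Pooled counts across the three control arms.
total_eligible = sum(n for n, _ in studies.values())
total_relapsed = sum(r for _, r in studies.values())
pooled = risk_percent(total_relapsed, total_eligible)

print(per_study, total_eligible, total_relapsed, pooled)
```

The pooled counts (1189 eligible, 67 relapsed) agree with the totals reported in the Results.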
(AUC) or c-statistic for each ROC curve and performed bootstrap resampling among all patients to obtain 95% CIs. The predictive power of each model was assessed using AUC and compared using DeLong’s Test.23 The negative predictive values (NPV), positive predictive values (PPV) and maximum sensitivity, when specificity was set to 0.95 and 0.80, were also determined. For added insight regarding classification by the different models, we also assessed the net reclassification index (NRI). Reclassification tables for patients who did or did not experience relapse were constructed using <5%, 5% to 15% and >15% predicted probability categories. Heterogeneity between studies was accounted for by having the studies as a random effect.
Analyses were conducted using RStudio (V.1.0.44: The R Foundation for Statistical Computing).24 Random intercept logistic regression models were created using lme4 (V.1.1–13).25 Receiver operating characteristic curve analysis was performed using pROC (V.1.9.1).26 Bootstrap validation and NRI calculations were performed using rms (V.5.1.0).27 P values less than 0.05 were considered statistically significant in both unadjusted and multipredictor analyses.
The study was performed per PRISMA-IPD guideline recommendations28 and prospectively registered in the PROSPERO database (CRD42016040050).
Results
Of the 56 studies identified in the systematic review, 12 were eligible for inclusion in our analysis. Ultimately, patient-level data were obtained from three recent multicentre RCTs available on the Critical Path TB Clinical Trial Data-Sharing Platform (figure 1).12–14 Characteristics of excluded studies are presented in table S1 in the online supplementary appendix.
Table 1 summarises the characteristics of the included studies. Each of the three studies was a double-blind, multicentre, phase III trial conducted in resource-limited, high TB incidence countries. The ReMoxTB and RIFAQUIN trials had defined relapse at a follow-up duration of 18 months; OFLOTUB used 24 months and also provided outcomes determined at 18 months.
HIV status was assessed at enrolment in all three studies. Individuals coinfected with HIV who required antiretroviral therapy (ART) were not eligible for the Oflotub and ReMoxTB trials. The RIFAQUIN trial excluded all patients with resistance to isoniazid, rifampin, ethambutol or fluoroquinolones at study entry and provided directly observed therapy at the health facility during the intensive phase and relative-supervised treatment during the continuation phase. ReMoxTB excluded patients with rifampin or fluoroquinolone resistance and also administered therapy on a daily-supervised basis. OFLOTUB excluded patients with rifampin resistance and provided directly observed therapy 6 days a week during the intensive phase, and then assessed adherence every 2 weeks by a count of tablets remaining in weekly treatment boxes. MIRU-VNTR confirmation data were used to distinguish between relapse and reinfection, except in the OFLOTUB trial where genotyping results were not available.
Table 2 Pooled baseline characteristics of patients included in the […]
| Baseline characteristics | Total patients (n=1189) |
|---|---|
| Age, years, median (IQR) | 29.0 (23–37) |
| Weight, kg, median (IQR) | 52.0 (47.0–58.1) |
| Sex | |
| Male | 823 (69.2) |
| Female | 366 (30.8) |
| HIV coinfection | |
| Negative | 1010 (85.1) |
| Positive | 177 (14.9) |
| Cavitary disease at baseline | |
| Yes | 690 (63.8) |
| No | 391 (36.2) |
| Smear status at month 2 | |
| Negative | 936 (82.0) |
| Positive | 205 (18.0) |
| Culture status at month 2 | |
| Negative | 879 (78.6) |
| Positive | 240 (21.4) |
Data were missing for HIV coinfection (2 patients), cavitary disease at baseline (108 patients), smear status at month 2 (48 patients) and culture status at month 2 (70 patients).
Combining the control arms of the three trials yielded data on 1760 participants, all assigned to receive WHO standard therapy. Of those, 1189 were eligible for our primary analysis, including
67 (5.6%) that relapsed (figure 1). Pooled demographic and clinical characteristics of the total population are shown in table 2, with risk of relapse by specified factors shown in table 3.
Table 3 Risk of relapse by specified factor
| Factor | Total number of patients | Number of patients who experienced relapse | Risk of relapse (%) |
|---|---|---|---|
| Total population | 1189 | 67 | 5.6 |
| Sex | | | |
| Male | 823 | 54 | 6.6 |
| Female | 366 | 13 | 3.6 |
| HIV coinfection | | | |
| Positive | 177 | 18 | 10.2 |
| Negative | 1010 | 49 | 4.9 |
| Cavitary disease at baseline | | | |
| Yes | 690 | 43 | 6.2 |
| No | 391 | 22 | 5.6 |
| Smear status at 2 months | | | |
| Positive | 205 | 17 | 8.3 |
| Negative | 936 | 49 | 5.2 |
| Culture status at 2 months | | | |
| Positive | 240 | 21 | 8.8 |
| Negative | 879 | 44 | 5.0 |
| Presence of baseline cavitary disease and positive smear at month 2 | | | |
| Yes | 158 | 16 | 10.1 |
| No | 923 | 49 | 5.3 |
| Presence of baseline cavitary disease and positive culture at month 2 | | | |
| Yes | 178 | 18 | 10.1 |
| No | 881 | 46 | 5.2 |
Table 4 Summary of association of patient level factors with treatment relapse
| Factor | Crude OR (95% CI) | Multipredictor OR (95% CI)* |
|---|---|---|
| Sex | | |
| Female | Reference | Reference |
| Male | 1.9 (1.0 to 3.5) | 2.1 (1.1 to 4.0) |
| Age (years) | | |
| Mean (SD) | 1.0 (1.0 to 1.1) | 1.0 (1.0 to 1.1) |
| Weight (kg) | | |
| Mean (SD) | 1.0 (0.9 to 1.0) | 1.0 (0.9 to 1.0) |
| HIV coinfection | | |
| Negative | Reference | Reference |
| Positive | 2.2 (1.2 to 3.9) | 2.6 (1.4 to 4.6) |
| Cavitary disease at baseline | | |
| No | Reference | Reference |
| Yes | 1.3 (0.7 to 2.2) | 1.2 (0.7 to 2.2) |
| Smear status at 2 months | | |
| Negative | Reference | Reference |
| Positive | 1.9 (1.0 to 3.4) | 1.8 (1.0 to 3.4) |
| Culture status at 2 months | | |
| Negative | Reference | Reference |
| Positive | 2.0 (1.2 to 3.4) | 1.8 (1.0 to 3.1) |
| Presence of both baseline cavitary disease and positive smear at month 2 | | |
| No | Reference | Reference |
| Yes | 2.3 (1.3 to 4.2) | 2.3 (1.3 to 4.2) |
| Presence of both baseline cavitary disease and positive culture at month 2 | | |
| No | Reference | Reference |
| Yes | 2.3 (1.3 to 4.0) | 2.1 (1.2 to 4.0) |
*Covariates included age, sex and HIV status at baseline.
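The crude ORs in table 4 can be roughly cross-checked against the counts in table 3. The Python sketch below computes an odds ratio and a Wald (Woolf) 95% CI from a pooled 2×2 table for HIV coinfection. It is an illustrative approximation only: it ignores the study-level random intercept used in the paper's mixed logistic regression, and the function name `crude_odds_ratio` is hypothetical.

```python
import math


def crude_odds_ratio(events_exposed, n_exposed, events_unexposed, n_unexposed, z=1.96):
    """Odds ratio with a Wald (Woolf) confidence interval from a 2x2 table.

    Simplification: studies are pooled directly, with no random intercept.
    """
    a = events_exposed                    # exposed, relapsed
    b = n_exposed - events_exposed        # exposed, no relapse
    c = events_unexposed                  # unexposed, relapsed
    d = n_unexposed - events_unexposed    # unexposed, no relapse
    odds_ratio = (a / b) / (c / d)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# HIV coinfection, counts from table 3: 18/177 positive vs 49/1010 negative.
hiv_or, hiv_lo, hiv_hi = crude_odds_ratio(18, 177, 49, 1010)
print(f"OR {hiv_or:.1f} (95% CI {hiv_lo:.1f} to {hiv_hi:.1f})")
```

This pooled calculation gives an OR of about 2.2 with an interval of roughly 1.3 to 3.9, close to the crude estimate of 2.2 (1.2 to 3.9) reported from the random-intercept model; small differences are expected because the sketch does not account for clustering by study.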
As seen in table 4, HIV coinfection had the highest odds of relapse in univariable analysis among patient characteristics (OR 2.2 (95% CI 1.2 to 3.9)). When combinations of characteristics were examined, the odds of relapse for those with baseline cavitary disease and positive smear at 2 months were very similar to the odds of relapse for those with baseline cavitary disease and 2-month culture positivity (OR 2.3 (95% CI 1.3 to 4.2) vs OR 2.3 (95% CI 1.3 to 4.0)). When HIV coinfection was combined with the presence of positive culture at 2 months and baseline cavitary disease, the OR for relapse was 5.7 (95% CI 1.7 to 17.9); however, due to limited sample size, CIs were quite wide (n combined risk factor=16, n relapse=4).
In multipredictor analysis, HIV coinfection (OR 2.6 (95% CI 1.4 to 4.6)), the presence of baseline cavitary disease with positive smear at 2 months (OR 2.3 (95% CI 1.3 to 4.3)) and the presence of baseline cavitary disease with positive culture at 2 months (OR 2.1 (95% CI 1.2 to 3.8)) had the highest odds of relapse.
Comparative analysis of ROC curves is summarised in figure 2. As shown in table 5, Model 1 (the reference model) contained age, sex and HIV status and yielded a fair AUC value of 0.66 (95% CI 0.59 to 0.73). The addition of clinical risk factors, or combinations of clinical risk factors, resulted in small but non-statistically significant increases in predictive power (DeLong’s test p>0.05). Bootstrap validation indicated that our results showed little evidence of overfitting, that is, the optimism in the estimated AUC was less than 0.04 (table S2 in online supplementary appendix). Table 6 shows sensitivity at fixed specificities for all multipredictor models.
Overall, results with NRI were similar to our findings with AUC; few participants had clinically meaningful changes in risk categories, which resulted in non-significant net reclassification improvement (table S3 in online supplementary appendix).
Our sensitivity analyses with the modified intention-to-treat analysis set, with relapse determined at 12 months post-treatment completion, and when combining the outcomes of relapse and reinfection for all studies, yielded results very similar to those in our primary analysis (supplementary tables S4–S7 in online supplementary appendix).
Discussion
Over the past two decades, several studies have reported 2-month culture positivity or cavitary disease as independent predictors of relapse. Indeed, recent ATS/CDC/IDSA guidelines cite a 20% relapse rate in patients with cavitary disease at baseline and 2-month sputum culture positivity.10 However, studies cited for these guidelines did not use daily or five times a week therapy under trial conditions. Instead, the cited studies used two times a week regimens or analysed outcomes under cohort conditions and were limited by a small sample size.8 9 Using individual patient data meta-analysis, we analysed clinical and microbiological factors associated with relapse in patients with
new pulmonary TB treated with the WHO Standard Regimen. The most notable finding of our study was that the presence of cavitary disease with 2-month positive smear status was associated with increased odds of relapse, and this association was maintained even after adjusting for key covariates and in numerous sensitivity analyses. Moreover, the strength of this association was similar to that of commonly accepted determinants of relapse: HIV coinfection and cavitary disease with 2-month positive culture.
While our findings demonstrated that HIV coinfection, cavitary disease with 2-month positive culture and cavitary disease with 2-month positive smear were associated with increased odds of relapse, we also identified that the ability of these markers to reliably discriminate between individuals who will or will not experience an outcome of relapse remains modest. All clinical and microbiological markers investigated in our analysis had poor sensitivities when specificity was set to 0.95 and 0.80 and lacked adequate positive predictive values, indicating that despite having the risk factor, an individual will not necessarily experience relapse. The accuracy of our models presented in table 4 is similar to that recently reported by Phillips et al for sputum-based markers of bacillary clearance.29 Horne et al also reported that 2-month culture and 2-month smear status were poorly sensitive and moderately specific as predictors of relapse.11 Overall, these results imply that discrimination between low-risk and higher-risk patients remains poor for current clinical and microbiological markers in predicting an outcome of relapse and better predictors are needed.
Figure 2 Performance of each of the multipredictor models in predicting an outcome of relapse.
Until improved markers are available, the combined marker of cavitary disease and 2-month smear positivity may be the best currently available option for identifying persons at higher risk for relapse, particularly in low-resource settings. Using 2-month smear and chest radiography results, we were able to identify a subgroup of participants with a relapse risk of at least 10%, […]tions. Given the higher risk of relapse observed in this group and the widespread availability of chest radiography30 and smear microscopy, our findings suggest that this combined marker be used in future trials to assess which patients may benefit from treatment prolongation or closer post-treatment follow-up.
There were several limitations to this study. First, we were only able to obtain individual patient data from 3 of the 12 eligible studies. Five principal investigators could not be contacted despite extensive efforts. One investigator agreed to forward the data; however, the data were not received. Another investigator could not be reached following initial contact. Two investigators refused to participate; reasons for not participating included lack of resources and time constraints. Since not
Table 6 Multipredictor model for relapse at set specificities
| Model | Predictors | Maximum sensitivity at specificity=0.95 (95% CI) | NPV at specificity=0.95 (95% CI) | PPV at specificity=0.95 (95% CI) | Maximum sensitivity at specificity=0.80 (95% CI) | NPV at specificity=0.80 (95% CI) | PPV at specificity=0.80 (95% CI) |
|---|---|---|---|---|---|---|---|
| 1 | Reference model (age, sex, HIV status) | 0.18 (0.09 to 0.28) | 0.95 (0.95 to 0.96) | 0.18 (0.10 to 0.25) | 0.46 (0.34 to 0.57) | 0.96 (0.95 to 0.97) | 0.12 (0.09 to 0.15) |
| 2 | Reference model plus baseline cavitary disease | 0.18 (0.09 to 0.29) | 0.95 (0.94 to 0.95) | 0.19 (0.11 to 0.27) | 0.35 (0.25 to 0.48) | 0.95 (0.94 to 0.96) | 0.10 (0.07 to 0.13) |
| 3 | Reference model plus positive smear at 2 months | 0.21 (0.12 to 0.32) | 0.95 (0.95 to 0.96) | 0.21 (0.13 to 0.28) | 0.42 (0.29 to 0.56) | 0.96 (0.95 to 0.97) | 0.12 (0.08 to 0.15) |
| 4 | Reference model plus positive culture at 2 months | 0.18 (0.09 to 0.29) | 0.95 (0.94 to 0.96) | 0.19 (0.10 to 0.27) | 0.46 (0.32 to 0.60) | 0.96 (0.95 to 0.97) | 0.12 (0.09 to 0.16) |
| 5 | Reference model plus presence of both baseline cavitary disease and positive smear at 2 months | 0.22 (0.12 to 0.33) | 0.95 (0.94 to 0.96) | 0.22 (0.14 to 0.30) | 0.41 (0.28 to 0.33) | 0.96 (0.95 to 0.97) | 0.12 (0.08 to 0.15) |
| 6 | Reference model plus presence of both baseline cavitary disease and positive culture at 2 months | 0.20 (0.11 to 0.31) | 0.95 (0.94 to 0.96) | 0.21 (0.12 to 0.29) | 0.46 (0.33 to 0.59) | 0.95 (0.95 to 0.97) | 0.13 (0.10 to 0.16) |
NPV, negative predictive value; PPV, positive predictive value.
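To illustrate why PPV stays low in table 6 even at high specificity, the Python sketch below applies the standard relationship between sensitivity, specificity and prevalence to the Model 1 point estimates. It assumes the prevalence of relapse is the pooled 67/1189 from the primary analysis. It is a back-of-envelope check, not the bootstrap procedure used in the paper, and `predictive_values` is a hypothetical helper name.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """PPV and NPV from sensitivity, specificity and outcome prevalence."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Assumed prevalence: pooled relapse proportion in the primary analysis set.
prevalence = 67 / 1189

# Model 1 (reference model) point estimates from table 6.
ppv_95, npv_95 = predictive_values(0.18, 0.95, prevalence)
ppv_80, npv_80 = predictive_values(0.46, 0.80, prevalence)
print(round(ppv_95, 2), round(npv_95, 2), round(ppv_80, 2), round(npv_80, 2))
```

With relapse occurring in under 6% of patients, false positives outnumber true positives even at a specificity of 0.95, so the PPV stays near 0.18 while the NPV is close to 0.95, consistent with the Model 1 row of table 6.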
all eligible patient data were included in our analysis, retrieval bias may have been introduced.31 Unfortunately, due to limited reporting of outcomes within subgroups, we were unable to test if results including the additional trials differed from our findings, although as seen when comparing table 1 with online supplementary table S1, the characteristics and outcomes of patients in the included and excluded studies are similar. We were able to perform a meta-analysis of relapse proportions using the published study data and found the relapse proportion to be 4% (95% CI 3% to 6%), a range that includes our relapse proportion of 5.6% (figure S1 in online supplementary appendix). The small number of available studies also prevented reliable testing for heterogeneity; this was accounted for by having the studies set as a random effect. From the obtained data, we were missing variables that have been previously identified as factors increasing the risk of relapse: <5% weight gain during the first 2 months of treatment and diabetes mellitus.32–35 Regardless, we did have near complete and high-quality data on other key patient characteristics.
A second limitation is that variation in study protocols meant we were unable to distinguish true relapse from reinfection for all cases. This could have biased the estimation of the effect of several demographic and clinical factors towards the null; it might also have resulted in an overestimation of relapse risk associated with HIV coinfection. Given that relapse is more likely to occur soon after treatment completion,36 we performed a sensitivity analysis examining outcomes at 12 months post-treatment completion with no substantive change in results. Last, we analysed the per-protocol analysis sets from RCTs. This clearly limits the generalisability of these data to programmatic conditions; however, we felt that an understanding of per-protocol outcomes under idealised conditions would be key when informing programmatic policy.
There were also a number of strengths to this study. First, by using individual patient data, we had greater power to identify predictors of relapse than traditional, aggregate data meta-analyses.37 Second, the individual patient data were of high quality and came from three independent, large, high-quality multinational RCTs, each with detailed reporting and robust, microbiologically confirmed treatment outcomes. Using these detailed data, we were able to analyse participants who all received identical treatment regimens and duration of therapy. Finally, we presented results using measures of association and of diagnostic accuracy, and these results were maintained in numerous sensitivity analyses.
This individual patient data meta-analysis of 1189 patients treated with standard, first-line therapy suggests that individuals with the presence of both cavitary disease and 2-month positive smear status are at an increased risk of relapse, even after accounting for other risk factors. These tests, which are widely available even in resource-limited settings, may be the best currently available for identifying persons at high risk of relapse, and further investigation is required to assess whether this combined factor can be used to indicate different treatment requirements in clinical practice.
Author affiliations
1 TB Services, British Columbia Centre for Disease Control, Vancouver, British Columbia, Canada
2 Centre for Healthcare Innovation, University of Manitoba, Winnipeg, Manitoba, Canada
3 Respiratory Epidemiology and Clinical Research Unit, Centre for Outcomes Research and Evaluation, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
4 Department of Epidemiology, Biostatistics and Occupational Health, Faculty of Medicine, McGill University, Montreal, Quebec, Canada
5 McGill International TB Centre, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
6 Division of Respiratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
Acknowledgements We would like to thank Leslie Chiang for his assistance with the initial data cleaning.
Contributors JCJ, FAK and KR initiated the project and were responsible for the design of the protocol. KR was responsible for data management and performed the statistical analysis with advice from RFB and AB. KR, RFB, AB, JRC, DM, FAK and JCJ interpreted the data. KR and JCJ wrote the initial draft of the manuscript. RFB, AB, JRC, DM and FAK were responsible for critical revisions of the manuscript and provided important intellectual content. All authors gave their final approval of the version submitted for publication.
Funding JCJ is supported by the Michael Smith Foundation for Health Research. FAK received salary support from the Fonds de Recherche Québec Santé. The researchers were independent of the funders, which had no role in this study.
Competing interests None declared.
Patient consent Not required.
Ethics approval University of British Columbia Clinical Ethics Review Board (H16-01282).
Provenance and peer review Not commissioned; externally peer reviewed.
15 Blanc FX, Sok T, Laureillard D, et al. Earlier versus later start of antiretroviral therapy in HIV-infected adults with tuberculosis. N Engl J Med 2011;365:1471–81.
16 Lienhardt C, Cook SV, Burgos M, et al. Efficacy and safety of a 4-drug fixed-dose combination regimen compared with separate drugs for treatment of pulmonary tuberculosis: the Study C randomized controlled trial. JAMA 2011;305:1415–23.
17 Mfinanga SG, Kirenga BJ, Chanda DM, et al. Early versus delayed initiation of highly active antiretroviral therapy for HIV-positive adults with newly diagnosed pulmonary tuberculosis (TB-HAART): a prospective, international, randomised, placebo-controlled trial. Lancet Infect Dis 2014;14:563–71.
18 Swaminathan S, Narendran G, Venkatesan P, et al. Efficacy of a 6-month versus 9-month intermittent treatment regimen in HIV-infected patients with tuberculosis: a randomized clinical trial. Am J Respir Crit Care Med 2010;181:743–51.
19 Johnson JL, Hadad DJ, Dietze R, et al. Shortening treatment in adults with noncavitary tuberculosis and 2-month culture conversion. Am J Respir Crit Care Med 2009;180:558–63.
20 New TB clinical trial data-sharing platform available for researchers | Critical Path Institute. https://c-p (accessed 24 May 2017).
21 Johnston JC, Campbell JR, Menzies D. Effect of intermittency on treatment outcomes in pulmonary tuberculosis: An updated systematic review and metaanalysis. Clin Infect Dis 2017;64:1211–20.
22 Turner RM, Omar RZ, Yang M, et al. A multilevel model framework for meta-analysis of clinical trials with binary outcomes. Stat Med 2000;19:3417–32.
23 DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.
