Tuberculosis-associated respiratory impairment and disability in children, adolescents and adults: protocol for a systematic review and individual participant data meta-analysis

    Approximately 2% of the global population has survived tuberculosis (TB). Increasing evidence indicates that a significant proportion of pulmonary TB survivors develop TB-associated respiratory impairment and disability—commonly referred to as post-TB lung disease—marked by impaired respiratory function, persistent symptoms and activity limitations. However, the prevalence, risk factors and progression of TB-associated respiratory disability throughout the life course are not well understood. To address these gaps, we will undertake a systematic review and individual participant-level data meta-analysis (IPD-MA) focusing on TB-associated respiratory impairment and disability in children, adolescents and adults successfully treated for pulmonary TB.

    We will systematically search MEDLINE, Embase, CENTRAL, Global Index Medicus and medRxiv for original studies investigating TB-associated respiratory impairment and disability in people of all ages who have completed treatment for microbiologically confirmed or clinically diagnosed pulmonary TB. Authors of eligible studies will be invited to contribute deidentified data and form a collaborative group. Primary outcomes will be (1) abnormal lung function based on spirometry parameters and (2) chronic respiratory symptoms. We will estimate the overall and subgroup-specific prevalence of each outcome through IPD-MA. Next, we will develop clinical prediction tools assessing the risk of future TB-associated respiratory impairment and disability. Finally, we will use stepwise hierarchical modelling to identify epidemiological determinants of respiratory impairment and disability.

    This study has been approved by the ethics review boards at the Rhode Island Hospital (2138217-2) and the Research Institute of the McGill University Health Centre (2024-10345). Individual study authors will be required to obtain institutional approval prior to sharing data. Results will be disseminated through open-access, peer-reviewed publications and conference presentations.

    CRD42024529906.

    http://creativecommons.org/licenses/by-nc/4.0/

    This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

    If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

    Globally, over 10 million people develop tuberculosis (TB) each year,1 and as of 2020, an estimated 155 million individuals who had TB disease in the preceding 40 years were still alive.2 Approximately 86% of people who had TB disease had pulmonary TB,1 3 which even after successful treatment, may lead to respiratory sequelae, including restrictive lung disease, chronic obstructive pulmonary disease and infectious pulmonary complications.4 5 This spectrum of TB-associated respiratory disorders, commonly known as post-TB lung disease—a misnomer as the pathogenesis occurs during TB disease—encompasses both respiratory impairment and disability.6 7 Respiratory impairment is characterised by abnormal lung function, commonly measured through spirometry or structural assessments. Disability relates to functional impacts such as chronic respiratory symptoms and limitations or restrictions in daily activities and participation.8

    A recent systematic review and aggregate-level meta-analysis, which included 61 original studies and over 40 000 participants, found 59% (95% CI 49%, 69%) of people who survived TB had abnormal spirometry and 25% (95% CI 19%, 32%) reported breathlessness.9 Additional reviews have been conducted to identify risk factors for TB-associated respiratory impairment and disability, but consistent, robust associations have not been found.10 11 It remains unclear who is at the highest risk for TB-associated respiratory impairment and disability. Consequently, guidelines recommend screening all pulmonary TB survivors for respiratory impairment and disability,12–14 a practice that is not feasible in the resource-limited settings where the majority of TB survivors reside.1 2 An improved understanding of risk factors and epidemiological determinants of TB-associated respiratory impairment and disability is crucial to enhancing the efficiency and feasibility of screening.

    Another significant limitation in the existing evidence is the underrepresentation of children and adolescents in studies of TB-associated respiratory impairment and disability, along with a lack of age-disaggregated outcomes.15 Research across all age groups is essential due to the differences between children, adolescents and adults in the host response to Mycobacterium tuberculosis, clinical presentation of pulmonary TB and ongoing lung development in children and adolescents.6

    Individual participant data meta-analysis (IPD-MA) can overcome many limitations of aggregate methods through harmonisation of variables and outcomes, standardised analysis, analysis of subgroups not reported in primary studies such as age-stratified results and increased sample size.16 Using IPD-MA, we can identify subgroups at high risk for TB-associated respiratory impairment and disability, supporting risk stratification of individuals for screening. Additionally, this approach allows us to assess the underlying epidemiological determinants of TB-associated respiratory impairment and disability. With a large body of literature now available, an IPD-MA is an attractive approach to improve the targeting and feasibility of screening strategies for TB-associated respiratory impairment and disability.

    The overall aim of this systematic review and IPD-MA is to estimate the burden, risk, and determinants of TB-associated respiratory impairment and disability in children, adolescents and adults treated for pulmonary TB. We will address the following specific objectives. First, we will estimate the prevalence of the two components of TB-associated respiratory impairment and disability: (1) impairment to respiratory function and structure and (2) symptoms and functional limitations. As part of this aim, we will estimate prevalence among subgroups at different times since TB diagnosis and examine risk factors. Second, we will develop and validate risk prediction models to predict future TB-associated respiratory (1) impairment and (2) disability. Third, we will identify the epidemiological and clinical determinants of (1) respiratory impairment and (2) disability, using a conceptual framework and hierarchical approach.

    The systematic review and IPD-MA will be reported according to the 2020 PRISMA guidelines.17 This protocol has been reported according to the PRISMA-P guidelines18 and has been prospectively registered with PROSPERO (CRD42024529906). Any key changes or amendments will be documented there.

    In preparation for this project, we conducted an initial literature search of MEDLINE, Embase, CENTRAL, Global Index Medicus and medRxiv for original studies published between 1 January 2004 and 26 April 2024, using a comprehensive search strategy (online supplemental appendix) developed in collaboration with a medical librarian experienced in systematic reviews. We selected 2004 as the earliest publication year as there is minimal chance of individual participant-level study data being held for over 20 years. Two reviewers (KR and SAL) screened titles, abstracts, full texts and any studies identified as relevant from reviews and reference lists of eligible articles. Disagreements regarding inclusion or exclusion were resolved by a third reviewer (JRC). Retrieved references were uploaded into Zotero (Center for History and New Media, George Mason University), a reference management software, for deduplication and then subsequently imported into Covidence (Veritas Health Innovation, Australia), a web-based platform designed to streamline the systematic review process. We will update this search through March 2025 to identify new studies published since our initial search.

    Our study population will include children (0–9 years old), adolescents (10–19 years old) and adults (≥20 years old) with pulmonary TB that is microbiologically confirmed—through smear microscopy, mycobacterial culture and/or molecular assays including GeneXpert—or clinically diagnosed, and in the case of children, meeting established clinical criteria.19 Wherever possible, our comparator population will include children, adolescents or adults without a history of pulmonary TB. In the absence of a formal comparison group, the comparator will be assumed (eg, for spirometry and other lung function measurements, international reference values for age, sex and height will be used as the comparator).

    Studies will be eligible for inclusion if they meet all of the following criteria: (1) prospective or retrospective cohorts, cross-sectional studies or clinical trials, (2) included ≥10 participants who completed treatment for pulmonary TB, (3) measure and report at least one aspect of TB-associated respiratory disability, as detailed below and (4) evaluate ≥80% of all participants for these outcomes to minimise selection bias associated with selective testing. We will include studies written in any language.

    Our primary outcome will encompass the two aspects of TB-associated respiratory impairment and disability: (1) impairments to respiratory function and structure and (2) chronic respiratory symptoms and/or limitations in daily activities and participation. These outcomes are based on the terminology used by the WHO International Classification of Functioning, Disability and Health, a classification system that provides a standard language and framework for describing health and health-related states.8

    Our primary outcome for impairments to respiratory function and structure will be abnormal lung function based on prebronchodilator spirometry parameters (table 1). As secondary outcomes, we will also consider other impairments to respiratory function and structure, including (1) postbronchodilator spirometry, (2) prebronchodilator and postbronchodilator oscillometry, (3) measures of lung volume, (4) diffusion capacity measurements, (5) other tidal breathing techniques and (6) the presence of bronchiectasis.

    Table 1

    Outcome measures of tuberculosis-associated respiratory impairment and disability

    Our primary outcome for disability will be chronic respiratory symptoms (table 1). As secondary outcomes, we will also consider: (1) grades 3–5 on the Medical Research Council (MRC) Dyspnoea Scales,20 (2) grades 2–4 on the modified MRC Dyspnoea Scale,21 (3) values higher than ‘moderate breathlessness’ on the Borg Dyspnoea Scale22 and (4) 6 min Walk Test results below predicted lower limit of normal, calculated using standard equations based on age, sex, height and weight.23

    Corresponding authors of eligible studies will be contacted via email and invited to join the collaborative group and share their deidentified study data. If there is no response from the author within 4 weeks, we will try a second time. If the author does not respond, we will attempt to contact other authors. If the author(s) do not respond or indicate that the data are unavailable or cannot be shared due to access restrictions, we will note both of these occurrences.

    After accepting the invitation to collaborate, signing a data transfer agreement and obtaining institutional approvals, authors will transfer their data securely via a mutually agreed on secure data-sharing platform. Data will be housed on a secure server at The Research Institute of the McGill University Health Centre.

    From each eligible study, we will collect study-level and individual-level variables. Study-level variables will include country, funding source, country-level health characteristics, study design, population, aims, recruitment period and test(s) used to measure outcomes. Individual-level variables will include demographic, clinical, radiographic, microbiologic and outcome data (table 2). All received study data will be reviewed for missing, incomplete, or implausible data and compared against published information; authors will be further consulted for clarifications. As age and outcome data are critical pieces of information for this review, prior to processing, we will exclude all participants with missing information on age or those without an outcome measure of interest. We will standardise outcomes and covariates between studies using systematic harmonisation methodology.24 Study-specific data items will be processed into a common format for analysis.

    Table 2

    Individual-level variables to be requested from study authors

    We will quantify individual-level missing data in each study. We will impute missing data using multiple imputations via chained equations (R package mice) while respecting participant clustering by source study.25 We will generate 20 imputed datasets, each undergoing 25 between-imputation iterations. Analyses will be done in each of the 20 imputed datasets, with estimates pooled according to Rubin’s rules.26 No imputations will be done for outcome measures.

    Two study investigators will independently assess the risk of bias in included studies using an adapted version of the ROBINS-E tool (table 3).27 We will evaluate the following criteria at the study level: (1) selection of participants, (2) measurement of exposure, (3) confounding, (4) postexposure interventions, (5) measurement of outcome and (6) missing data. Any disagreements will be resolved through discussion or consultation with a third study investigator. For each subdomain, we will assign a risk of bias of low, medium, high, very high or uncertain. As no detailed guidance has been developed on providing an overall risk of bias for each study, we will not give an overall risk of bias but rather discuss the potential impacts of identified sources of bias on the interpretation of our findings.

    Table 3

    Risk of bias criteria

    We will assess the existing evidence using categories in the Cochrane Handbook, which considers inconsistency, indirectness, imprecision and bias.28 Bias will be assessed for each individual study using the risk of bias assessment described above, and globally we will assess publication bias visually using Egger plots and forest plots for our two primary outcomes. Inconsistency will be assessed according to the similarity in the magnitude and direction of effects across studies; we will use stratified analyses to evaluate potential sources of heterogeneity. Indirectness will be evaluated based on applicability (tests or evaluations performed, time frame) and participant selection. Imprecision will be measured according to precision/CIs around estimates while considering sources of heterogeneity. Based on the above considerations, the strength of the existing evidence will be graded as high (no concerns with any of the above considerations), moderate, low or very low (based on the number of concerns). In line with Cochrane, we may adjust certainty based on other factors, such as large effects, observing a dose response or the presence of plausible confounding.28

    Unless otherwise specified, we will conduct analyses on the imputed datasets as our primary analysis and complete case analysis as a confirmatory secondary analysis. We will consider stratified analyses on the variables included in table 2, as possible. All analyses will be done using R (The R Project for Statistical Computing, the latest version available at the time of analysis start).

    Objective 1

    We will estimate the overall and subgroup-specific prevalence of TB-associated respiratory impairment and disability in two stages,29 conducting such analysis among participants grouped according to timing of outcome measurement. The timing of outcome measurements analysed will be dictated by the follow-up time points among data contributed to the IPD. Potential subgroups for analysis, described in table 2, have been selected based on evidence from previous reviews, expert opinion of the investigators and biological plausibility. In the first stage, we will use the IPD to estimate the proportion of people with respiratory impairment and disability and the SE within each individual study. In the second stage, we will pool the logit-transformed proportions of impairment and disability across studies using generalised linear mixed models. We will back-transform the pooled estimates and SEs to obtain prevalence estimates and 95% CIs and generate forest plots to compare prevalence estimates across strata.30 31 We will use I2, τ2 and prediction intervals to describe between-study heterogeneity. Subgroups will only be analysed if at least two studies provided information allowing such subgroup analysis.

    Next, we will use one-stage IPD-MA to estimate the prevalence ratio for the outcomes of (1) respiratory impairment and (2) disability across strata. We will use generalised linear mixed log-binomial models with the variables to include in these models as covariates or effect modifiers selected from table 2. We will consider variables in two ways: variables that would only be known at TB treatment initiation and variables that would be known at the end of TB treatment. Further, we will conduct an exploratory analysis which includes the time point of outcome measurement to see how outcomes change over time; we will only include time points for which at least two studies have a measurement. In all analyses, we will account for clustering at the study and participant level. Continuous variables will be included in models using flexible forms, such as splines. Heterogeneity will be assessed with I2 and prediction intervals.32

    Based on findings from a previous aggregate meta-analysis, we estimated the sample size required to estimate a prevalence of 59% for respiratory impairment and 25% for disability, with an estimated average cluster size of 140.9 Under the assumption that the intraclass correlation coefficient between studies was 0.02,33 34 with a power of 80%, type I error rate of 5% and absolute precision of 10%, we will require a sample size of 352 for our outcome of respiratory impairment and 273 for our outcome of disability. This suggests we will likely have sufficient power for our primary outcomes and for several key subgroups. For subgroups where we do not have sufficient power, these will be classified and interpreted as exploratory.

    Objective 2

    We will develop risk prediction models adhering to Transparent Reporting of a Multivariate Model for Individual Prognosis or Diagnosis35 guidance to predict the future risk of TB-associated respiratory impairment and disability. Our first model will aim to predict future TB-associated respiratory impairment and disability at the end of treatment based on demographic and clinical information known at the start of TB treatment. If the data permit, we will also look at time points further in the future and develop two other models: (1) predicting future risk from the end of TB treatment among those who do not already have evidence of TB-associated respiratory impairment and/or disability and (2) predicting persistence from the end of TB treatment among those with evidence of TB-associated respiratory impairment and/or disability. These latter two outcomes are important as existing data suggest up to one-quarter of people who go on to develop TB-associated respiratory impairment and disability are asymptomatic at the end of treatment,36 and it is of interest to determine if there are people with impairment and/or disability at the end of treatment who resolve over time.

    For our prediction models, we will use a sequential method to select candidate predictors. First, we will use a priori selection, based on consensus among investigators, published data and expert opinion, to sequentially include the chosen predictors in a layered fashion.37 Next, we will use elastic net penalised regression models to identify key predictors from among the larger set of variables available to evaluate if our a priori predictors were missing highly predictive variables.38 All models will include random effects for study, which will be considered when generating predictions. We will evaluate predictive performance using discrimination and calibration measures and validate the models using the internal-external cross-validation framework.39

    To estimate the required sample size for risk prediction models, we used a conservative predictive performance estimate of a previous model (0.71)40 and estimated sample size with 10 predictors, intraclass correlation coefficient of 0.02 and 140 average cluster size. If the proportion of survivors with disability is 25%, we require a sample size of 3134 to develop a prediction model; if the prevalence of respiratory impairment is 59%, we require a sample size of 2435.41

    Objective 3

    To improve our understanding of the contribution of TB to respiratory impairment and disability, as opposed to other sociodemographic, clinical and behavioural determinants, this specific aim will include only studies with a comparator group of people who never had TB disease. We will use a stepwise hierarchical modelling approach, which can systematically delineate confounding and mediating factors from our outcome of respiratory impairment and disability.42 Our modelling approach will follow the conceptual framework in figure 1, with factors organised according to how proximate each factor is to our outcome of respiratory impairment and disability.

    We will use generalised linear log-binomial models with random effects for each study. Models will be structured by first including distal determinants and subsequently including intermediate and proximate determinants in a hierarchical fashion. In this framework, the impact of TB on respiratory impairment and disability is mediated by disease-related factors such as bacillary burden and length of treatment. In our final, fully adjusted model, we will estimate the effect of TB on respiratory disability and impairment while accounting for confounders at each hierarchical level, ensuring that adjustments do not control for variables that may act as mediators. By structuring our analysis this way, we aim to isolate the independent effect of TB but acknowledge that some confounders or mediators may be underreported or unavailable. To assess the potential impact of unmeasured confounding, we will calculate E-values to quantify the strength of association that an unmeasured confounder would need to have with both the exposure and outcome to explain away the observed effect.

    A major challenge for any IPD-MA is mapping and harmonising variables due to inconsistencies in data collection methods, variable definitions and missing information across studies. We will partner with Maelstrom Research, a group with recognised expertise in data harmonisation, for rigorous data preprocessing and harmonisation techniques.24 Given the heterogeneity in data collection across studies in all fields, our work on data harmonisation will also help us make recommendations to improve the uniformity, accuracy and completeness of data collection for future studies.

    Authors’ willingness to share data is another barrier, and data for certain regions or subgroups may be challenging to obtain.43 We have budgeted resources to facilitate data sharing for authors from all settings and have established guidelines in our data-sharing agreements to promote transparency and collaboration. We will also establish a data repository, allowing us to include more data as it becomes available. Authors contributing data likely come from large centres, with adequate resources for monitoring and evaluation post-treatment. This may not be generalisable to lower-resource settings; however, we have designed our risk stratification analysis to provide evidence to programmes of all resource levels. Bias related to participant selection, data collection or reporting may affect the validity of our analyses. We are using a robust bias assessment tool to structure our bias and study quality assessment and have implemented inclusion criteria to minimise participant selection. Finally, paediatric subgroup analyses may be limited by small sample size. The inclusion of children and adolescents in this IPD-MA supports advocacy efforts for more data in this group and allows for the inclusion of currently underway studies.

    Our analytical approach also has limitations. When analysing prevalence of respiratory impairment or disability, we are limited by the data available, time points measured and heterogeneity among the included studies. Despite this, we will be evaluating several subgroups which may increase our type I error rate, and so will interpret and report all results with caution. While we propose various approaches to assess and explore characteristics associated with respiratory impairment or disability, it is unlikely all important confounders will be measured at each follow-up, such as smoking, which poses a challenge for time-varying confounders. In addition, while we propose several approaches to explore heterogeneous effects, including subgroup analysis and effect modification, it is unlikely all sources of heterogeneity will be measured as it is unlikely heterogeneity is driven by single, observable characteristics. To address this, we will consider a sensitivity analysis which evaluates heterogeneous treatment effects using Bayesian latent class models to better understand heterogeneity.44

    No direct patient or public involvement has taken place during the development of this protocol. However, we will work with the Community Advisory Boards of the McGill TB Centre (Canada) and the Desmond Tutu TB Centre (South Africa), as well as TB survivor networks and advocacy groups such as STOP TB USA, STOP TB Canada, TB Proof, and the Child and Adolescent Working Group of the WHO/Stop TB Partnership during the interpretation and dissemination of the study results.

    The IPD-MA was approved by institutional ethics review boards at Rhode Island Hospital, USA (2138217-2) and The Research Institute of the McGill University Health Centre, Canada (2024-10345). Individual studies will share deidentified data and follow institutional and national guidelines for data sharing.

    All contributing authors will sign data-sharing agreements with the institution hosting the secure data server (The Research Institute of the McGill University Health Centre), which outlines standards of data security, management and ownership. The initial length of this data-sharing agreement will be five years. All data will be treated as confidential and will remain the property of the contributing institution; data can be withdrawn at any time. All contributing authors will enter a consortium, which collaboratively completes the outlined analyses. JRC will act as the data custodian and will be the primary point of contact between consortium members.

    An oversight committee, consisting of seven members, will be established with the data custodian (JRC) acting as an additional member in a non-voting role. For the first four years, this committee will comprise the other three study principal investigators (SSC, JCJ and MvdZ) and four elected members of the consortium; thereafter, all oversight committee members will be elected. As the data custodian, JRC will remain on the oversight committee in an unelected position with a non-voting role. The oversight committee has three specific roles: (1) review requests for individuals or organisations to have access to the IPD for the purposes of agreed on analyses; (2) review and discuss proposals by members of the consortium for projects or analyses not outlined in this protocol and (3) review opportunities and requests for authorship and/or participation in analyses to ensure contribution and opportunity is equitable among the consortium. Issues deemed significant by the oversight committee may be brought forth to all consortium members for input.

    Given that approximately 1 in 50 people globally has survived pulmonary TB and the growing demand for evidence to inform post-TB care,2 this IPD-MA has the potential to significantly impact recommendations surrounding TB-associated respiratory impairment and disability. We have explicit plans to make the IPD database we establish available to interested research groups and organisations, such as the WHO, to advance research in this field and permit analyses beyond those initially proposed in this protocol. As described in the Patient and public involvement section, we will engage various groups, including TB survivors, advocates, researchers and civil society when planning and disseminating our findings. We plan to publish our results in peer-reviewed academic journals, present our research at conferences and give seminars to stakeholders. As our results can support clinical trial design and recruitment for interventions to prevent or mitigate TB-associated respiratory consequences, we will disseminate our findings within global TB clinical trial networks. To reach TB-affected communities, we will create non-technical summary reports and infographics in multiple languages. Finally, we will create a user-friendly multilingual website to host the risk prediction model(s) we develop.

    In summary, we will conduct an IPD-MA of TB-associated respiratory impairment and disability among TB survivors of all ages to determine who is most at risk, help predict those who might benefit from screening and improve our understanding of TB’s contribution to respiratory impairment and disability. The insights gained from these analyses may enhance strategies for detecting and preventing TB-associated respiratory impairment and disability and inform the design of clinical trials of interventions.

    Not applicable.

    Read the full text or download the PDF: