Common methodological pitfalls in ICI pneumonitis risk prediction studies

Chen, Yichen K.; Welsh, Sarah; Pillay, Ardon M.; Tannenwald, Benjamin; Bliznashki, Kamen; Hutchison, Emmette; Aston, John A. D.; Schönlieb, Carola-Bibiane; Rudd, James H. F.; Jones, James; Roberts, Michael

doi:10.3389/fimmu.2023.1228812

SYSTEMATIC REVIEW article

Front. Immunol. , 25 September 2023

Sec. Cancer Immunity and Immunotherapy

Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.1228812

Common methodological pitfalls in ICI pneumonitis risk prediction studies

Yichen K. Chen¹

Sarah Welsh²

Ardon M. Pillay³

Benjamin Tannenwald⁴

Kamen Bliznashki⁴

Emmette Hutchison⁴

John A. D. Aston⁵

Carola-Bibiane Schönlieb¹

James H. F. Rudd⁶

James Jones⁷

Michael Roberts^1,6*

¹Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
²Department of Surgery, Cambridge University Hospitals, Cambridge, United Kingdom
³School of Clinical Medicine, University of Cambridge, Cambridge, United Kingdom
⁴Digital Health, Oncology R&D, AstraZeneca, Gaithersburg, MD, United States
⁵Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, United Kingdom
⁶Department of Medicine, University of Cambridge, Cambridge, United Kingdom
⁷Department of Oncology, Cambridge University Hospitals, Cambridge, United Kingdom

Background: Pneumonitis is one of the most common adverse events induced by the use of immune checkpoint inhibitors (ICI), accounting for a 20% of all ICI-associated deaths. Despite numerous efforts to identify risk factors and develop predictive models, there is no clinically deployed risk prediction model for patient risk stratification or for guiding subsequent monitoring. We believe this is due to systemic suboptimal approaches in study designs and methodologies in the literature. The nature and prevalence of different methodological approaches has not been thoroughly examined in prior systematic reviews.

Methods: The PubMed, medRxiv and bioRxiv databases were used to identify studies that aimed at risk factor discovery and/or risk prediction model development for ICI-induced pneumonitis (ICI pneumonitis). Studies were then analysed to identify common methodological pitfalls and their contribution to the risk of bias, assessed using the QUIPS and PROBAST tools.

Results: There were 51 manuscripts eligible for the review, with Japan-based studies over-represented, being nearly half (24/51) of all papers considered. Only 2/51 studies had a low risk of bias overall. Common bias-inducing practices included unclear diagnostic method or potential misdiagnosis, lack of multiple testing correction, the use of univariate analysis for selecting features for multivariable analysis, discretization of continuous variables, and inappropriate handling of missing values. Results from the risk model development studies were also likely to have been overoptimistic due to lack of holdout sets.

Conclusions: Studies with low risk of bias in their methodology are lacking in the existing literature. High-quality risk factor identification and risk model development studies are urgently required by the community to give the best chance of them progressing into a clinically deployable risk prediction model. Recommendations and alternative approaches for reducing the risk of bias were also discussed to guide future studies.

1 Introduction

Immune checkpoint inhibitors (ICIs) have dramatically improved outcomes in cancer treatment in the past decade. Success has been seen in melanoma, lung and kidney cancer, although their use is rapidly expanding to other cancer types (1). In addition to their use in advanced cancer, they are also used in the perioperative settings to reduce risk of cancer recurrence (2). ICIs most commonly block the programmed cell death protein 1 (PD-1) and cytotoxic T-lymphocyte–associated antigen 4 (CTLA-4) pathways, key negative regulators of the anti-tumour immune response. Despite the success of ICIs, their mechanism of action means that they can trigger immune reactions against non-tumour, healthy tissues. These ‘immune-related adverse events’ (irAEs) may necessitate stopping treatment, or adding other drugs such as steroids to dampen the immune reaction; in rare cases, irAEs are severe enough to require hospital treatment or lead to patient death (3). Early recognition and ideally prevention of irAEs are therefore key challenges in oncology practice.

One of the more common irAEs that lead to drug discontinuation is pneumonitis, also known as interstitial lung disease (ILD), an inflammation of the lung tissue. It accounts for 20% of all ICI-associated deaths (4). Patients with ICI induced pneumonitis (ICI pneumonitis) most commonly present with symptoms of dyspnoea and cough (53 and 35 percent, respectively), while approximately one-third of patients are asymptomatic (5). More than half of patients with ICI pneumonitis may also present with another immune-related adverse event, such as colitis, dermatitis, or thyroiditis (5).

ICI pneumonitis can be challenging to distinguish from other pathologies such as pulmonary embolus, infection, heart failure or underlying cancer progression (6). Strategies to identify patients at risk of pneumonitis, and to recognize it early are a clinical priority (6).

Since the introduction of ICIs, many studies have been conducted to discover risk factors and to build risk prediction models. Both systematic and non-systematic reviews have been written to identify possible mechanisms for ICI pneumonitis (7), to summarize risk factors (8) and to recommend management strategies (9). A meta-analysis published in 2022 summarized the odds ratios from 35 studies between 2000 and 2022 and identified several risk factors that have significant pooled effect (Table 1) (8). Three studies aimed at building risk prediction models for ICI pneumonitis in human were also published in the same year (10–12). Chao et al. developed a nomogram from a 164-subject dataset; chronic obstructive pulmonary disease (COPD) diagnosis, PD-L1 expression and interleukin 8 (IL-8) levels were included as final predictors for incidence of ICI pneumonitis in non-small cell lung cancer (NSCLC) patients (10). Jia et al. took the nomogram approach as well, with a 209-subject training set, they identified hypertension, ILD emphysema and platelet/lymphocyte ratio (PLR) as model predictors (11). In contrast, Tan et al. trained a deep neural network on 48 subjects to combine pre-ICI imaging and clinical data, which represents the first application of modern machine learning techniques for ICI pneumonitis risk prediction (12).

TABLE 1

Table 1 Risk factors found to have significant pooled odds ratio (OR) in a meta-analysis conducted by Zhou et al. (8).

Despite these efforts, none of the risk factors and risk prediction models have yet translated through to clinical deployment of risk prediction tools. Due to the lack of large-scale high-quality validation studies for the commonly investigated risk factors, the community does not have good evidence to rely on to form a consensual set of risk factors for risk modelling. Findings on individual risk factors are also inconsistent between studies. This inconsistency may be a result of suboptimal methodology, examples include 1) bias in the study populations; 2) difficulty in ICI pneumonitis diagnosis; 3) increased risk of chance findings in small datasets; 4) bias in statistical analysis.

Risk of bias analysis has been reported only in one previous review (8), which, together with other existing reviews, did not provide any detailed assessment on the methodology or the prevalence of bias-prone practices (7, 9, 13–16). Therefore, in the current systematic review, we present a thorough critical appraisal of the methodology of ICI pneumonitis risk factor identification and risk model development studies. Prevalence of bias-prone approaches is quantified as well.

2 Materials and methods

2.1 Search strategy and selection criteria

Published works and preprints were identified using a Python interface for arXiv (arxiv=1.4.2) and R interfaces for PubMed (RISmed=2.3.0), medRxiv (medrxivr=0.0.5) and bioRxiv (also medrxivr=0.0.5). The databases were searched from 1 January 2000 to 30 September 2022. An initial high-level search was performed for risk factor and risk prediction studies on anti-cancer drug-related pneumonitis. The subsequent search had a narrower scope, focusing on risk factor and risk prediction studies for ICI-related pneumonitis and specifically included individual ICIs in the search terms. The only preprint was removed after confirming that it did not investigate ICI pneumonitis.

A two-stage process was then adopted to identify papers that reported risk factors or risk models for ICI pneumonitis; first, the title and abstract were screened followed by a second screen of the full article text.

2.1.1 Stage I: title and abstract screening

Three reviewers (Y.C., S.W., M.R.) determined the relevance of the studies based on titles and abstracts. Each paper was assessed by two reviewers independently. Conflicts were resolved by consensus between the three reviewers.

The inclusion criteria were: 1. Indication of risk factors, biomarkers and/or predictive models for ICI pneumonitis, including comparison of ICI pneumonitis incidence between different patient subgroups. 2. Reporting of imaging characteristics for ICI pneumonitis

2.1.2 Stage II: full-text screening

Four reviewers (Y.C., S.W., M.R., J.J.) determined the relevance of the studies based on the full text. Each paper was assessed by two reviewers independently, conflicts were resolved by consensus between the four reviewers.

In this review, we included any original study that reported: risk factors, predictive biomarkers and/or models for ICI pneumonitis (including a comparison of ICI pneumonitis incidence between different patient subgroups). The analysis must use statistical tests or predictive modelling for inclusion in our review.

2.2 Risk of bias in individual studies

The QUIPS tool (17) was adopted to assess the risk of bias in the risk factor studies. For studies that aim to develop a risk prediction model, the PROBAST (18) method was used to assess the risk of bias. Each study was independently evaluated by two reviewers (Y.C., A.P.), disagreements were resolved by consensus.

2.3 Data analysis

The following information was extracted from the papers (Supplementary Table 1) (1): outcome of interest; (2) country in which the data were collected; (3) type of cancer treated by ICI; (4) whether the study reported risk factors or a risk prediction model; (5) sample size; (6) statistical tests or models used; (7) data pre-processing, including discretization and handling of missing-values (8) method for validation if a study describes a risk prediction model; (9) whether the code for training the model and the trained model was publicly available (only for studies reporting risk prediction models); and (10) whether imaging features were involved and how they were extracted.

The extracted information was then profiled to evaluate the prevalence of each suboptimal practice.

2.4 Keywords in search strategy

2.4.1 Initial search

The initial search looked for the presence of a combination of keywords in the title and abstract of each study. A study was retained if either the title or the abstract contains at least one of: chemotherapy, TKI, tyrosine kinase inhibitor, immune checkpoint inhibitor, immune checkpoint blockade, ICPI, ICI, mTOR, targeted, immune-related; and at least one of: pneumonitis, interstitial lung disease, ILD; as well as one of: biomarker, biomarkers, predictor, predictors, predictive, predict, predicts, prediction, risk factor, risk factors.

2.4.2 ICI-specific search

The secondary search was conducted in the same way but using different search terms. The title or the abstract must contain at least one of: immune checkpoint inhibitor, immune checkpoint blockade, ICPI, ICI, pembrolizumab, nivolumab, cemiplimab, durvalumab, avelumab, atezolizumab, ipilimumab, tremelimumab, immune-related; and at least one of: pneumonitis, interstitial lung disease, ILD; as well as one of: biomarker, biomarkers, predictor, predictors, predictive, predict, predicts, prediction, risk factor, risk factors.

3 Results

3.1 Study selection

711 distinct studies were found in the initial search, and 199 of these were deemed relevant during abstract screening. Of the 199 that were eligible for full-text screening, 51 were retained for discussion in this analysis (Figure 1, selection criteria are detailed in the Methods section). ICI pneumonitis of different grades were investigated in the studies, including any-grade (48/51), grade 2 or above (3/51), grade 3 or above (3/51), and grade 5 (1/51).

FIGURE 1

Figure 1 PRISMA flowchart describing the number of studies identified, excluded at abstract and full text screening, and finally included in the analysis.

50/51 studies aimed to identify baseline or pre-treatment risk factors for developing ICI pneumonitis from clinical data (46/50), imaging data (27/50, all but 2 included clinical data as well) or specialized laboratory tests such as genetics (2/50) and antibody abundance (2/50). 2/50 studies additionally developed a risk prediction model after risk factor identification from clinical data (10, 11). The remaining study focused on risk prediction model development (with clinical and imaging data) and did not investigate the significance of individual risk factors (12).

3.1.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

Most (47/50) studies used private data collected from their authors’ affiliated institutions. The remaining 3/51 studies used public databases (Shah 2020 used VigiBase, Asada 2021 and Bai 2021 used data from FDA Adverse Event Reporting System, FAERS), which have limited availability for clinical variables and diagnostic method. Study sample size ranged between 17 and 40826 (19) (median=169, IQR:93-248), with a mean case:control ratio of 21:100 for any-grade (N=47), 16:100 for grade 2 or above (N=3), 11:100 for grade 3 or above (N=3), 14:100 for grade-5 (N=1).

Close to 50% (24/50) of the studies used data from Japan, followed by USA (10/50), China (10/50), Australia (1/50), South Korea (1/50), Mexico (1/50) and Spain (1/50). 3/51 studies were analysed data from the FAERS database (2/3) and the VigiBase database (1/3) (Figure 2A), both contain drug adverse event reports from multiple countries. In addition, 78% (21/27) of studies that examined imaging-based risk factors for ICI pneumonitis development used only Japanese population (Figure 2B).

FIGURE 2

Figure 2 (A) Number of studies of each type (risk factor only, risk factor with model development, model development only) conducted in each country. (B) Number of studies that used pre-ICI imaging data in each country and each study type (risk factor only, risk factor with model development, model development only). USA: United States. ^✶One of the ten studies also conducted analysis on data from a multinational database (Shah et al., 2020 (20)).

74% (20/27) of imaging-based risk factor studies used computerized tomography (CT) as the sole imaging technique. The remaining 7 did not specify the imaging modality. All the studies relied on manual interpretation to determine radiographic features. 11/27 studies had 2-3 radiologists or pulmonologists reviewing the CT scans, 2/27 studies used a central review committee, 14/27 did not report the number of investigators involved in extracting the imaging findings. Acquisition parameters for the images were described in only 7/27 studies.

A majority of risk factor studies (39/50) conducted analyses on lung cancer, of which 33/38 recruited only NSCLC patients (Figures 3A, B). Melanoma was investigated in 3/50 studies. Acute myeloid leukaemia (AML) was considered in one study (Figure 3A). Cancers in different organs were combined in the analyses from 10/50 studies, 3/10 excluded subjects with lung cancer (Figures 3A, C). One study (19) used data from the FAERS database which contains all the adverse event reports submitted to FDA regardless of cancer type; but the authors did not report the proportion of different cancer types in the data analysed.

FIGURE 3

Figure 3 Number of studies of each type (risk factor, risk factor with model development, model development only) that conducted analyses on different types of cancers. (A) high-level grouping. If a study conducted two analyses each for a different cancer type, the two cancer types are separated by a semicolon. (B) breakdown of lung cancer studies by cancer subtypes. (C) break down of combined-cancer studies by inclusion of lung cancer.

19/50 studies summarized follow-up duration. The median time to onset is shorter than the median follow-up duration in all studies that reported both follow-up time and time to onset.

3.1.2 Studies developing ICI pneumonitis risk prediction models (n=3)

A total of 3 studies attempted to develop risk prediction models (10–12). All of them focused on any-grade ICI pneumonitis and used data collected from the their authors’ affiliated institutions, which were all in China (Figure 2A). Sample sizes for model development were 48 (12), 164 (10) and 209 (11). Only one study had an internal validation (holdout) set (i.e. was not used in cross-validation or bootstrapping), the same study is also the only one that utilized an external validation set (11). The internal and external validation sets consisted of 209 and 172 subjects, respectively.

Imaging features were investigated in 2/3 studies. The modality of imaging was described in one study (12) that used a deep neural network to implicitly and automatically extract relevant CT imaging features. The other study used imaging-based diagnoses (pre-existing ILD and emphysema status) as candidate predictors but did not explicitly state the imaging domain (11).

In terms of cancer type, all models were built on data from lung cancer patients (Figures 3A, B): NSCLC in Chao 2022 and Jia 2022, lung cancer with different histology in Tan 2022.

3.2 Risk of bias

Following the recommendations from the Cochrane Prognosis Methods Group, the Quality In Prognosis Studies (QUIPS) tool (17) was adopted to assess the risk of bias in the risk factor studies. A total of six domains are considered by the QUIPS tool: study participation, study attrition, prognostic factor measurement, outcome measurement, study confounding, and statistical analysis and reporting. In the current analysis, the study attrition domain was considered irrelevant since only five studies were prospective and their primary endpoints were either safety or efficacy rather than ICI pneumonitis development. This means the concept of study completion is ill-defined with respect to ICI pneumonitis onset. In addition, due to the lack of understanding and consensus on potential confounding factors for ICI pneumonitis, the risk of bias due to study confounding was not assessed.

For studies that aim to develop a risk prediction model, the Risk Of Bias Assessment Tool (PROBAST) was used (18). The four domains of focus are: participants, predictors, outcomes and analysis.

3.2.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

3.2.1.1 Study participation

3/50 studies were determined to have high risk of bias (Table 2). The three studies with high risk of bias (19, 20, 36) did not provide any summary statistics on the baseline demographic and clinical characteristics of the participants. The remaining 47/50 studies provided a sufficient description of the source of subjects involved and on the distribution of clinical and demographic characteristics in the population, so were considered to have low risk of bias.

TABLE 2

Table 2 Risk of bias assessment based on QUIPS for the 51 studies that reported risk factors.

3.2.1.2 Prognostic factor measurement

37/50 studies were considered to have low risk of bias for prognostic factor measurement, with clear information on data source and data handling (Table 2). 13/50 studies were evaluated to have high risk of bias due to unclear description on the source of the prognostic factors (2/13, Table 3) (33, 55), use of data-driven discretization that was optimized to maximise discrimination (5/13, Table 3) (11, 39, 44, 53, 58), and lack of information on data processing such as the handling of missing data or discretization (5/13, Table 3) (11, 19, 21, 39, 64). Additionally, 5/13 studies simply excluded subjects with missing data from the analysis when the proportion of missing values were high (> 10% missing, Table 3) (32, 40, 53–55).

TABLE 3

Table 3 Number of risk factor studies judged to have high, moderate and low risk of bias for prognostic factor (risk factor) measurement for different reasons.

3.2.1.3 Outcome measurement

All studies explicitly stated ICI pneumonitis as the outcome of interest, alternative terms used in the studies include: pneumonitis (as a type of immune-related adverse event), immune-related pneumonitis/ILD and exacerbation of interstitial pneumonia after ICI administration. 16/50 studies had sufficient description on ICI pneumonitis diagnosis (although no gold-standard diagnostic test exists) or excluded other possible cause of lung inflammation by design, so were considered to have low risk for bias for outcome measurement (Table 2). 9/50 had moderate risk, where the authors attempted to distinguish ICI pneumonitis from alternative diagnoses such as infection, tumour progression and pre-existing lesions, but there remains a risk of mistaking radiation pneumonitis (RP) for ICI pneumonitis due to prior thoracic radiotherapy or a lack of information on thoracic radiotherapy (Table 4). The remaining 25/50 studies had high risk of bias, 21 of which did not mention isolating ICI pneumonitis from other causes of lung inflammation, 4 had explicitly included RP and other ILD in the definition of ICI pneumonitis (Table 4) (19, 34, 48, 55).

TABLE 4

Table 4 Number of risk factor studies judged to have high, moderate and low risk of bias for outcome measurement for different reasons.

3.2.1.4 Statistical analysis and reporting

8/50 studies were considered to have low risk of bias (Table 2). 21/50 had moderate risk due to lack of clarity in some but not all analytical steps (4/21, Table 5), the use of univariate analysis to select variables for multivariable analysis (11/21, Table 5), and discretization of continuous variables (16/21, Table 5). The remaining 21/50 were high risk studies (Table 5): 11 applied significance test on over 20 predictors in at least one of the analytical steps but used uncorrected p-value < 0.05 as significance threshold; 6 tested the same factor more than once using different discretization thresholds without appropriate multiple testing adjustment; 7 did not provide enough detail to indicate if there was selective reporting of results.

TABLE 5

Table 5 Number of risk factor studies judged to have high, moderate and low risk of bias for statistical analysis and reporting for different reasons.

3.2.2 Studies developing ICI pneumonitis risk prediction models (n=3)

3.2.2.1 Participants

According to PROBAST, all three studies reporting risk prediction models had low risk of bias in terms of participant or study sample selection. They retrospectively included data from cancer patients who were given ICI treatments in hospitals, no bias was identified from the inclusion/exclusion criteria.

3.2.2.2 Predictors

All three studies reporting risk prediction models had low risk of bias introduced by predictors or their assessment. Candidate predictors were all extracted pre-treatment or at baseline without knowledge of outcome data. All the predictors were available in training, internal validation, and the external validation sets when used.

3.2.2.3 Outcome

All three studies had unclear risk of bias for outcome determination due to lack of description on how ICI pneumonitis were distinguished from other alternative diagnoses such as infection and tumour progression.

3.2.2.4 Analysis

All three studies had high risk of bias for data analysis due to low sample size:feature ratio (24 cases and 24 controls for deep learning in Tan 2022) (12), discretization of continuous variables (10, 11), exclusion of missing values when a large proportion of a variable is missing (17% missing in Chao 2022) (10), use of univariate analysis to select predictors (10) and uncorrected optimism in the reported model performance (hyperparameter selection with cross-validation and absence of a holdout set in Tan 2022, univariate feature selection based on full population and lack of a holdout set in Chao 2022) (10, 12).

3.3 Data analysis

3.3.1 Missing data and imputation

3.3.1.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

Missing values were reported in 22/50 studies, the maximum proportion of missing values in a single-variable ranges between 0.2% and 64.7% across the studies (median:17.3%, IQR: 7.5%-36.1%). 4/22 of the studies included the missing values as a separate level in regression analysis (11, 35, 41, 46), 5/22 were unclear about the imputation method used (19, 21, 39, 61, 64), the rest simply excluded subjects with missing values. None of the studies investigated whether the missing values were independent from values in other variables.

3.3.1.2 Studies developing ICI pneumonitis risk prediction models (n=3)

2/3 studies reported missing values and also investigated risk factors. One had a maximum of 17% of values missing in a single variable and excluded samples with missing values from analysis (10); the other included missing values as a separate category during model development but did not mention the frequency of missingness (11). Similar to the risk factors studies, the relationship between the missing values and other variables (except for ICI pneumonitis status) were not examined.

3.3.2 Univariate and multivariable analysis

3.3.2.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

Univariate analysis was performed in 47/50 studies, most of them (26/47) used logistic regression in combination with other tests or by itself: 6/26 studies used contingency and/or two-sample tests to determine variables that should be further assessed by univariate logistic regression (26, 36, 41, 44, 47, 58). One study used contingency test for categorial factors and logistic regression for continuous factors (50). The remaining 19/26 studies used only logistic regression to identify risk factors.

Survival analysis was the second most popular method for univariate analysis, and was performed in 8/47 studies that conducted univariate analysis: 5/8 studies used survival analysis alone to identify risk factors (2 applied the Fine-Gray test, 3 applied Cox proportional hazard model) (40, 46, 53, 61, 63). One study conducted a contingency test and used a Cox proportional hazard model on the same factor (49), the other two used contingency and/or two-sample tests to select variables that should be further assessed in univariate survival modelling (39, 58).

Amongst the remaining studies that conducted univariate analysis, 12 used nothing but contingency and/or two-sample tests, one applied significance test on area under the receiver operating characteristics curve (ROC) (11), and one used general estimating equation (GEE) (56).

46/47 studies that performed univariate analysis used p=0.05 as significance threshold, one implied a threshold of p=0.001 (59). None, except one study, applied multiple testing correction (24).

Multivariable analysis was performed in 31/51 studies, most of them (23/31) used only logistic regression, another 6/31 used only survival modelling (3 with the Fine-Gray test, 3 with the Cox proportional hazard model) (39, 40, 46, 53, 61, 63). One study implied the existence of multivariable analysis for ICI pneumonitis but did not report corresponding results (50), the other applied both logistic regression and Cox proportional hazard model (58). Overall, in either univariate or multivariable analyses, 31/51 studies used logistic regression, 8/51 studies applied survival modelling.

A total of 21/31 studies performing multivariable analysis relied solely (14/21) or partially (7/21) on univariate analysis for feature preselection, so only variables that were below a small p-value (the p-value thresholds ranged between 0.05 and 0.2) or had sufficiently large effect size were passed to the multivariable analysis. 6/31 studies did not conduct any data-driven pre-selection at all (11, 23, 28, 38, 56, 61). 3/31 were unclear on the method of pre-selection (21, 29, 34). All the studies that were partially dependent on univariate analysis for feature selection had additional pre-specified factors. Feature selection during multivariable modelling was conducted in 4/31 studies via stepwise selection (34, 61), shrinkage (43) and p-value cut-off (48).

3.3.2.2 Studies developing ICI pneumonitis risk prediction models (n=3)

Each of the three studies used different tests for univariate analysis. Chao et al. used logistic regression to select predictors for model building (10); Jia et al. used ROC analysis to determine the best dichotomization threshold for continuous variables before multivariable modelling (11); Tan et al. only used univariate contingency and two-sample tests to compare the baseline characteristics between ICI pneumonitis and control subject (12).

The risk prediction models were all multivariable. 2/3 were based on logistic regression (10, 11), they all investigated risk factors as well. The remaining paper experimented different deep learning approaches (unimodal with either clinical factors or CT images alone, multimodal combining the two, and each of these approaches enhanced by contrastive learning) (12).

4 Discussion

4.1 Study aims

In this analysis, we found that almost all of papers on ICI pneumonitis prediction focused on statistical significance and effect sizes of risk factors rather than risk model development. While statistical significance and the effect sizes (e.g. odds ratio and hazard ratio) may inform clinicians about the relative risk of ICI pneumonitis development for a patient, a clinical decision made without an estimate of the absolute risk can be suboptimal: a 2-fold increase in relative risk could represent a change of absolute risk from 40% to 80%, or from 5% to 10%. We suggest that future studies should particularly focus on validation of existing risk factors and the development of risk prediction models. Both types of study require a much larger sample size than the existing studies (median = 169) for the findings to be considered reliable enough for clinical application. For example, the Liverpool Lung Project (LLP) lung cancer risk prediction model was developed on 1736 subjects and validated on two large cohorts with sample sizes of 2922 and 7652, respectively (67, 68). Furthermore, since distinguishing low grade from high grade ICI pneumonitis is critical for ICI management decisions and the urgency of treatment, risk factors and models that predict high-grade pneumonitis and their time of onset would be highly valuable. However, due to the rarity of high-grade ICI pneumonitis, single-centre studies may find the sample sizes required to be impractical. Multi-national and multi-centre collaboration may be an attractive option in this case.

4.2 Dataset considered

In terms of the datasets considered, 47% of studies (24/51, Figure 2) were conducted using data from Japan. The proportion increases to 71% (21/28, Figure 2) if just the imaging-based studies are considered. This could be due to historically high incidence rates of drug-related ILD in Japan (69). The over-representation of Japanese patient data may lead to inaccurate risk prediction in other countries and ethnicities as models fail to generalize (70). Studies including non-Japanese populations should therefore be encouraged. Most (39/51) studies had specific focus on lung cancer patients, and we suggest that future studies may also concentrate on discovering and validating risk factors and models in non-lung cancer populations to identify whether any generalise to other cancers.

In studies that investigated imaging-based risk factors, all but one study used imaging features that were derived from manually identified abnormal radiological patterns in the lungs. Although the studies sought agreement between multiple radiologists or pulmonologists, subjective bias may still exist and lead to inaccuracy in the labels due to varying levels of experience and training. Automated tools should be developed to identify the radiological patterns to limit such bias.

4.3 Outcome definition

One of the main observations from the risk of bias analysis is the between-study inconsistency in the definition of the ICI pneumonitis population and a lack of description on how ICI pneumonitis was distinguished from other diagnoses. This reflects the lack of gold-standard diagnostic criteria for ICI pneumonitis.

Regardless of the cause, patients experiencing pneumonitis will undergo the same or similar clinical management and differentiating the cause of pneumonitis may not necessarily add value to clinical decision-making during its direct management. However, for cancer patients, accurate identification of the cause of pneumonitis in similar subjects is crucial, as if severe ICI induced pneumonitis is suspected this may lead to discontinuation of an effective treatment.

Further studies should endeavour to accurately establish the cause and type of pneumonitis, and document the associated risk factors, so that the community can build a better understanding of managing this clinical situation

4.4 Statistical analysis

The risk of bias analysis also revealed some suboptimal techniques in the statistical analysis of the reviewed studies. One prevalent method was the use of univariate analysis to select variables to include in the multivariable analysis, this was observed in 21/31 studies reporting risk factors from multivariable analysis and 1/3 studies reporting risk model development. With this approach, variables that are only informative after controlling for other variables will be dropped out from the final model (71). This phenomenon was observed in one of the studies in our review: Uchida et al. yielded insignificant univariate result for association between pre-existing ILD and risk of symptomatic ICI pneumonitis, but when adjusted for lung metastasis, the association became statistically significant (62). A better alternative would be a step-wise regression or a sparse regularized model (e.g. Least Absolute Shrinkage and Selection Operator (72)).

The widespread lack of multiple testing correction (in all but one study) was another common source of bias when many potential risk factors were simultaneously tested. The bias was exaggerated when the same factors were tested multiple times with different discretization thresholds. For instance, under the assumption of independence between the risk factors, a study should expect at least one false discovery when more than 20 factors are tested at a significance threshold of p = 0.05. This can lead to optimistic and non-reproducible results. To reduce the Type I error, i.e. when a null hypothesis is rejected when it is actually true, methods such as Benjamini-Hochberg correction and Bonferroni correction could be employed (72).

In studies that report risk model development, the most concerning source of optimism came from ill-defined cross-validation where univariate feature selection and data-driven feature transformation (e.g. feature dichotomization based on univariate ROC analysis) was conducted on the entire population rather than the training set in each round of cross-validation. Similarly, reporting only the cross-validation performance after it has been used in hyperparameter optimisation can also lead to an overestimated model performance. To obtain the least biased estimate on model performance, investigators should ideally use internal and (whenever possible) external holdout sets that have not been exposed to any part of model optimisation.

Many (13/22) of the reviewed studies that contained missing values simply discarded the observations from corresponding analyses. This approach assumes that the missing values are randomly distributed and are not related to the outcome or other potential risk factors; this assumption was not verified in any of the studies. As a result, the studies that had sizable numbers of missing values were likely to contain biased results due to failure to account for informative missing values. Statistical power can also be jeopardized due to the reduction in sample size. Future studies should explain and check assumptions on the distribution of missing values, reasons for missingness, and use appropriate imputation methods before considering excluding observations (73). One way to check the informativeness of missing values is to include them as a separate category in the regression analysis, as done by four studies reviewed in this analysis. However, this approach requires discretizing continuous variables, which itself introduces bias (74), so should be discouraged for continuous variables. In places where missing values have been imputed or excluded, non-linear trends involving continuous variables should be investigated using techniques such as spline regression rather than with discretization (74).

We noted the overwhelming popularity of logistic regression in the risk factor analysis studies (55%, either univariate or multivariable) and in risk model development studies (75%). The results from these studies could be interpreted as identifying the odds ratios for (or predicting the risk of) developing ICI pneumonitis before the last follow-up or death regardless of the timing of the events. This limits clinical utility, as the user would not know whether the intervention should be urgent. Recorded follow-up time ranges from days to months, making survival analysis a better and clinically-actionable alternative.

5 Conclusion

Overall, our work highlighted several common methodological pitfalls in ICI pneumonitis risk factor identification and risk model development studies, covering areas from diagnosis of ICI pneumonitis to statistical analysis and risk modelling. The majority of the studies considered here are likely to have reported biased results due to those pitfalls. We also provided recommendations and alternative approaches for reducing the risk of bias. Studies with low risk of bias in all domains are lacking in the existing literature (only 2/51, Table 2), high-quality risk factor identification and especially risk model development studies (i.e. predicting absolute rather than relative risk) are urgently required by the community in order to progress into formation of a clinical deployable risk prediction model.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

Original idea (SW/MR), paper searching (YC), title/abstract/full-text screen (YC/SW/MR/JJ), data extraction and review of bias (YC/AP), draft manuscript (YC/JR/JJ/MR). All authors contributed to the article and approved the submitted version.

Acknowledgments

The authors are grateful for the following funding: AstraZeneca (YC), the EU/EFPIA Innovative Medicines Initiative project DRAGON (101005122) (C-BS, MR), the Trinity Challenge (C-BS, MR), the EPSRC Cambridge Mathematics of Information in Healthcare Hub EP/T017961/1 (JA, C-BS, JR, MR), the Cantab Capital Institute for the Mathematics of Information (C-BS), the European Research Council under the European Union’s Horizon 2020 research and innovation programme grant agreement no. 777826 (C-BS), the Alan Turing Institute (C-BS), Wellcome Trust (JR), Cancer Research UK Cambridge Centre (C9685/A25177) (C-BS), British Heart Foundation (JR), the NIHR Cambridge Biomedical Research Centre (JR), HEFCE (JR). In addition, C-BS acknowledges support from the Leverhulme Trust project on ‘Breaking the non-convexity barrier’, the Philip Leverhulme Prize, the EPSRC grants EP/S026045/1 and EP/T003553/1 and the Wellcome Innovator Award RG98755.

Conflict of interest

Authors BT, KB and EH were employed by the company AstraZeneca.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

This study received funding from AstraZeneca. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1228812/full#supplementary-material

References

1. Hargadon KM, Johnson CE, Williams CJ. Immune checkpoint blockade therapy for cancer: An overview of FDA-approved immune checkpoint inhibitors. Int Immunopharmacology (2018) 62:29–39. doi: 10.1016/j.intimp.2018.06.001

CrossRef Full Text | Google Scholar

2. Choueiri TK, Tomczak P, Park SH, Venugopal B, Ferguson T, Chang Y-H, et al. Adjuvant pembrolizumab after nephrectomy in renal-cell carcinoma. New Engl J Med (2021) 385(8):683–94. doi: 10.1056/NEJMoa2106391

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Haanen J, Obeid M, Spain L, Carbonnel F, Wang Y, Robert C, et al. Management of toxicities from immunotherapy: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up☆. Ann Oncol (2022) 33(12):1217–38. doi: 10.1016/j.annonc.2022.10.001

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Wang DY, Salem JE, Cohen JV, Chandra S, Menzer C, Ye F, et al. Fatal toxic effects associated with immune checkpoint inhibitors: A systematic review and meta-analysis. JAMA Oncol (2018) 4(12):1721–8. doi: 10.1001/jamaoncol.2018.3923

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Naidoo J, Wang X, Woo KM, Iyriboz T, Halpenny D, Cunningham J, et al. Pneumonitis in patients treated with anti–programmed death-1/programmed death ligand 1 therapy. J Clin Oncol (2016) 35(7):709–17. doi: 10.1200/JCO.2016.68.2005

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Reuss JE, Suresh K, Naidoo J. Checkpoint inhibitor pneumonitis: mechanisms, characteristics, management strategies, and beyond. Curr Oncol Rep (2020) 22(6):56. doi: 10.1007/s11912-020-00920-z

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Zhai X, Zhang J, Tian Y, Li J, Jing W, Guo H, et al. The mechanism and risk factors for immune checkpoint inhibitor pneumonitis in non-small cell lung cancer patients. Cancer Biol Med (2020) 17(3):599–611. doi: 10.20892/j.issn.2095-3941.2020.0102

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Zhou P, Zhao X, Wang G. Risk factors for immune checkpoint inhibitor-related pneumonitis in cancer patients: A systemic review and meta-analysis. Respiration (2022) 101(11):1–16. doi: 10.1159/000526141

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Kim S, Lim JU. Immune checkpoint inhibitor-related interstitial lung disease in patients with advanced non-small cell lung cancer: systematic review of characteristics, incidence, risk factors, and management. J Thorac Disease (2022) 14(5):1684–95. doi: 10.21037/jtd-22-93

CrossRef Full Text | Google Scholar

10. Chao Y, Zhou J, Hsu S, Ding N, Li J, Zhang Y, et al. Risk factors for immune checkpoint inhibitor-related pneumonitis in non-small cell lung cancer. Transl Lung Cancer Res (2022) 11(2):295–306. doi: 10.21037/tlcr-22-72

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Jia X, Chu X, Jiang L, Li Y, Zhang Y, Mao Z, et al. Predicting checkpoint inhibitors pneumonitis in non-small cell lung cancer using a dynamic online hypertension nomogram. Lung Cancer (2022) 170:74–84. doi: 10.1016/j.lungcan.2022.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Tan P, Huang W, Wang L, Deng G, Yuan Y, Qiu S, et al. Deep learning predicts immune checkpoint inhibitor-related pneumonitis from pretreatment computed tomography images. Front Physiol (2022) 13:978222–. doi: 10.3389/fphys.2022.978222

PubMed Abstract | CrossRef Full Text | Google Scholar

13. von Itzstein MS, Khan S, Gerber DE. Investigational biomarkers for checkpoint inhibitor immune-related adverse event prediction and diagnosis. Clin Chem (2020) 66(6):779–93. doi: 10.1093/clinchem/hvaa081

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Xu Y, Fu Y, Zhu B, Wang J, Zhang B. Predictive biomarkers of immune checkpoint inhibitors-related toxicities. Front Immunol (2020) 11:. doi: 10.3389/fimmu.2020.02023

CrossRef Full Text | Google Scholar

15. Chennamadhavuni A, Abushahin L, Jin N, Presley CJ, Manne A. Risk factors and biomarkers for immune-related adverse events: A practical guide to identifying high-risk patients and rechallenging immune checkpoint inhibitors. Front Immunol (2022) 13:779691. doi: 10.3389/fimmu.2022.779691

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Zhang Y, Zhang X, Li W, Du Y, Hu W, Zhao J. Biomarkers and risk factors for the early prediction of immune-related adverse events: a review. Hum Vaccin Immunother (2022) 18(1):2018894. doi: 10.1080/21645515.2021.2018894

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Hayden JA, van der Windt DA, Cartwright JL, Côté P, Bombardier C. Assessing bias in studies of prognostic factors. Ann Intern Med (2013) 158(4):280–6. doi: 10.7326/0003-4819-158-4-201302190-00009

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A tool to assess the risk of bias and applicability of prediction model studies. Ann Internal Med (2019) 170(1):51. doi: 10.7326/M18-1376

CrossRef Full Text | Google Scholar

19. Asada M, Mikami T, Niimura T, Zamami Y, Uesawa Y, Chuma M, et al. The risk factors associated with immune checkpoint inhibitor-related pneumonitis. Oncology (2021) 99(4):256–9. doi: 10.1159/000512633

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Shah KP, Song H, Ye F, Moslehi JJ, Balko JM, Salem JE, et al. Demographic factors associated with toxicity in patients treated with anti-programmed cell death-1 therapy. Cancer Immunol Res (2020) 8(7):851–5. doi: 10.1158/2326-6066.CIR-19-0986

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Fujimoto D, Yoshioka H, Kataoka Y, Morimoto T, Kim YH, Tomii K, et al. Efficacy and safety of nivolumab in previously treated patients with non-small cell lung cancer: A multicenter retrospective cohort study. Lung Cancer (2018) 119:14–20. doi: 10.1016/j.lungcan.2018.02.017

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Cui P, Liu Z, Wang G, Ma J, Qian Y, Zhang F, et al. Risk factors for pneumonitis in patients treated with anti-programmed death-1 therapy: A case-control study. Cancer Med (2018) 7(8):4115–20. doi: 10.1002/cam4.1579

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Nakahama K, Tamiya A, Isa SI, Taniguchi Y, Shiroyama T, Suzuki H, et al. Association between imaging findings of airway obstruction adjacent to lung tumors and the onset of interstitial lung disease after nivolumab. In Vivo (2018) 32(4):887–91. doi: 10.21873/invivo.11324

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Owen DH, Wei L, Bertino EM, Edd T, Villalona-Calero MA, He K, et al. Incidence, risk factors, and effect on survival of immune-related adverse events in patients with non-small-cell lung cancer. Clin Lung Cancer (2018) 19(6):e893–900. doi: 10.1016/j.cllc.2018.08.008

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Suresh K, Voong KR, Shankar B, Forde PM, Ettinger DS, Marrone KA, et al. Pneumonitis in non-small cell lung cancer patients receiving immune checkpoint immunotherapy: incidence and risk factors. J Thorac Oncol (2018) 13(12):1930–9. doi: 10.1016/j.jtho.2018.08.2035

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Cho JY, Kim J, Lee JS, Kim YJ, Kim SH, Lee YJ, et al. Characteristics, incidence, and risk factors of immune checkpoint inhibitor-related pneumonitis in patients with non-small cell lung cancer. Lung Cancer (2018) 125:150–6. doi: 10.1016/j.lungcan.2018.09.015

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Yamaguchi T, Shimizu J, Hasegawa T, Horio Y, Inaba Y, Yatabe Y, et al. Pre-existing pulmonary fibrosis is a risk factor for anti-PD-1-related pneumonitis in patients with non-small cell lung cancer: A retrospective analysis. Lung Cancer (2018) 125:212–7. doi: 10.1016/j.lungcan.2018.10.001

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Sukari A, Nagasaka M, Alhasan R, Patel D, Wozniak A, Ramchandren R, et al. Cancer site and adverse events induced by immune checkpoint inhibitors: A retrospective analysis of real-life experience at a single institution. Anticancer Res (2019) 39(2):781–90. doi: 10.21873/anticanres.13175

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Davila-Dupont D, Motola-Kuba D, Dorantes-Heredia R, Gonzalez-Alonso BK, Alcantara-Velarde T, Garcia-Santisteban R, et al. Risk of pneumonitis with the use of different immune checkpoint inhibitors in a mexican population. Oncology (2019) 96(5):268–72. doi: 10.1159/000497405

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Shibaki R, Murakami S, Matsumoto Y, Goto Y, Kanda S, Horinouchi H, et al. Tumor expression and usefulness as a biomarker of programmed death ligand 1 in advanced non-small cell lung cancer patients with preexisting interstitial lung disease. Med Oncol. (2019) 36(6):49–. doi: 10.1007/s12032-019-1274-0

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Duma N, Abdel-Ghani A, Yadav S, Hoversten KP, Reed CT, Sitek AN, et al. Sex Differences in Tolerability to Anti-Programmed Cell Death Protein 1 Therapy in Patients with Metastatic Melanoma and Non-Small Cell Lung Cancer: Are We All Equal? Oncologist. (2019) 24(11):e1148–e55. doi: 10.1634/theoncologist.2019-0094

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Nakanishi Y, Masuda T, Yamaguchi K, Sakamoto S, Horimasu Y, Nakashima T, et al. Pre-existing interstitial lung abnormalities are risk factors for immune checkpoint inhibitor-induced interstitial lung disease in non-small cell lung cancer. Respir Investig (2019) 57(5):451–9. doi: 10.1016/j.resinv.2019.05.002

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Komiya K, Nakamura T, Abe T, Ogusu S, Nakashima C, Takahashi K, et al. Discontinuation due to immune-related adverse events is a possible predictive factor for immune checkpoint inhibitors in patients with non-small cell lung cancer. Thorac Cancer (2019) 10(9):1798–804. doi: 10.1111/1759-7714.13149

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Fukihara J, Sakamoto K, Koyama J, Ito T, Iwano S, Morise M, et al. Prognostic impact and risk factors of immune-related pneumonitis in patients with non-small-cell lung cancer who received programmed death 1 inhibitors. Clin Lung Cancer (2019) 20(6):442–50.e4. doi: 10.1016/j.cllc.2019.07.006

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Tone M, Izumo T, Awano N, Kuse N, Inomata M, Jo T, et al. High mortality and poor treatment efficacy of immune checkpoint inhibitors in patients with severe grade checkpoint inhibitor pneumonitis in non-small cell lung cancer. Thorac Cancer (2019) 10(10):2006–12. doi: 10.1111/1759-7714.13187

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Tahir SA, Gao J, Miura Y, Blando J, Tidwell RSS, Zhao H, et al. Autoimmune antibodies correlate with immune checkpoint therapy-induced toxicities. Proc Natl Acad Sci U S A (2019) 116(44):22246–51. doi: 10.1073/pnas.1908079116

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Nishiyama N, Honda T, Sema M, Kawahara T, Jin Y, Natsume I, et al. The utility of ground-glass attenuation score for anticancer treatment-related acute exacerbation of interstitial lung disease among lung cancer patients with interstitial lung disease. Int J Clin Oncol (2020) 25(2):282–91. doi: 10.1007/s10147-019-01576-x

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Sugano T, Seike M, Saito Y, Kashiwada T, Terasaki Y, Takano N, et al. Immune checkpoint inhibitor-associated interstitial lung diseases correlate with better prognosis in patients with advanced non-small-cell lung cancer. Thorac Cancer (2020) 11(4):1052–60. doi: 10.1111/1759-7714.13364

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Suzuki Y, Karayama M, Uto T, Fujii M, Matsui T, Asada K, et al. Assessment of immune-related interstitial lung disease in patients with NSCLC treated with immune checkpoint inhibitors: A multicenter prospective study. J Thorac Oncol (2020) 15(8):1317–27. doi: 10.1016/j.jtho.2020.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Li M, Spakowicz D, Zhao S, Patel SH, Johns A, Grogan M, et al. Brief report: inhaled corticosteroid use and the risk of checkpoint inhibitor pneumonitis in patients with advanced cancer. Cancer Immunol Immunother (2020) 69(11):2403–8. doi: 10.1007/s00262-020-02674-w

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Okada N, Matsuoka R, Sakurada T, Goda M, Chuma M, Yagi K, et al. Risk factors of immune checkpoint inhibitor-related interstitial lung disease in patients with lung cancer: a single-institution retrospective study. Sci Rep (2020) 10(1):13773–. doi: 10.1038/s41598-020-70743-2

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Ikeda S, Kato T, Kenmotsu H, Ogura T, Iwasawa S, Sato Y, et al. A phase 2 study of atezolizumab for pretreated NSCLC with idiopathic interstitial pneumonitis. J Thorac Oncol (2020) 15(12):1935–42. doi: 10.1016/j.jtho.2020.08.018

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Moda M, Saito H, Kato T, Usui R, Kondo T, Nakahara Y, et al. Tumor invasion in the central airway is a risk factor for early-onset checkpoint inhibitor pneumonitis in patients with non-small cell lung cancer. Thorac Cancer (2020) 11(12):3576–84. doi: 10.1111/1759-7714.13703

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Chu X, Zhao J, Zhou J, Zhou F, Jiang T, Jiang S, et al. Association of baseline peripheral-blood eosinophil count with immune checkpoint inhibitor-related pneumonitis and clinical outcomes in patients with non-small cell lung cancer receiving immune checkpoint inhibitors. Lung Cancer (2020) 150:76–82. doi: 10.1016/j.lungcan.2020.08.015

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Shimoji K, Masuda T, Yamaguchi K, Sakamoto S, Horimasu Y, Nakashima T, et al. Association of preexisting interstitial lung abnormalities with immune checkpoint inhibitor–induced interstitial lung disease among patients with nonlung cancers. JAMA Netw Open (2020) 3(11):e2022906–e. doi: 10.1001/jamanetworkopen.2020.22906

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Isono T, Kagiyama N, Takano K, Hosoda C, Nishida T, Kawate E, et al. Outcome and risk factor of immune-related adverse events and pneumonitis in patients with advanced or postoperative recurrent non-small cell lung cancer treated with immune checkpoint inhibitors. Thorac Cancer (2020) 12(2):153–64. doi: 10.1111/1759-7714.13736

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Zhang C, Gao F, Jin S, Gao W, Chen S, Guo R. Checkpoint inhibitor pneumonitis in Chinese lung cancer patients: clinical characteristics and risk factors. Ann Palliat Med (2020) 9(6):3957–65. doi: 10.21037/apm-20-1823

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Atchley WT, Alvarez C, Saxena-Beem S, Schwartz TA, Ishizawar RC, Patel KP, et al. Immune checkpoint inhibitor-related pneumonitis in lung cancer: real-world incidence, risk factors, and management practices across six health care centers in north carolina. Chest (2021) 160(2):731–42. doi: 10.1016/j.chest.2021.02.032

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Zhou J, Zhao J, Jia Q, Chu Q, Zhou F, Chu X, et al. Peripheral blood autoantibodies against to tumor-associated antigen predict clinical outcome to immune checkpoint inhibitor-based treatment in advanced non-small cell lung cancer. Front Oncol (2021) 11:625578–. doi: 10.3389/fonc.2021.625578

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Bai S, Tian T, Pacheco JM, Tachihara M, Hu P, Zhang J. Immune-related adverse event profile of combination treatment of PD-(L)1 checkpoint inhibitors and bevacizumab in non-small cell lung cancer patients: data from the FDA adverse event reporting system. Transl Lung Cancer Res (2021) 10(6):2614–24. doi: 10.21037/tlcr-21-464

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Lin X, Deng H, Yang Y, Wu J, Qiu G, Li S, et al. Peripheral blood biomarkers for early diagnosis, severity, and prognosis of checkpoint inhibitor-related pneumonitis in patients with lung cancer. Front Oncol (2021) 11:698832–. doi: 10.3389/fonc.2021.698832

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Yamaguchi T, Shimizu J, Hasegawa T, Horio Y, Inaba Y, Hanai N, et al. Pre-existing interstitial lung disease is associated with onset of nivolumab-induced pneumonitis in patients with solid tumors: a retrospective analysis. BMC Cancer (2021) 21(1):924. doi: 10.1186/s12885-021-08661-3

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Yamamoto N, Nakanishi Y, Gemma A, Nakagawa K, Sakamoto T, Akamatsu A, et al. Real-world safety of nivolumab in patients with non-small-cell lung cancer in Japan: Postmarketing surveillance. Cancer Sci (2021) 112(11):4692–701. doi: 10.1111/cas.15117

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Sierra-Rodero B, Cruz-Bermúdez A, Nadal E, Garitaonaindía Y, Insa A, Mosquera J, et al. Clinical and molecular parameters associated to pneumonitis development in non-small-cell lung cancer patients receiving chemoimmunotherapy from NADIM trial. J Immunother Cancer (2021) 9(8). doi: 10.1136/jitc-2021-002804

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Ichimura T, Hinata M, Ichikura D, Suzuki S. Safety of immune checkpoint inhibitors in non-small-cell lung cancer patients with idiopathic interstitial pneumonia: a matched case-control study. Cancer Chemother Pharmacol (2021) 89(1):21–30. doi: 10.1007/s00280-021-04362-7

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Reuss JE, Brigham E, Psoter KJ, Voong KR, Shankar B, Ettinger DS, et al. Pretreatment lung function and checkpoint inhibitor pneumonitis in NSCLC. JTO Clin Res Rep (2021) 2(10):100220–. doi: 10.1016/j.jtocrr.2021.100220

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Yamaguchi T, Shimizu J, Oya Y, Watanabe N, Hasegawa T, Horio Y, et al. Risk factors for pneumonitis in patients with non-small cell lung cancer treated with immune checkpoint inhibitors plus chemotherapy: A retrospective analysis. Thorac Cancer (2022) 13(5):724–31. doi: 10.1111/1759-7714.14308

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Wang H, Zhou F, Zhao C, Cheng L, Zhou C, Qiao M, et al. Interleukin-10 is a promising marker for immune-related adverse events in patients with non-small cell lung cancer receiving immunotherapy. Front Immunol (2022) 13:840313. doi: 10.3389/fimmu.2022.840313

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Ohe Y, Yamazaki N, Yamamoto N, Murakami H, Yoh K, Kitano S, et al. The real-world safety of atezolizumab as second-line or later treatment in Japanese patients with non-small-cell lung cancer: a post-marketing surveillance study. Jpn J Clin Oncol (2022) 52(6):623–32. doi: 10.1093/jjco/hyac024

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Xu H, Feng H, Zhang W, Wei F, Zhou L, Liu L, et al. Prediction of immune-related adverse events in non-small cell lung cancer patients treated with immune checkpoint inhibitors based on clinical and hematological markers: Real-world evidence. Exp Cell Res (2022) 416(1):113157–. doi: 10.1016/j.yexcr.2022.113157

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Sheshadri A, Goizueta AA, Shannon VR, London D, Garcia-Manero G, Kantarjian HM, et al. Pneumonitis after immune checkpoint inhibitor therapies in patients with acute myeloid leukemia: A retrospective cohort study. Cancer (2022) 128(14):2736–45. doi: 10.1002/cncr.34229

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Uchida Y, Kinose D, Nagatani Y, Tanaka-Mizuno S, Nakagawa H, Fukunaga K, et al. Risk factors for pneumonitis in advanced extrapulmonary cancer patients treated with immune checkpoint inhibitors. BMC Cancer (2022) 22(1):551–. doi: 10.1186/s12885-022-09642-w

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Uhara H, Tsuchida T, Kiyohara Y, Akamatsu A, Sakamoto T, Yamazaki N. Safety and effectiveness of nivolumab in Japanese patients with Malignant melanoma: Final analysis of a post-marketing surveillance. J Dermatol (2022) 49(9):862–71. doi: 10.1111/1346-8138.16432

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Ikeda S, Kato T, Kenmotsu H, Ogura T, Sato Y, Hino A, et al. Atezolizumab for pretreated non-small cell lung cancer with idiopathic interstitial pneumonia: final analysis of phase II AMBITIOUS study. Oncologist (2022) 27(9):720–e02. doi: 10.1093/oncolo/oyac118

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Abed A, Law N, Calapre L, Lo J, Bhat V, Bowyer S, et al. Human leucocyte antigen genotype association with the development of immune-related adverse events in patients with non-small cell lung cancer treated with single agent immunotherapy. Eur J Cancer (2022) 172:98–106. doi: 10.1016/j.ejca.2022.05.021

PubMed Abstract | CrossRef Full Text | Google Scholar

66. Lu X, Wang J, Zhang T, Zhou Z, Deng L, Wang X, et al. Comprehensive pneumonitis profile of thoracic radiotherapy followed by immune checkpoint inhibitor and risk factors for radiation recall pneumonitis in lung cancer. Front Immunol (2022) 13:918787–. doi: 10.3389/fimmu.2022.918787

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Raji OY, Duffy SW, Agbaje OF, Baker SG, Christiani DC, Cassidy A, et al. Predictive accuracy of the Liverpool Lung Project risk model for stratifying patients for computed tomography screening for lung cancer: a case-control and cohort validation study. Ann Intern Med (2012) 157(4):242–50. doi: 10.7326/0003-4819-157-4-201208210-00004

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Cassidy A, Myles JP, van Tongeren M, Page RD, Liloglou T, Duffy SW, et al. The LLP risk model: an individual risk prediction model for lung cancer. Br J Cancer (2008) 98(2):270–6. doi: 10.1038/sj.bjc.6604158

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Azuma A. High prevalence of drug-induced pneumonia in Japan. JMA Journal. (Tokyo, Japan: Japan Medical Association and the Japanese Association of Medical Sciences). (2007) 50(5):405–411.

Google Scholar

70. Toll DB, Janssen KJM, Vergouwe Y, Moons KGM. Validation, updating and impact of clinical prediction rules: A review. J Clin Epidemiol (2008) 61(11):1085–94. doi: 10.1016/j.jclinepi.2008.04.008

PubMed Abstract | CrossRef Full Text | Google Scholar

71. Sun G-W, Shook TL, Kay GL. Inappropriate use of bivariable analysis to screen risk factors for use in multivariable analysis. J Clin Epidemiol (1996) 49(8):907–16. doi: 10.1016/0895-4356(96)00025-X

PubMed Abstract | CrossRef Full Text | Google Scholar

72. James G, Witten D, Hastie T, Tibshirani R. An introduction to statistical learning. New York, NY: Springer (2013).

Google Scholar

73. van Ginkel JR, Linting M, Rippe RCA, van der Voort A. Rebutting existing misconceptions about multiple imputation as a method for handling missing data. J Pers Assessment (2020) 102(3):297–308. doi: 10.1080/00223891.2018.1530680

CrossRef Full Text | Google Scholar

74. Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med (2006) 25(1):127–41. doi: 10.1002/sim.2331

PubMed Abstract | CrossRef Full Text | Google Scholar

75. Dávila-Dupont D, Motola-Kuba D, Dorantes-Heredia R, González-Alonso BK, Alcántara-Velarde T, García-Santisteban R, Martínez-Sámano JE, Grimaldo-Roque HJ, Ruiz-Morales JM, et al. Risk of Pneumonitis with the Use of Different Immune Checkpoint Inhibitors in a Mexican Population. Oncology (2019) 96(5):268–272. doi: 10.1159/000497405

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: immune checkpoint inhibitor induced pneumonitis, review of methodology, risk of bias assessment, study design, statistical analysis

Citation: Chen YK, Welsh S, Pillay AM, Tannenwald B, Bliznashki K, Hutchison E, Aston JAD, Schönlieb C-B, Rudd JHF, Jones J and Roberts M (2023) Common methodological pitfalls in ICI pneumonitis risk prediction studies. Front. Immunol. 14:1228812. doi: 10.3389/fimmu.2023.1228812

Received: 25 May 2023; Accepted: 04 September 2023;
Published: 25 September 2023.

Edited by:

Vassiliki A. Boussiotis, Harvard Medical School, United States

Reviewed by:

Ajay Sheshadri, University of Texas MD Anderson Cancer Center, United States
Jennifer Possick, Yale University, United States
Stefania Canova, San Gerardo Hospital, Italy

Copyright © 2023 Chen, Welsh, Pillay, Tannenwald, Bliznashki, Hutchison, Aston, Schönlieb, Rudd, Jones and Roberts. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Michael Roberts, bXI4MDhAY2FtLmFjLnVr

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Common methodological pitfalls in ICI pneumonitis risk prediction studies

1 Introduction

2 Materials and methods

2.1 Search strategy and selection criteria

2.1.1 Stage I: title and abstract screening

2.1.2 Stage II: full-text screening

2.2 Risk of bias in individual studies

2.3 Data analysis

2.4 Keywords in search strategy

2.4.1 Initial search

2.4.2 ICI-specific search

3 Results

3.1 Study selection

3.1.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

3.1.2 Studies developing ICI pneumonitis risk prediction models (n=3)

3.2 Risk of bias

3.2.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

3.2.1.1 Study participation

3.2.1.2 Prognostic factor measurement

3.2.1.3 Outcome measurement

3.2.1.4 Statistical analysis and reporting

3.2.2 Studies developing ICI pneumonitis risk prediction models (n=3)

3.2.2.1 Participants

3.2.2.2 Predictors

3.2.2.3 Outcome

3.2.2.4 Analysis

3.3 Data analysis

3.3.1 Missing data and imputation

3.3.1.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

3.3.1.2 Studies developing ICI pneumonitis risk prediction models (n=3)

3.3.2 Univariate and multivariable analysis

3.3.2.1 Studies investigating risk factors for developing ICI pneumonitis (n=50)

3.3.2.2 Studies developing ICI pneumonitis risk prediction models (n=3)

4 Discussion

4.1 Study aims

4.2 Dataset considered

4.3 Outcome definition

4.4 Statistical analysis

5 Conclusion

Data availability statement

Author contributions

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

95% of researchers rate our articles as excellent or good