Background
Idiopathic interstitial pneumonias (IIPs) consist of heterogeneous interstitial lung diseases of unknown aetiology. Based on their clinical, radiological and pathologic features, IIPs comprise several disease entities, such as idiopathic pulmonary fibrosis (IPF), nonspecific interstitial pneumonia (NSIP) and cryptogenic organizing pneumonia (COP) [
1].
The precise classification of IIPs is essential for selecting treatment and predicting prognosis; however, this is not simple in clinical practice [
1]. First, diagnosis at a single point is sometimes difficult, but this can be achieved by monitoring disease behaviour. For example, some patients with IIPs lacking honeycombing experience disease progression over several years, and they develop a usual interstitial pneumonia (UIP) pattern characterized by honeycombing on high-resolution computed tomography (HRCT) that is eventually diagnosed as IPF. In these patients, it is difficult to accurately diagnose the disease in the first examination. Additionally, patients with advanced IIPs other than IPF sometimes develop radiological and pathological honeycombing that mimics IPF [
2]. Second, pathological patterns can vary among different sites of the lungs in patients with IIPs [
3]. If surgical lung biopsy was not performed for representative lesions, it is difficult to ensure an accurate diagnosis [
4,
5]. Furthermore, discordance in diagnosis even among specialists is frequent [
6,
7]. Third, surgical lung biopsy, an essential procedure for the diagnosis of IIPs, cannot be performed in all patients [
8]. In fact, according to a recent ATS/ERS/JRS/ALAT guideline, surgical lung biopsy is not necessarily required for the diagnosis of IPF in patients with typical radiologic findings of UIP [
9]. Transbronchial cryobiopsy has attracted attention as a less invasive procedure than surgical lung biopsy; however, its utility for the diagnosis of IIPs remains a matter of debate [
10,
11].
A new trend of classifying interstitial lung diseases (ILDs) using clinical characteristics, regardless of conventional disease entities, has emerged [
12]. Recently, the concept of progressive fibrosing ILD (PF-ILD) was proposed, and this concept covers several IIPs featuring self-sustaining fibrosis, a progressive decline in lung function and early mortality [
12]. PF-ILD comprises a comprehensive group of ILDs, including IIPs, connective tissue disease-associated ILD (CTD-ILD), sarcoidosis and chronic hypersensitivity pneumonia (CHP). Although the concept of PF-ILD has not been validated, it is expected to play a certain role from a therapeutic perspective [
13]. Interstitial pneumonia with autoimmune features (IPAF) is another attempt to identify IIPs with features suggestive of, but not definitive for, CTD-ILD [
14]. The diagnostic criteria of IPAF are positivity for two of the following three domains: clinical domain (extra-thoracic symptoms associated with autoimmune diseases), serologic domain (serum autoantibodies) and morphologic domain (HRCT and histopathologic findings and multi-compartment involvement other than IIPs) [
14].
Clinical phenotyping using cluster analysis, which groups subjects according to the similarities and differences of their clinical features, has recently attracted attention for classifying heterogeneous diseases. For example, clinical phenotypes determined via cluster analysis using clinical data reveal distinct clinical outcomes in asthma and COPD [
15,
16]. In all ILDs, clinical phenotypes determined using cluster analysis illustrated considerable predictive accuracy for clinical outcomes [
17]. However, more than one-third of studied patients had non-IIPs such as CTD-ILD and CHP, both of which exhibit distinct characteristics and different clinical outcomes from IIPs [
17].
We hypothesized that the clinical phenotypes of IIPs provide more useful information about clinical outcomes among patients. In this study, we performed cluster analysis using the clinical data of patients with IIPs and evaluated the clinical utility of the phenotypes.
Methods
Study design
This retrospective observational study followed the ethical standards of the Declaration of Helsinki. The study protocol was approved by the Institutional Review Board of Hamamatsu University School of Medicine (Hamamatsu, Japan, approval No. E15-197).
Patients
The medical records of consecutive patients with IIPs who were diagnosed at Hamamatsu Medical University Hospital between September 2004 and August 2018 were retrospectively analysed. The IIP diagnosis followed American Thoracic Society (ATS)/European Respiratory Society (ERS) guidelines [
1,
9,
18,
19]. Patients who exhibited acute exacerbation (AE) at the first visit were excluded [
20].
Data collection
The following data were collected at the time of IIP diagnosis: age, sex, body mass index (BMI), pack-year smoking history, history of dust exposure, laboratory data (C-reactive protein [CRP], lactate dehydrogenase [LDH], albumin, Krebs von den Lungen-6 [KL-6], surfactant protein D [SP-D]), pulmonary function (percent predicted forced vital capacity [%FVC], percent predicted forced expiratory volume in 1 s [%FEV
1]) and the clinical and pathological diagnoses (if performed). The history of dust exposure was based on each patient’s reported history. Dust was classified as organic or inorganic, and the type of exposure was classified as occupational or environmental. The existence of emphysema and honeycombing was evaluated via chest high-resolution computed tomography (HRCT) at the diagnosis. The definitions of emphysema and honeycombing were described elsewhere [
21]. Chest HRCT data were reviewed by two independent observers who were masked to patient data. Autoimmune features were recorded according to the IPAF diagnostic criteria [
14].
Statistical analysis
Hierarchical clustering was performed using age, sex, BMI, histories of smoking and dust exposure, autoimmune features, laboratory data (CRP, LDH, albumin, KL-6, SP-D), pulmonary function (%FVC, %FEV1) and HRCT findings (emphysema, honeycombing) to identify clinical IIP subtypes. The number of clusters in which the scree plot of the distances between the clusters in a dendrogram of hierarchical clustering rose sharply, indicating different characteristics between clusters, was determined. Overall survival (OS) and the time to the first acute exacerbation (AE) were measured from the IIP diagnosis. The Kaplan–Meier method and log-rank test were used to analyse OS, and Grey’s test was used to analyse the time to the first AE. Wilcoxon’s signed-rank test was used for continuous variables, and Fisher’s exact test was applied for categorical variables. All comparisons among clusters were adjusted using Bonferroni’s correction. Data were expressed as the median (range) unless otherwise indicated. All statistical tests were two-sided, and p < 0.05 indicated significance. All values were analysed using EZR (Saitama Medical Center, Jichi Medical University, Saitama, Japan), which is a graphical user interface for R (The R Foundation for Statistical Computing, Vienna, Austria), except the cluster analysis, which was performed using JMP v13.0.0 (SAS Institute Japan, Tokyo, Japan).
Discussion
Four IIP clusters with distinct clinical features and outcomes were identified. Cluster I, in which > 80% of the patients had autoimmune features, featured a low risk for AE and good OS. Clusters II and III had the greatest decreases in %FVC and the highest honeycombing rates. More than 80% of patients with IPF belonged to these two clusters. Cluster II had the lowest %FVC, BMI and albumin levels. Cluster III had the second lowest %FVC and normal BMI and albumin levels, as well as the highest LDH, KL-6 and SP-D levels. OS was worst in Cluster II, followed by Cluster III. Both clusters had high risks of AEs. Cluster IV, representing the mildest IIPs with preserved %FVC and almost normal laboratory data, had the best OS and a low risk of AE. In our patients, more than half did not undergo surgical lung biopsies. Additionally, the radiologic findings used in the cluster analysis were simple; namely the existence of honeycombing and emphysema on chest CT. Our clustering of patients with IIPs using clinical data is highly feasible in practice, and this strategy may provide useful information about patients’ clinical outcomes, even without detailed pathological or radiologic findings.
Interestingly, our cluster analysis revealed that the clinical features associated with collagen vascular diseases were important components of the clinical phenotypes of IIPs. A considerable number of patients with IIPs have some features of collagen vascular disease without fulfilling the defined criteria of any collagen vascular diseases. Several classifications have been proposed for such patients, but controversies remain unresolved [
22‐
25]. The IPAF concept was proposed by ATS/ERS to establish a unified platform for such patients with IIPs [
14]. Several studies reported that patients with IPAF have better OS and lower risks of AE than their counterparts [
26,
27]. Similarly, Cluster I, in which most patients had autoimmune features, was linked to better clinical outcomes than Clusters II and III. Additionally, patients with IPAF are younger and more commonly female than those without IPAF, coinciding with the characteristics of Cluster I [
28‐
30]. The clinical significance and utility of the IPAF concept are poorly validated. Additionally, several studies reported that the IPAF criteria must be revisited. In this context, our simple clustering successfully extracts patients with IIPs and autoimmune features who have distinct outcomes (Cluster I).
Clusters II and III were characterized by advanced fibrosis accompanied by decreased %FVC and honeycombing, which may lead to poor prognoses and high risks of AE. Most study patients with IPF belonged to these clusters. Patients with IPF have higher AE rates and worse OS than those with other IIPs [
31]. The high prevalence of IPF might have strong effects on clinical outcomes in Clusters II and III. However, other factors determined the characteristics of these clusters. More than 50% of patients in these clusters had non-IPF IIPs. Specifically, OS was poorest in Cluster II despite its lower proportion of patients with IPF than Cluster III. In this study, we identified age, low serum albumin levels, decreased %FVC and radiologic honeycombing as independent risk factors for OS in all patients with IIP (Additional file
2: Table 1). Additionally, the latter two were also risk factors for AE (Additional file
2: Table 2). Among the two clusters, Cluster II was associated with significantly older patient age, lower serum albumin levels, lower %FVC and higher rates of radiologic honeycombing than Cluster III. These differences may explain the clinical outcomes of Clusters II and III.
A previous study reported the usefulness of four clinical clusters for predicting clinical outcomes in 770 patients with ILDs [
17]. The study included non-IIPs including CTD-ILD and CHP and diverse ethnicities (mainly Caucasian and secondarily African American). Therefore, the clusters were not completely consistent with those in the current study. However, the cluster with female-dominant demographics, elevated antinuclear antibody levels, and the best clinical outcomes and another cluster of elderly male smokers with coexisting emphysema and the second worst outcomes were similar to Clusters I and III, respectively, in the current study.
The current study had several limitations. First, the numbers of clusters affected the results. In the current study, we also considered the use of three or five clusters. If we had employed three clusters, the distinct clinical features of Clusters III and IV would have been lost. Meanwhile, if we had employed five clusters, the two new clusters derived from Cluster II would have included small numbers of patients (insufficient for reaching statistical significance); thus, they were grouped together. Second, the numbers and/or optimal combination of variables used for the cluster analysis were not validated. We employed clinical variables that could be representative of demographic, historical, physical, laboratory and radiographic information. However, there were limited data available because the study was retrospective. However, there were limited data available because the study was retrospective. For example, a considerable number of study patients lacked data for diffusion capacity of the lung for carbon monoxide, which would have provided important information regarding the outcomes of ILD. The selection of variables can vary according to the purpose of the classification, such as predicting OS, AE risk, therapeutic response. Further studies are warranted to optimize the clinical classifications of IIPs.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.