Abstract
Aims T1 mapping on cardiac magnetic resonance (CMR) imaging is useful for diagnosis and prognostication in patients with light-chain cardiac amyloidosis (AL-CA). We conducted this study to evaluate the performance of T1 mapping parameters for detection of cardiac amyloidosis (CA) in patients with left ventricular hypertrophy (LVH) and their prognostic values in patients with AL-CA, using a semi-automated deep learning algorithm.
Methods and Results A total of 300 patients who underwent CMR for differential diagnosis of LVH were analyzed. CA was confirmed in 50 patients (39 with AL-CA and 11 with transthyretin amyloidosis), hypertrophic cardiomyopathy in 198, hypertensive heart disease in 47, and Fabry disease in 5. A semi-automated deep learning algorithm (Myomics-Q) was used for the analysis of the CMR images. The optimal cutoff extracellular volume fraction (ECV) for the differentiation of CA from other etiologies was 33.6% (diagnostic accuracy 85.6%). he artificial intelligence (AI)-derived ECV showed a significant prognostic value for a composite of cardiovascular death and heart failure hospitalization in patients with AL-CA (revised Mayo stage III or IV) (adjusted hazard ratio 4.247 for ECV ≥40%, 95% confidence interval 1.215–14.851, p-value=0.024). Incorporation of AI-derived ECV into the revised Mayo staging system resulted in better risk stratification (integrated discrimination index 27.9%, p=0.013; net reclassification index 13.8%, p=0.007).
Conclusions AI-assisted T1 mapping on CMR imaging allows for improved diagnosis of CA from other etiologies of LVH. Furthermore, AI-derived ECV has significant prognostic value in patients with AL-CA, suggesting its clinical usefulness.
Introduction
Differential diagnosis of left ventricular hypertrophy (LVH) is one of the most challenging issues in cardiovascular imaging.(1) The different etiologies of LVH have morphological similarities, but unique pathophysiology, treatment strategy, and prognosis.(2, 3) In particular, differential diagnosis of cardiac amyloidosis (CA) is critical for effective management: light-chain cardiac amyloidosis (AL-CA) is a hematologic malignancy that requires cytotoxic chemotherapy and/or stem cell transplantation, whereas transthyretin CA (TTR-CA) is a rare infiltrative cardiomyopathy that requires specific treatments such as TTR stabilization.(4)
While several combinations of clinical, electrocardiographic, and echocardiographic features have shown acceptable accuracy, cardiac magnetic resonance (CMR) remains the most important imaging modality for the differential diagnosis of LVH.(4) The presence and patterns of late gadolinium enhancement (LGE) have proven to be relevant markers for the detection of CA and hypertrophic cardiomyopathy (HCM) from hypertensive heart disease (HHD).(2, 5) In addition, novel parameters of myocardial fibrosis, including native T1 and extracellular volume fraction (ECV), have proven to be potentially useful for differential diagnosis and prognostication in patients with various etiologies of LVH, especially in those with CA.(6)
Despite the rapid progress of artificial intelligence (AI) in CMR, there has been no study that showed the usefulness of AI in the T1 mapping on CMR images of patients with LVH. Therefore, we aimed to assess the diagnostic performance of an AI-assisted semi-automated T1 mapping for the detection of CA in patients with LVH. In addition, we analyzed the prognostic value of the AI-derived ECV in patients with AL-CA in comparison with current clinical staging system.
Methods
Study design and cohort
This study was conducted in accordance with the principles outlined in the 2013 revised Declaration of Helsinki and approved by Seoul National University Bundang Hospital Institutional Review Board. The requirement for informed consent was waived owing to the retrospective nature of the study and the minimal expected risk to the patients.
We retrospectively identified 300 patients (47 with HHD, 198 with HCM, 50 with CA [39 with AL-CA, and 11 with TTR-CA], and 5 with Fabry disease [FD]) (Figure 1) who underwent CMR for differential diagnosis of the etiology, based on the detection of LVH on echocardiography. The diagnostic criteria for HHD, HCM, AL-CA, TTR-CA, and FD are described below.
Flowchart of patient selection Abbreviations: CMR, cardiac magnetic resonance; LVH, left ventricular hypertrophy; ECV, extracellular volume fraction; HHD, hypertensive heart disease; HCM, hypertrophic cardiomyopathy; CA, cardiac amyloidosis; AL-CA, AL cardiac amyloidosis; TTR-CA, transthyretin cardiac amyloidosis
Hypertensive heart disease
Patients with a history of hypertension who met the diagnostic criteria for LVH on echocardiography (LV mass index [LVMI] >115 g/m2 for men, and >95 g/m2 for women), without other potential causes of LVH.(7)
Hypertrophic cardiomyopathy
Patients who met the diagnostic criteria of HCM (LVWTmax ≥15 mm on echocardiography, in the absence of abnormal loading conditions that could sufficiently explain the LVH) were included.(8, 9) Definite evidence of HCM on CMR or identification of a typical gene mutation was required for a precise diagnosis.
Cardiac amyloidosis
The types of CA were determined by evaluating serum and urine electrophoresis, serum free light chains, CMR imaging, and technetium-99m scintigraphy, and were confirmed based on the findings of myocardial biopsies.(5) AL-CA was defined as biopsy-proven AL amyloidosis and/or monoclonal gammopathy, accompanied by the following findings: (i) increased LV wall thickness without dilated LV (average LV wall thickness ≥12 mm) in the presence of low voltage QRS (amplitude <0.5 mV in the limb leads or <1.0 mV in the anterior leads); or (ii) typical findings on CMR (patchy, subendocardial circumferential, or diffuse fuzzy LGE of the LV). Patients who showed positive or equivocal TTR immunohistochemical staining on myocardial biopsy underwent direct screening analysis for detection of TTR mutations.(10)
Fabry disease
FD was diagnosed based on the assessment of α-galactosidase A activity in the serum (for male patients) and the detection of GLA mutation (for both male and female patients).(11)
Exclusion criteria
Patients with unknown etiology of LVH even after performing all available diagnostic tests were excluded. In addition, patients were excluded if they presented with any of the following: (1) significant LV dysfunction (LV ejection fraction <40%), (2) active malignancy (other than AL-CA) or undergoing chemotherapy, (3) end-stage renal disease, (4) prior coronary revascularization, (5) evidence of combined ischemic cardiomyopathy, (6) significant valve disease, (7) AA amyloidosis, or (8) other metabolic or infiltrative cardiomyopathies.
Clinical data acquisition
Clinical and anthropometric, laboratory, and echocardiographic measurements were obtained at the time of CMR imaging. All echocardiographic images were obtained using a standard ultrasound device with a 2.5-MHz probe, in accordance with the guidelines of the European Association of Cardiovascular Imaging.(7)
CMR imaging acquisition and assessment of myocardial fibrosis
Detailed procedure used for CMR imaging is provided in the Supplementary Material. CMR imaging was performed according to standard protocols using a 3.0-T imager (Ingenia CX, Philips Healthcare, Best, Netherlands) with a 16-channel phased array coil. Cine imaging was obtained using a segmented steady-state free-precession sequence. LGE images were obtained using a phase-sensitive inversion recovery sequence 10 min after the injection of 0.2 mmol/kg of gadobutrol (Gadovist, Bayer Schering Pharma, Berlin, Germany). Pre- and post-contrast (15 minutes) T1 mapping was performed using a mid-ventricular short-axis section at the level of papillary muscles, and the images were acquired using the modified Look-Locker inversion-recovery (MOLLI) sequence (the “3-3-5” standard protocol).(6) The region of interest was delineated on the entire LV myocardium at the level of papillary muscles.(12, 13) T1 maps were generated using MR workstation with in-line motion correction following image acquisition. The recovery rate of T1 relaxation was measured in a mid-ventricular short-axis slice.(13, 14) ECV was calculated as ECV = (1–hematocrit) × [△R1myocardium]/[△R1blood], with hematocrit measured at the time of CMR imaging.(6) All measurements were performed by a single radiologist (E-J.C.).
AI-based assessment of native T1 and ECV
Our deep learning (DL) algorithm was used for automated analysis of T1 maps in accordance with the method described in the previous study,19 in which 2D U-Net for myocardial segmentation incorporated into Myomics-T1 software ver. 1.0.0 (Phantomics Inc.). Details of the architecture of the DL model are presented in the Supplementary Materials and Figure S1. Additional relevant information can be found in previous publications.(13, 14) Representative figures of the AI-derived native T1 and ECV for various etiologies of LVH are shown in Figure 2. The correlation and agreement analyses conducted using 30 randomly selected patients are summarized in Figure S2.
Typical native T1 and ECV measurements obtained through an AI-assisted semi-automated algorithm for various etiologies of LVH: (A) HHD, (B) HCM, (C) AL-CA, (D) TTR-CA, and (E) FD. Abbreviations: FD, Fabry disease; others as in Figure 1.
Study outcomes
The diagnostic outcome was the area under the receiver operating characteristic curve (AUC) for differentiating CA from other etiologies of LVH. The clinical outcome for prognosis of AL-CA was a composite of cardiovascular death and hospitalization for heart failure, assessed through regular outpatient visits, telephone interviews and chart reviews.
Statistical analysis
Categorical variables are presented as frequencies and percentages, and continuous variables as means ± standard deviations. Group comparisons were performed with Mann-Whitney U test and Kruskal-Wallis test for continuous variables, and the χ2 test or Fisher’s exact test for categorical variables. The AUC was used to measure the performance of the DL algorithm in the diagnosis of CA. Kaplan–Meier method and Cox proportional hazard regression model were used for survival analysis. Multivariate Cox proportional hazards regression with backward selection method was performed using univariable markers with p-values <0.100. All statistical analyses were performed using R statistical software version 3.6.3 (The R Foundation, Vienna, Austria), and p-value <0.05 was considered statistically significant.
Results
Baseline characteristics
Baseline characteristics of the study population are summarized in Table 1. The mean LVMI was 118.7±33.7 g/m2 in patients with HHD, 134.9±32.7 g/m2 in those with HCM, 134.0±36.1 g/m2 in those with AL-CA, 143.0±54.0 g/m2 in those with TTR-CA, and 162.2±80.2 g/m2 in those with FD. Patients with CA had the highest native T1 values (1444.6±83.4 msec in AL-CA, and 1418.3±72.8 msec in TTR-CA), followed by patients with HCM (1319.4±56.9 msec), and those with HHD (1291.4±38.9 msec), and patients with FD (1143.9±80.3 msec) (Figure 3). The ECV values showed similar trends: the highest in patients with AL-CA (43.6±8.1%), followed by those with TTR-CA (40.0±4.3%), HCM (29.5±5.9%), and HHD (26.1±3.3%), and FD (23.2±3.0%) (Figure 3).
AI-derived native T1 (A) and ECV (B) measurements are shown across various etiologies of LVH. Abbreviations as in Figure 1.
Accuracy of AI-derived native T1 and ECV measurements
The AI-derived and manual native T1 and ECV measurements were compared on a per-patient basis (Figure S2). AI-derived native T1 showed a strong correlation and good agreement with the reference native T1 (r=0.995, p<0.001; bias 2.3 msec, 95% limits of agreement [LoA] −15.0 to 19.6 msec). In addition, AI-derived ECV was strongly correlated with the reference ECV (r=0.993, p<0.001) with a good agreement (bias 0.2%, 95% LoA −2.2% to 2.6%).
Differentiation of CA from other etiologies of LVH
The diagnostic performance of the AI-derived native T1 and ECV measurements was assessed in terms of differentiating CA from other etiologies of LVH. The AUC values for the diagnosis of CA were 0.899 (95% CI 0.847–0.952; p<0.001) for native T1 and 0.946 (95% CI 0.922–0.971; p<0.001) for ECV (Figure S3). The optimal native T1 and ECV cutoff values were 1364 msec and 33.6%, respectively (accuracy, 84.0% and 85.6%; sensitivity, 84.5% and 87.5%; and specificity, 86.8% and 96.2%, respectively).
Prognostication with native T1 and ECV for AL-CA
Thirty-nine patients in the study population were diagnosed with AL-CA: 14 (35.9%) were at revised Mayo stage III and 25 (64.1%) were at stage IV. During a mean 22 months of follow-up, 17 patients died of cardiovascular causes and 20 patients were hospitalized for heart failure. The AUC values for the prediction of study outcome using native T1 and ECV were 0.658 (95% CI 0.462–0.855; p=0.139) and 0.888 (95% CI 0.764–1.000; p<0.001), respectively (p-for-comparison=0.05; Figure S4). The optimal cutoff value of ECV for study outcome was 40.0% (accuracy, 92.9%; sensitivity, 79.0%; and specificity, 70.8%).
Survival curves of patients with AL-CA are depicted in Figure 4, according to the difference in the serum free light chain (dFLC) (Figure 4A), NT-proBNP (Figure 4B), revised Mayo stage (Figure 4C), and AI-derived ECV (Figure 4D). The AI-derived ECV showed a significant discrimination of prognosis (adjusted HR 4.247 for ECV ≥40%, 95% CI 1.215–14.851, p=0.024), whereas other conventional risk parameters, such as the dFLC, NT-proBNP, cardiac troponin, and LV global longitudinal strain, did not show significant prognostic value (Table 2 and Table S1). The prognostic value of the AI-derived ECV remained significant even after adjusting for the revised Mayo stage (adjusted HR 6.324 for ECV ≥40%, 95% CI 1.794–22.297, p=0.004).
Clinical outcomes were compared between the subgroups divided by (A) dFLC, (B) NT-proBNP, (C) revised Mayo stage, and (D) AI-derived ECV. Abbreviations: dFLC, difference in the serum levels of free light chain; NT-proBNP, N-terminal pro-B natriuretic peptide; n.s., not significant; others as in Figure 1.
The prognostic value of the AI-derived ECV in patients with AL-CA was further assessed using the following subgroups: group 1 (revised Mayo stage III and ECV <40%), group 2 (revised Mayo stage IV and ECV <40%), group 3 (revised Mayo stage III and ECV ≥40%), and group 4 (revised Mayo stage IV and ECV ≥40%). The combination of revised Mayo stage and AI-derived ECV showed excellent risk stratification, with the worse prognosis in group 4 and the best in group 1 (Figure 5). The AUC of the combination of revised Mayo stage and AI-derived ECV was significantly higher than that of the Mayo staging system alone (AUC 0.860 vs. 0.588; p=0.006). In addition, net reclassification improvement (NRI) and integrated discrimination improvement (IDI) analyses showed that the combination of ECV and the revised Mayo staging system could provide significantly better risk stratification than the revised Mayo staging system alone (IDI 0.279, 95% CI 0.025–0.499, p=0.013; NRI 0.138, 95% CI 0.000–0.597, p=0.007; Table 3).
Clinical outcomes were compared between the subgroups divided by a combination of revised Mayo staging and AI-derived ECV. Abbreviations as in Figure 1.
Discussion
In this study, we analyzed the performance of AI-derived native T1 and ECV in the diagnosis and prognostication of CA. The results demonstrated that the AI-derived native T1 and ECV were effective in differentiating CA from other etiologies of LVH. Furthermore, in patients with AL-CA, the AI-derived ECV showed independent prognostic value beyond the revised Mayo stage and could provide better risk stratification when incorporated into current staging system. Our findings suggest the usefulness of AI-derived T1 mapping for differential diagnosis of LVH, as well as for prognostication in AL-CA.
Etiologies of LVH and the importance of differential diagnosis
LVH is a common cardiac condition, but various etiologies should be considered because of the considerable differences in the pathophysiology, treatment, and prognosis across the possible etiologies of LVH.(1, 3) If LVH is a result from increased afterload, i.e., HHD, management is mainly focused on the alleviation of blood pressure, and the prognosis is benign. However, in patients with HCM, the assessment of sudden cardiac death risk and its prevention is crucial in the treatment strategy, with special considerations required for its complications, such as arrhythmia, heart failure, or LV outflow tract obstruction.(8, 9) AL-CA requires a complex approach, given that AL-CA is a hematologic malignancy with excessive production of light-chain immunoglobulins.(2) Its management includes cytotoxic chemotherapy and stem cell transplantation, along with the management of cardiovascular complications, but the prognosis is poor: patients with AL-CA have a median survival of 24 months from the initial diagnosis.(5, 15) As a subtype of CA, TTR-CA shares common morphological features with AL-CA; however, its pathophysiology is unique. The accumulation of TTR, due to genetic or unknown causes, is the main target for specialized treatment (i.e., TTR tetramer stabilizers).(10) Another unique etiology of diffuse LVH should be noted: FD is a rare X-linked lysosomal storage disorder, caused by mutations in GLA, responsible for coding the lysosomal enzyme α-galactosidase A.(11) The absent or insufficient α-galactosidase A activity leads to the accumulation of globotriaosylceramide and related globotriaosylsphingosine in affected tissues. This accumulation in myocardium results in myocardial fibrosis, requiring specific therapies, primarily enzyme replacement therapy.
Differential diagnosis of LVH etiologies using CMR
The morphological similarities of different etiologies of LVH and the lack of pathognomonic/specific findings makes its differential diagnosis using echocardiography challenging.(3) Therefore, an arbitrary diagnosis based on echocardiography needs to be investigated further using non-invasive imaging modalities, such as CMR. In particular, HCM typically shows multifocal discrete LGE, CA shows diffuse LGE typically at subendocardial layer, FD shows diffuse LGE across entire myocardium, and HHD shows focal fuzzy LGE at the right ventricular insertion points of LV myocardium.(12) However, these patterns can overlap between etiologies, and may not even be apparent in the early stages of the disease. Given these limitations and ambiguity in the assessment of LGE patterns for the differential diagnosis of LVH etiologies, several studies have been conducted to analyze the efficacy of quantitative assessment using native T1 and ECV.(6, 16) Native T1 and ECV typically reflect the degree and amount of myocardial fibrosis, not only in the advanced stage (so called “replacement fibrosis”) but also in the early stage of diffuse myocardial fibrosis.(17) The accumulation of robust evidence has led to steady evolution of the use of native T1 and ECV in myocardial disease, showing different patterns of native T1 and ECV between the etiologies of LVH. According to a study by Baggiano et al., native T1 enabled diagnosis of CA, with a cutoff native T1 of <1036 msec resulting in a 98% negative predictive value and native T1 value of >1164 msec resulting in a 98% positive predictive value.(18) Hinojar et al reported that patients with HCM show significantly higher native T1 and ECV than those with HHD.(19) FD is also known to have significantly lower native T1 values than HHD and HCM.(20, 21) Our findings are consistent with previous studies, showing that CA results in the highest native T1 and ECV, followed by HCM and HHD, and with FD showing the lowest values.
Although both native T1 and ECV can be useful in differential diagnosis of LVH etiologies, it should be noted that native T1 can be affected by MR vendors, the intensity of magnetic field, and the amount and concentration of contrast agent. On the other hand, ECV represents a physiological parameter and is derived from the ratio of T1 signal values, therefore could be more reproducible between different field strengths, vendors, and acquisition techniques than both native and post-contrast T1.(16) ECV measures also exhibit better agreement with histological measures of the collagen volume fraction than isolated post-contrast T1. Given the better performance of ECV (AUC 0.946) in differentiating CA from other etiologies of LVH than that of native T1 (AUC 0.899), our findings support that ECV measurements could be more relevant than native T1 values for diagnosis of CA.
Deep learning algorithm for the measurement of native T1 and ECV
A previous study of 95 participants, including 12 patients with HCM, 12 with CA, and 12 with FD, conducted using our DL model demonstrated excellent correlation and agreement between the AI-derived native T1 and ECV and the reference values on a per-patient basis (native T1: r=0.967, bias 9.5 msec; ECV: r=0.987, bias 0.7%).(13) The excellent accuracy of our DL algorithm, which was further confirmed in the present study, indicates that the use of DL algorithm on the T1 mapping images can considerably reduce the time required for image interpretation.(13) In addition, considering that the differential diagnosis of LVH requires a series of non-invasive and invasive tests, the accuracy of T1 mapping parameters in the detection of CA can improve the diagnostic process in clinical practice. In particular, the differential diagnosis process includes myocardial biopsy for confirming amyloid infiltration in myocardium with specific immunohistochemistry staining for determination of CA subtypes. Genetic tests are required for the confirmation of HCM and TTR-CA, a process that leads to some delays until the final diagnosis. Thus, accurate interpretation of CMR images using T1 mapping can focus down to the possible diagnosis and facilitate the selection of appropriate diagnostic tests.
Prediction of prognosis in AL-CA using T1 mapping parameters
In the present study, we assessed the prognostic value of ECV, because in addition to ECV having theoretical advantages over native T1, the AUC of ECV in the present study was higher than that of native T1, not only for differentiating CA from other LVH etiologies but also for predicting its prognosis. Additionally, because of the small number of events in patients with HHD and HCM, and the small number of patients with TTR-CA and FD, we focused our analysis on patients with AL-CA.
Several clinical tools have been used to assess the prognosis of patients with AL-CA. The revised Mayo staging system suggested in 2012, which is the most widely used clinical staging system for AL-CA, incorporates multiparametric biomarkers, such as the levels of NT-proBNP, cardiac troponin, and the dFLC.(22) However, it could be argued that the incorporation of myocardial status into the system would be more representative of the disease stage, because the most common cause of death in patients with AL-CA is cardiovascular death. Indeed, the incorporation of LGE on CMR image into the staging system provided promising results.(23) However, it should be noted that the determination of LGE in AL-CA is often obscure, due to the fuzzy involvement of amyloid. Instead, the T1 mapping could be promising, given its more feasible quantitation and direct reflection of the burden of amyloid infiltration in myocardium.(24) Moreover, patients with revised Mayo stage III and IV in the present study who had elevated ECV (indicative of advanced myocardial infiltration) demonstrated higher risks of mortality than those with lower ECV. Furthermore, ECV had a more significant predictive value than the components of the revised Mayo staging system, such as the levels of NT-proBNP, cardiac troponin, and dFLC. These findings suggest that direct assessment of myocardial fibrosis using T1 mapping techniques could be more suitable for the prediction of mortality risk than indirect measurement of amyloid infiltration using serum biomarkers. Additionally, given that this is the first study to demonstrate the relevance of DL model not only for the diagnosis of CA, but also for prognostication in patients with AL-CA, our findings will facilitate the clinical application of DL models in CMR imaging.
Limitations
The present study has certain limitations. First, this was a single-center retrospective study with a limited sample size. Second, the prognostic value of T1 mapping parameters was assessed only among the patients with AL-CA, but not for those with other etiologies. Patients with HCM and HHD did not experience sufficient number of events, and the number of patients with FD was too small for further analysis. Third, data on the quantitative measurement of LGE was not available for our study population. Instead, we focused on the clinical usefulness of AI-derived T1 mapping parameters. Nonetheless, we acknowledge that future studies on the radiomics features of LGE in the AI-derived assessment of CMR images are warranted. Fourth, our study focused on the clinical utilization of AI-derived measurements of T1 mapping parameters, rather than the automatization of the entire clinical process. Finally, the findings of the present study should be validated further in terms of improving the diagnostic process (total duration and costs for confirmative diagnosis) and prognosis.
Conclusion
This study demonstrated that AI-derived native T1 and ECV on CMR images could differentiate CA from other etiologies of LVH. Furthermore, this study showed that AI-derived ECV has significant prognostic value in patients with AL-CA, suggesting its usefulness in clinical practice.
Data Availability
Data used in this study cannot be made publicly available because of the strict ethical restrictions set by the IRB of Seoul National University Bundang Hospital (https://e-irb.snubh.org). Please contact the corresponding authors (inchang.hwang{at}gmail.com or humandr{at}snubh.org) or the ethics board at SNUBH (snubhirb{at}gmail.com) for further inquiries regarding data availability within the scope permitted by the IRB.
Funding
This work was supported by the Medical AI Clinic Program through the National IT Industry Promotion Agency (NIPA), funded by the Ministry of Science and ICT (MSIT) of the Republic of Korea, and with a grant from SNUBH (grant number: 06-2020-0130).
Conflict of interest
Pan Ki Kim and Byoung Wook Choi are founders of Phantomics, Inc. (Seoul, Korea), the company that supports the software used in this study. Other authors have no conflicts of interest to declare.
Acknowledgements
None.
Footnotes
Supplemental Appendix: supplementary methods, 4 supplementary figure and 1 supplementary table
Funding: This work was supported by the Medical AI Clinic Program through the National IT Industry Promotion Agency (NIPA) funded by the Ministry of Science and ICT (MSIT) of the Republic of Korea, and with a grant from SNUBH (grant number: 06-2020-0130).
Conflict of interest: Pan Ki Kim and Byoung Wook Choi are founders of Phantomics, Inc. (Seoul, Korea), the company that supports the software used in this study. The other authors have no conflicts of interest to declare.