Abstract
Breast cancer patients exhibit diverse responses to CDK4/6 inhibitor (CDK4/6i)-based therapies, and identifying eligible patients remains a challenge. Artificial intelligence (AI) has demonstrated the potential to address complex clinical problems. Here, we applied a novel AI-based approach, named as CDK4/6i Response Model (CRM), which combined a previously published method and a scoring model based on random forest algorithm for evaluating breast cancer patients’ sensitivity to CDK4/6i-based therapies. To train the CRM, we transformed the genomic data of 980 breast cancer patients from the TCGA database into signaling pathway activity profiles (APSP) by utilizing the modified Damage Assessment of Genomic Mutations (DAGM) algorithm. To mimic the mechanism of action of CDK4/6 inhibitors, a scoring model was then trained to classify the HR+/HER2- and HR-/HER2-breast cancer molecular subtypes by the differential APSP features between the two, which reasonably reflected the potential role played by CDK4/6 molecules in HR+/HER2-breast cancer cells. The effectiveness of the CRM’s ability was verified by accurately classifying HR+/HER2- and HR-/HER2-breast cancer patients in a separate local patient cohort (n = 343) in Guangdong, China. Significantly, the scores were observed to be distinct (p = 0.025) between CDK4/6i-treated patients with different responses. Furthermore, breast cancer patients belonging to different subtypes were grouped into five distinct populations based on the scores assigned by the CRM. The results showed not only the heterogenetic responses across subtypes but also more than half of HR+/HER2+ patients might be benefited from CDK4/6i-based treatment. The CRM empowered us to conduct in-silico clinical trials (ICT) on different types of cancer patients responding to CDK4/6i-based therapies. In this study, we performed twin ICT of previously disclosed clinical trials (NCT02246621, NCT02079636, NCT03155997, NCT02513394, NCT02675231), and observed concerted results as the real-world clinical outcomes. These findings show the potential of CRM as a companion diagnostic for CDK4/6i-based therapies and demonstrate promising applications by ICT to guide pan-cancer treatment using CDK4/6 inhibitors in the clinical ends.
Summary In most cases, the efficacy of CDK4/6 inhibitors (CDK4/6i) for breast cancer patients is tightly correlated with molecular subtypes, which are classified by the status of hormone receptors (HR) and human epidermal growth factor receptor 2 (HER2)1. More than one pathological condition in HR+/HER2-type of patients are approved to use CDK4/6i-based therapies as standard treatment2–10, while patients with HR-/HER2-breast cancer have less chances to benefit from CDK4/6i-based therapies11. However, there is currently no approach to differentiate the two types of breast cancer based on their differences in cellular functions, which leads to the vagueness in how they react differently to same kind of treatment including CDK4/6 inhibitors12,13. Furthermore, from clinical observation, partial HR+/HER2-patients are proved resistant to CDK4/6i14, and identifying eligible patients with all types of breast cancer remains a challenge15–17. Here we show an AI-based function explanatory method, named CDK4/6i Response Model (CRM), for evaluating breast cancer patients’ sensitivity to CDK4/6i-based therapies. The CRM trained by differential genomic data-transformed pathway-level features between HR+/HER2- and HR-/HER2-breast cancers was able to classify HR+/HER2- and HR-/HER2-breast cancer patients in TCGA (n=581) and an independent patient cohort (n=343), which also reflects the different responses to CDK4/6i-based therapies18. We found that the CRM scores are significantly different between CDK4/6i-treated patients with different outcomes. Breast cancer patients across different subtypes were mainly distributed into five populations regarding to the CRM scores, indicating not only the heterogenetic responses across subtypes but also more than half of HR+/HER2+ patients might be benefited from CDK4/6i-based treatment. The CRM empowered us to conduct in-silico clinical trials (ICT) which simulates previously disclosed clinical outcomes. Our ICT results reflected the efficacy differences observed clinically, which shows the potential of CRM to become a companion diagnostic for CDK4/6i-based therapies. Thus, our study demonstrated a novel and useful tool to explore therapeutic potentials of CDK4/6i-based treatments in breast cancer and, promisingly, other more malignant diseases.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by National Natural Science Foundation of China (grant no. 82072939 to M. Yang), Natural Science Foundation of Guangdong Province (grant no. 2022A1515010425 to M. Yang), Guangzhou Science and Technology Program (grant no. 202206010110 to M. Yang), Hefei National Laboratory for Physical Sciences at the Microscale (grant no. KF2020009 to G. Niu).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee of Guangdong Provincial People's Hospital gave ethical approval for this work
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present study are available upon reasonable request to the authors