News Release

Yonsei University researchers develop deep learning model for microsatellite instability-high tumor prediction

The proposed model achieves high accuracy for ICI responsiveness prediction as well by combining tumor MSI status with stroma-to-tumor ratio

Peer-Reviewed Publication

Yonsei University

MSI-SEER for predicting microsatellite instability-high tumor

image: 

Researchers propose an innovative deep learning model for accurately predicting MSI tumor and immune checkpoint inhibitor responsiveness.

view more 

Credit: Jae-Ho Cheong from Yonsei University College of Medicine

One in every three people is expected to have cancer in their lifetime, making it a major health concern for mankind. A crucial indicator of the outcome of cancer is its tumor microsatellite status—whether it is stable or unstable. It refers to how stable the DNA is in tumors with respect to the number of mutations within microsatellites. The tumor microsatellite status has important clinical value because patients with microsatellite instability-high (MSI-H) cancers usually have more promising outcomes compared to patients with microsatellite stable tumors. Furthermore, tumors deficient in mismatch repair proteins—these are cells with mutations in specific genes that are involved in correcting mistakes made when DNA is copied in a cell—respond well to immune checkpoint inhibitors (ICIs) and not necessarily to chemotherapeutics.

Therefore, health practitioners and experts suggest MSI testing for newly diagnosed gastric and colorectal cancers. In recent years, artificial intelligence (AI) has made significant strides in this field and its incorporation in clinical workflow is expected to provide cost-efficient and highly accessible MSI testing. While several studies have utilized deep learning methods such as convolutional neural networks and vision-transformer-based techniques for MSI status prediction, they fail to capture the uncertainty in the prediction. Moreover, most of them do not provide key insights into ICI responsiveness, restricting their clinical applications.

Addressing these shortcomings, in a recent breakthrough, a team of researchers from the USA and Korea, including Jae-Ho Cheong from Yonsei University College of Medicine and Jeonghyun Kang from Gangnam Severance Hospital, Yonsei University College of Medicine proposed MSI-SEER. This innovative deep Gaussian process-based Bayesian model analyzes hematoxylin and eosin-stained whole-slide images in weakly-supervised learning to predict microsatellite status in gastric and colorectal cancers. A central and distinctive contribution of this study lies in the integration of uncertainty prediction and quantification into the AI model. Specifically, the model is equipped with the capability to self-assess its confidence by estimating predictive variance via Monte Carlo dropout. This variance is then transformed into a Bayesian Confidence Score (BCS), which quantifies the reliability of each prediction. As a result, the AI model is able to recognize instances where its predictions carry high uncertainty—effectively "knowing what it does not know." In such cases, the system, termed MSI-SEER, automatically flags these high-uncertainty slides for secondary review by human pathologists, rather than making autonomous decisions. The novel findings were made available online and published in the journal npj digital medicine on 19 May 2025.

"This study provides a vital blueprint for how an AI model that 'knows what it doesn't know,' which in turn increases the system's overall reliability, can create an AI-Human collaboration clinical framework for safer, more reliable, and more useful in real-world clinical environments,” Prof. Cheong said.

According to Prof. Cheong, “We performed extensive validation using multiple large datasets comprising patients from diverse racial backgrounds and found that MSI-SEER achieved state-of-the-art performance with MSI prediction by integrating uncertainty prediction.”

In addition, the model proved to be highly accurate for ICI responsiveness prediction by integrating tumor MSI status and stroma-to-tumor ratio. Furthermore, the tile-level predictions by MSI-SEER provided key insights into the contribution of spatial distribution of MSI-H regions in the tumor microenvironment and ICI response.

“We believe our technology already has potential for real-world application as a form of prospective cohort surveillance, or a kind of Phase IV clinical trials. The longer-term implication of this study is that it is not about a single specific predictive AI model. Rather, it has a broader implication of how AI algorithm can analyze clinical multi-modal data and create clinically usable models for precision cancer medicine,” expands Prof. Cheong on the possibilities of their innovation.

Overall, this work showcases the utilization of an AI model to devise clinically usable algorithm to predict the responsiveness to ICIs for patients with cancer.

 

***

 

Reference
DOI: 10.1038/s41746-025-01580-8

 

About the institute
Yonsei University, located in Seoul, South Korea, is one of the nation’s oldest and most prestigious private research institutions. Founded in 1885, the university is renowned for academic excellence and its strong commitment to international collaborations and partnerships. Yonsei offers diverse programs across the humanities, sciences, engineering, business, and more, serving approximately 30,000 students with a distinguished faculty. Yonsei remains dedicated to fostering creativity and cultural exchange in a rapidly evolving world.
https://www.yonsei.ac.kr/en_sc/

 

About the author
Prof. Jae-Ho Cheong is a surgeon-scientist with over 20 years of clinical experience in gastric cancer and expertise in systems cancer biology and translational genomics. His research focuses on developing clinically applicable molecular diagnostics and therapeutics tailored to prognostic and predictive subgroups of gastric cancer. He recently introduced a real-world molecular test for identifying patients responsive to adjuvant chemotherapy (nProfiler,TM). His transdisciplinary approach integrates multi-omics data from over 1,000 gastric cancer samples and more than 100 fully annotated PDX and PDO models. These platforms enable the development of precision treatments, especially for therapy-refractory and stem-like, EMT and Mesenchymal (SEM) subtypes.


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.