Skip to main content
Erschienen in: Journal of Cancer Research and Clinical Oncology 17/2023

12.09.2023 | Research

SMiT: symmetric mask transformer for disease severity detection

verfasst von: Chengsheng Zhang, Cheng Chen, Chen Chen, Xiaoyi Lv

Erschienen in: Journal of Cancer Research and Clinical Oncology | Ausgabe 17/2023

Einloggen, um Zugang zu erhalten

Abstract

Purpose

The application of deep learning methods to the intelligent diagnosis of diseases has been the focus of intelligent medical research. When dealing with image classification tasks, if the lesion area is small and uneven, the background image involved in the training will affect the ultimate accuracy in determining the extent of the lesion. We did not follow the traditional approach of building an intelligent system to assist physicians in diagnosis from the perspective of CNN models, but instead proposed a pure transformer framework that can be used for diagnostic grading of pathological images.

Methods

We propose a Symmetric Mask Pre-Training vision Transformer SMiT model for grading pathological cancer images. SMiT performs a symmetrically identical high probability sparsification of the input image token sequence at the first and last encoder layer positions to pre-train visual transformers, and the parameters of the baseline model are fine-tuned after loading the pre-training weights, allowing the model to concentrate more on extracting detailed features in the lesion region, effectively getting rid of the potential feature dependency problem.

Results

SMiT achieved 92.8% classification accuracy on 4500 histopathological images of colorectal cancer processed by Gaussian filter denoising. We validated the effectiveness and generalizability of this study's methodology on the publicly available diabetic retinopathy dataset APTOS2019 from Kaggle and achieved quadratic Cohen Kappa, accuracy and F1-score of 91.9%, 86.91% and 72.85%, respectively, which were 1–2% better than previous studies based on CNN models.

Conclusion

SMiT uses a simpler strategy to achieve better results to assist physicians in making accurate clinical decisions, which can be an inspiration for making good use of the visual transformers in the field of medical imaging.
Literatur
Zurück zum Zitat Canziani A, Paszke A, Culurciello E (2016) An analysis of deep neural network models for practical applications. arXiv e-prints, arXiv:1605.07678 Canziani A, Paszke A, Culurciello E (2016) An analysis of deep neural network models for practical applications. arXiv e-prints, arXiv:​1605.​07678
Zurück zum Zitat Devlin J, Chang M-W, Lee K, Toutanova K (2018) BERT: pre-training of deep bidirectional transformers for language understanding. arXiv e-prints, arXiv:1810.04805 Devlin J, Chang M-W, Lee K, Toutanova K (2018) BERT: pre-training of deep bidirectional transformers for language understanding. arXiv e-prints, arXiv:​1810.​04805
Zurück zum Zitat Dosovitskiy A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv e-prints, arXiv:2010.11929 Dosovitskiy A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv e-prints, arXiv:​2010.​11929
Zurück zum Zitat Lan Z et al (2019) ALBERT: a lite BERT for self-supervised learning of language representations. arXiv e-prints, arXiv:1909.11942 Lan Z et al (2019) ALBERT: a lite BERT for self-supervised learning of language representations. arXiv e-prints, arXiv:​1909.​11942
Zurück zum Zitat Lee Y, Kim J, Willette J, Hwang SJ (2021a) MPViT: multi-path vision transformer for dense prediction. arXiv e-prints, arXiv:2112.11010 Lee Y, Kim J, Willette J, Hwang SJ (2021a) MPViT: multi-path vision transformer for dense prediction. arXiv e-prints, arXiv:​2112.​11010
Zurück zum Zitat Li Y et al (2021) MViTv2: improved multiscale vision transformers for classification and detection. arXiv e-prints, arXiv:2112.01526 Li Y et al (2021) MViTv2: improved multiscale vision transformers for classification and detection. arXiv e-prints, arXiv:​2112.​01526
Zurück zum Zitat Li Y, Mao H, Girshick R, He K (2022b) Exploring plain vision transformer backbones for object detection. arXiv e-prints, arXiv:2203.16527 Li Y, Mao H, Girshick R, He K (2022b) Exploring plain vision transformer backbones for object detection. arXiv e-prints, arXiv:​2203.​16527
Zurück zum Zitat Majumder S, Kehtarnavaz NJ, a. e.-p (2021) Multitasking deep learning model for detection of five stages of diabetic retinopathy. arXiv:2103.04207 Majumder S, Kehtarnavaz NJ, a. e.-p (2021) Multitasking deep learning model for detection of five stages of diabetic retinopathy. arXiv:​2103.​04207
Zurück zum Zitat Matsoukas C, Fredin Haslum J, Söderberg M, Smith KJ a. e.-p (2021) Is it time to replace CNNs with transformers for medical images? arXiv:2108.09038 Matsoukas C, Fredin Haslum J, Söderberg M, Smith KJ a. e.-p (2021) Is it time to replace CNNs with transformers for medical images? arXiv:​2108.​09038
Zurück zum Zitat Mnassri K, Rahmany I, Khlifa N, Hmissa S, Missaoui N (2021) In: 2022 8th international conference on control, decision and information technologies (CoDIT). pp 740–745 Mnassri K, Rahmany I, Khlifa N, Hmissa S, Missaoui N (2021) In: 2022 8th international conference on control, decision and information technologies (CoDIT). pp 740–745
Zurück zum Zitat Su Y, Bai Y, Zhang B, Zhang Z, Wang W (2021) HAT-Net: a hierarchical transformer graph neural network for grading of colorectal cancer histology images. British Machine Vision Conference Su Y, Bai Y, Zhang B, Zhang Z, Wang W (2021) HAT-Net: a hierarchical transformer graph neural network for grading of colorectal cancer histology images. British Machine Vision Conference
Zurück zum Zitat Touvron H et al (2020) Training data-efficient image transformers & distillation through attention. arXiv e-prints, arXiv:2012.12877 Touvron H et al (2020) Training data-efficient image transformers & distillation through attention. arXiv e-prints, arXiv:​2012.​12877
Zurück zum Zitat Waqas Zamir, S. et al (2021) Restormer: Efficient Transformer for High-Resolution Image Restoration. arXiv e-prints, arXiv:2111.09881 Waqas Zamir, S. et al (2021) Restormer: Efficient Transformer for High-Resolution Image Restoration. arXiv e-prints, arXiv:​2111.​09881
Zurück zum Zitat Xing X et al (2021) Categorical relation-preserving contrastive knowledge distillation for medical image classification. arXiv:2107.03225 Xing X et al (2021) Categorical relation-preserving contrastive knowledge distillation for medical image classification. arXiv:​2107.​03225
Zurück zum Zitat Zhou Y et al (2019) CGC-Net: cell graph convolutional network for grading of colorectal cancer histology images. arXiv e-prints, arXiv:1909.01068 Zhou Y et al (2019) CGC-Net: cell graph convolutional network for grading of colorectal cancer histology images. arXiv e-prints, arXiv:​1909.​01068
Metadaten
Titel
SMiT: symmetric mask transformer for disease severity detection
verfasst von
Chengsheng Zhang
Cheng Chen
Chen Chen
Xiaoyi Lv
Publikationsdatum
12.09.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
Journal of Cancer Research and Clinical Oncology / Ausgabe 17/2023
Print ISSN: 0171-5216
Elektronische ISSN: 1432-1335
DOI
https://doi.org/10.1007/s00432-023-05223-x

Weitere Artikel der Ausgabe 17/2023

Journal of Cancer Research and Clinical Oncology 17/2023 Zur Ausgabe

Bei seelischem Stress sind Checkpoint-Hemmer weniger wirksam

03.06.2024 NSCLC Nachrichten

Wie stark Menschen mit fortgeschrittenem NSCLC von einer Therapie mit Immun-Checkpoint-Hemmern profitieren, hängt offenbar auch davon ab, wie sehr die Diagnose ihre psychische Verfassung erschüttert

Antikörper mobilisiert Neutrophile gegen Krebs

03.06.2024 Onkologische Immuntherapie Nachrichten

Ein bispezifischer Antikörper formiert gezielt eine Armee neutrophiler Granulozyten gegen Krebszellen. An den Antikörper gekoppeltes TNF-alpha soll die Zellen zudem tief in solide Tumoren hineinführen.

Erhebliches Risiko für Kehlkopfkrebs bei mäßiger Dysplasie

29.05.2024 Larynxkarzinom Nachrichten

Fast ein Viertel der Personen mit mäßig dysplastischen Stimmlippenläsionen entwickelt einen Kehlkopftumor. Solche Personen benötigen daher eine besonders enge ärztliche Überwachung.

15% bedauern gewählte Blasenkrebs-Therapie

29.05.2024 Urothelkarzinom Nachrichten

Ob Patienten und Patientinnen mit neu diagnostiziertem Blasenkrebs ein Jahr später Bedauern über die Therapieentscheidung empfinden, wird einer Studie aus England zufolge von der Radikalität und dem Erfolg des Eingriffs beeinflusst.

Update Onkologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.