Skip to main content
Erschienen in: Journal of Cancer Research and Clinical Oncology 14/2023

14.07.2023 | Research

Data mining techniques in breast cancer diagnosis at the cellular–molecular level

verfasst von: Jian Yang, Dler Hussein Kadir

Erschienen in: Journal of Cancer Research and Clinical Oncology | Ausgabe 14/2023

Einloggen, um Zugang zu erhalten

Abstract

Introduction

Studies in the field of better diagnosis of breast cancer using machine learning and data mining techniques have always been promising. A new diagnostic method can detect the characteristics of breast cancer in the early stages and help in better treatment. The aim of this study is to provide a method for early detection of breast cancer by reducing human errors based on data mining techniques in medicine using accurate and rapid screening.

Methodology

The proposed method includes data pre-processing and image quality improvement in the first step. The second step consists of separating cancer cells from healthy breast tissue and removing outliers using image segmentation. Finally, a classification model is configured by combining deep neural networks in the third phase. The proposed ensemble classification model uses several effective features extracted from images and is based on majority vote. This model can be used as a screening system to diagnose the grade of invasive ductal carcinoma of the breast.

Results

Evaluations have been done using two histopathological microscopic datasets including patients with invasive ductal carcinoma of the breast. With extracting high-level features with average accuracies of 92.65% and 93.34% in these two datasets, the proposed method has succeeded in quickly diagnosing and classifying breast cancer with high performance.

Conclusion

By combining deep neural networks and extracting features affecting breast cancer, the ability to diagnose with the highest accuracy is provided, and this is a step toward helping specialists and increasing the chances of patients' survival.
Literatur
Zurück zum Zitat Azadifar S, Rostami M, Berahmand K, Moradi P, Oussalah M (2022) Graph-based relevancy-redundancy gene selection method for cancer diagnosis. Comput Biol Med 147:105766CrossRefPubMed Azadifar S, Rostami M, Berahmand K, Moradi P, Oussalah M (2022) Graph-based relevancy-redundancy gene selection method for cancer diagnosis. Comput Biol Med 147:105766CrossRefPubMed
Zurück zum Zitat Cao C, Wang J, Kwok D, Cui F, Zhang Z, Zhao D et al (2022) webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study. Nucleic Acids Res 50(D1):D1123–D1130CrossRefPubMed Cao C, Wang J, Kwok D, Cui F, Zhang Z, Zhao D et al (2022) webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study. Nucleic Acids Res 50(D1):D1123–D1130CrossRefPubMed
Zurück zum Zitat Cheng F, Wang H, Zhang L, Ahmad AM, Xu N (2022) Decentralized adaptive neural two-bit-triggered control for nonstrict-feedback nonlinear systems with actuator failures. Neurocomputing 500:856–867CrossRef Cheng F, Wang H, Zhang L, Ahmad AM, Xu N (2022) Decentralized adaptive neural two-bit-triggered control for nonstrict-feedback nonlinear systems with actuator failures. Neurocomputing 500:856–867CrossRef
Zurück zum Zitat Diz J, Marreiros G, Freitas A (2016) Applying data mining techniques to improve breast cancer diagnosis. J Med Syst 40:1–7CrossRef Diz J, Marreiros G, Freitas A (2016) Applying data mining techniques to improve breast cancer diagnosis. J Med Syst 40:1–7CrossRef
Zurück zum Zitat Jazayeri F, Shahidinejad A, Ghobaei-Arani M (2021) Autonomous computation offloading and auto-scaling the in the mobile fog computing: a deep reinforcement learning-based approach. J Ambient Intell Humaniz Comput 12:8265–8284CrossRef Jazayeri F, Shahidinejad A, Ghobaei-Arani M (2021) Autonomous computation offloading and auto-scaling the in the mobile fog computing: a deep reinforcement learning-based approach. J Ambient Intell Humaniz Comput 12:8265–8284CrossRef
Zurück zum Zitat Lei X, Li Z, Zhong Y, Li S, Chen J, Ke Y et al (2022) Gli1 promotes epithelial–mesenchymal transition and metastasis of non-small cell lung carcinoma by regulating snail transcriptional activity and stability. Acta Pharmaceutica Sinica B 12(10):3877–3890CrossRefPubMedPubMedCentral Lei X, Li Z, Zhong Y, Li S, Chen J, Ke Y et al (2022) Gli1 promotes epithelial–mesenchymal transition and metastasis of non-small cell lung carcinoma by regulating snail transcriptional activity and stability. Acta Pharmaceutica Sinica B 12(10):3877–3890CrossRefPubMedPubMedCentral
Zurück zum Zitat Li X, Shen X, Zhou Y, Wang X, Li TQ (2020) Classification of breast cancer histopathological images using interleaved DenseNet with SENet (IDSNet). PLoS ONE 15(5):e0232127CrossRefPubMedPubMedCentral Li X, Shen X, Zhou Y, Wang X, Li TQ (2020) Classification of breast cancer histopathological images using interleaved DenseNet with SENet (IDSNet). PLoS ONE 15(5):e0232127CrossRefPubMedPubMedCentral
Zurück zum Zitat Li Y, Wang H, Zhao X, Xu N (2022) Event-triggered adaptive tracking control for uncertain fractional-order nonstrict-feedback nonlinear systems via command filtering. Int J Robust Nonlinear Control 32(14):7987–8011CrossRef Li Y, Wang H, Zhao X, Xu N (2022) Event-triggered adaptive tracking control for uncertain fractional-order nonstrict-feedback nonlinear systems via command filtering. Int J Robust Nonlinear Control 32(14):7987–8011CrossRef
Zurück zum Zitat Lim TS, Tay KG, Huong A, Lim XY (2021) Breast cancer diagnosis system using hybrid support vector machine-artificial neural network. Int J Electr Comput Eng (IJECE) 11(4):3059CrossRef Lim TS, Tay KG, Huong A, Lim XY (2021) Breast cancer diagnosis system using hybrid support vector machine-artificial neural network. Int J Electr Comput Eng (IJECE) 11(4):3059CrossRef
Zurück zum Zitat Liu C, Zoph B, Neumann M, Shlens J, Hua W, Li LJ et al. (2018) Progressive neural architecture search. In: Proceedings of the European conference on computer vision (ECCV), pp. 19–34 Liu C, Zoph B, Neumann M, Shlens J, Hua W, Li LJ et al. (2018) Progressive neural architecture search. In: Proceedings of the European conference on computer vision (ECCV), pp. 19–34
Zurück zum Zitat Mookiah MRK, Acharya UR, Ng EYK (2012) Data mining technique for breast cancer detection in thermograms using hybrid feature extraction strategy. Quantitative InfraRed Thermogr J 9(2):151–165CrossRef Mookiah MRK, Acharya UR, Ng EYK (2012) Data mining technique for breast cancer detection in thermograms using hybrid feature extraction strategy. Quantitative InfraRed Thermogr J 9(2):151–165CrossRef
Zurück zum Zitat Padmapriya B, Velmurugan T (2014) A survey on breast cancer analysis using data mining techniques. In: 2014 IEEE international conference on computational intelligence and computing research, pp. 1–4. IEEE Padmapriya B, Velmurugan T (2014) A survey on breast cancer analysis using data mining techniques. In: 2014 IEEE international conference on computational intelligence and computing research, pp. 1–4. IEEE
Zurück zum Zitat Rezaeipanah A, Ahmadi G (2022) Breast cancer diagnosis using multi-stage weight adjustment in the MLP neural network. Comput J 65(4):788–804CrossRef Rezaeipanah A, Ahmadi G (2022) Breast cancer diagnosis using multi-stage weight adjustment in the MLP neural network. Comput J 65(4):788–804CrossRef
Zurück zum Zitat Rezaeipanah A, Syah R, Wulandari S, Arbansyah A (2021) Design of ensemble classifier model based on MLP neural network for breast cancer diagnosis. Intel Artif 24(67):147–156CrossRef Rezaeipanah A, Syah R, Wulandari S, Arbansyah A (2021) Design of ensemble classifier model based on MLP neural network for breast cancer diagnosis. Intel Artif 24(67):147–156CrossRef
Zurück zum Zitat Rostami M, Forouzandeh S, Berahmand K, Soltani M, Shahsavari M, Oussalah M (2022) Gene selection for microarray data classification via multi-objective graph theoretic-based method. Artif Intell Med 123:102228CrossRefPubMed Rostami M, Forouzandeh S, Berahmand K, Soltani M, Shahsavari M, Oussalah M (2022) Gene selection for microarray data classification via multi-objective graph theoretic-based method. Artif Intell Med 123:102228CrossRefPubMed
Zurück zum Zitat Saber A, Sakr M, Abo-Seida OM, Keshk A, Chen H (2021) A novel deep-learning model for automatic detection and classification of breast cancer using the transfer-learning technique. IEEE Access 9:71194–71209CrossRef Saber A, Sakr M, Abo-Seida OM, Keshk A, Chen H (2021) A novel deep-learning model for automatic detection and classification of breast cancer using the transfer-learning technique. IEEE Access 9:71194–71209CrossRef
Zurück zum Zitat Sarvestani AS, Safavi AA, Parandeh NM, Salehi M (2010) Predicting breast cancer survivability using data mining techniques. In: 2010 2nd international conference on software technology and engineering, vol. 2. IEEE, pp. V2–227 Sarvestani AS, Safavi AA, Parandeh NM, Salehi M (2010) Predicting breast cancer survivability using data mining techniques. In: 2010 2nd international conference on software technology and engineering, vol. 2. IEEE, pp. V2–227
Zurück zum Zitat Sekehravani EA, Babulak E, Masoodi M (2020) Implementing canny edge detection algorithm for noisy image. Bull Electric Eng Inform 9(4):1404–1410CrossRef Sekehravani EA, Babulak E, Masoodi M (2020) Implementing canny edge detection algorithm for noisy image. Bull Electric Eng Inform 9(4):1404–1410CrossRef
Zurück zum Zitat Shahidinejad A, Abbasinezhad-Mood D (2022) Ultra-lightweight and secure blockchain-assisted charging scheduling scheme for vehicular edge networks by utilization of NanoPi NEO. IEEE Trans Veh Technol 71(8):8116–8123CrossRef Shahidinejad A, Abbasinezhad-Mood D (2022) Ultra-lightweight and secure blockchain-assisted charging scheduling scheme for vehicular edge networks by utilization of NanoPi NEO. IEEE Trans Veh Technol 71(8):8116–8123CrossRef
Zurück zum Zitat Shahidinejad A, Ghobaei-Arani M, Masdari M (2021) Resource provisioning using workload clustering in cloud computing environment: a hybrid approach. Clust Comput 24(1):319–342CrossRef Shahidinejad A, Ghobaei-Arani M, Masdari M (2021) Resource provisioning using workload clustering in cloud computing environment: a hybrid approach. Clust Comput 24(1):319–342CrossRef
Zurück zum Zitat Talatian Azad S, Ahmadi G, Rezaeipanah A (2022) An intelligent ensemble classification method based on multi-layer perceptron neural network and evolutionary algorithms for breast cancer diagnosis. J Exp Theor Artif Intell 34(6):949–969CrossRef Talatian Azad S, Ahmadi G, Rezaeipanah A (2022) An intelligent ensemble classification method based on multi-layer perceptron neural network and evolutionary algorithms for breast cancer diagnosis. J Exp Theor Artif Intell 34(6):949–969CrossRef
Zurück zum Zitat Tan J, Liu L, Li F, Chen Z, Chen GY, Fang F et al (2022) Screening of endocrine disrupting potential of surface waters via an affinity-based biosensor in a rural community in the Yellow River Basin, China. Environ Sci Technol 56(20):14350–14360CrossRefPubMed Tan J, Liu L, Li F, Chen Z, Chen GY, Fang F et al (2022) Screening of endocrine disrupting potential of surface waters via an affinity-based biosensor in a rural community in the Yellow River Basin, China. Environ Sci Technol 56(20):14350–14360CrossRefPubMed
Zurück zum Zitat Tang F, Niu B, Zong G, Zhao X, Xu N (2022) Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning. Neural Netw 154:43–55CrossRefPubMed Tang F, Niu B, Zong G, Zhao X, Xu N (2022) Periodic event-triggered adaptive tracking control design for nonlinear discrete-time systems via reinforcement learning. Neural Netw 154:43–55CrossRefPubMed
Zurück zum Zitat Tarahomi M, Izadi M, Ghobaei-Arani M (2021) An efficient power-aware VM allocation mechanism in cloud data centers: a micro genetic-based approach. Clust Comput 24:919–934CrossRef Tarahomi M, Izadi M, Ghobaei-Arani M (2021) An efficient power-aware VM allocation mechanism in cloud data centers: a micro genetic-based approach. Clust Comput 24:919–934CrossRef
Zurück zum Zitat Teimouri P, Mazinani M, Hosseini R (2021) An intelligent hybrid segmentation model based on fuzzy logic, discrete wavelet transform in digital imaging for detection of gastric cancer. J Model Eng 18(63):131–150 Teimouri P, Mazinani M, Hosseini R (2021) An intelligent hybrid segmentation model based on fuzzy logic, discrete wavelet transform in digital imaging for detection of gastric cancer. J Model Eng 18(63):131–150
Zurück zum Zitat Tschandl P, Rinner C, Apalla Z, Argenziano G, Codella N, Halpern A et al (2020) Human–computer collaboration for skin cancer recognition. Nat Med 26(8):1229–1234CrossRefPubMed Tschandl P, Rinner C, Apalla Z, Argenziano G, Codella N, Halpern A et al (2020) Human–computer collaboration for skin cancer recognition. Nat Med 26(8):1229–1234CrossRefPubMed
Zurück zum Zitat Wang J, Jiang X, Zhao L, Zuo S, Chen X, Zhang L et al (2020) Lineage reprogramming of fibroblasts into induced cardiac progenitor cells by CRISPR/Cas9-based transcriptional activators. Acta Pharmaceutica Sinica B 10(2):313–326CrossRefPubMed Wang J, Jiang X, Zhao L, Zuo S, Chen X, Zhang L et al (2020) Lineage reprogramming of fibroblasts into induced cardiac progenitor cells by CRISPR/Cas9-based transcriptional activators. Acta Pharmaceutica Sinica B 10(2):313–326CrossRefPubMed
Zurück zum Zitat Wang J, Liu Q, Xie H, Yang Z, Zhou H (2021) Boosted efficientnet: detection of lymph node metastases in breast cancer using convolutional neural networks. Cancers 13(4):661CrossRefPubMedPubMedCentral Wang J, Liu Q, Xie H, Yang Z, Zhou H (2021) Boosted efficientnet: detection of lymph node metastases in breast cancer using convolutional neural networks. Cancers 13(4):661CrossRefPubMedPubMedCentral
Zurück zum Zitat Wang T, Wang H, Xu N, Zhang L, Alharbi KH (2023) Sliding-mode surface-based decentralized event-triggered control of partially unknown interconnected nonlinear systems via reinforcement learning. Inf Sci 641:119070CrossRef Wang T, Wang H, Xu N, Zhang L, Alharbi KH (2023) Sliding-mode surface-based decentralized event-triggered control of partially unknown interconnected nonlinear systems via reinforcement learning. Inf Sci 641:119070CrossRef
Zurück zum Zitat Zhang L, Deng S, Zhang Y, Peng Q, Li H, Wang P et al (2020) Homotypic targeting delivery of siRNA with artificial cancer cells. Adv Healthc Mater 9(9):1900772CrossRef Zhang L, Deng S, Zhang Y, Peng Q, Li H, Wang P et al (2020) Homotypic targeting delivery of siRNA with artificial cancer cells. Adv Healthc Mater 9(9):1900772CrossRef
Zurück zum Zitat Zhang H, Zhao X, Zhang L, Niu B, Zong G, Xu N (2022b) Observer-based adaptive fuzzy hierarchical sliding mode control of uncertain under-actuated switched nonlinear systems with input quantization. Int J Robust Nonlinear Control 32(14):8163–8185CrossRef Zhang H, Zhao X, Zhang L, Niu B, Zong G, Xu N (2022b) Observer-based adaptive fuzzy hierarchical sliding mode control of uncertain under-actuated switched nonlinear systems with input quantization. Int J Robust Nonlinear Control 32(14):8163–8185CrossRef
Zurück zum Zitat Zhang H, Zou Q, Ju Y, Song C, Chen D (2022c) Distance-based support vector machine to predict DNA N6-methyladenine modification. Curr Bioinform 17(5):473–482CrossRef Zhang H, Zou Q, Ju Y, Song C, Chen D (2022c) Distance-based support vector machine to predict DNA N6-methyladenine modification. Curr Bioinform 17(5):473–482CrossRef
Zurück zum Zitat Zhao H, Wang H, Niu B, Zhao X, Alharbi KH (2023a) Event-triggered fault-tolerant control for input-constrained nonlinear systems with mismatched disturbances via adaptive dynamic programming. Neural Netw 164:508–520CrossRefPubMed Zhao H, Wang H, Niu B, Zhao X, Alharbi KH (2023a) Event-triggered fault-tolerant control for input-constrained nonlinear systems with mismatched disturbances via adaptive dynamic programming. Neural Netw 164:508–520CrossRefPubMed
Zurück zum Zitat Zhao Y, Niu B, Zong G, Xu N, Ahmad AM (2023b) Event-triggered optimal decentralized control for stochastic interconnected nonlinear systems via adaptive dynamic programming. Neurocomputing 539:126163CrossRef Zhao Y, Niu B, Zong G, Xu N, Ahmad AM (2023b) Event-triggered optimal decentralized control for stochastic interconnected nonlinear systems via adaptive dynamic programming. Neurocomputing 539:126163CrossRef
Metadaten
Titel
Data mining techniques in breast cancer diagnosis at the cellular–molecular level
verfasst von
Jian Yang
Dler Hussein Kadir
Publikationsdatum
14.07.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
Journal of Cancer Research and Clinical Oncology / Ausgabe 14/2023
Print ISSN: 0171-5216
Elektronische ISSN: 1432-1335
DOI
https://doi.org/10.1007/s00432-023-05090-6

Weitere Artikel der Ausgabe 14/2023

Journal of Cancer Research and Clinical Oncology 14/2023 Zur Ausgabe

Bei seelischem Stress sind Checkpoint-Hemmer weniger wirksam

03.06.2024 NSCLC Nachrichten

Wie stark Menschen mit fortgeschrittenem NSCLC von einer Therapie mit Immun-Checkpoint-Hemmern profitieren, hängt offenbar auch davon ab, wie sehr die Diagnose ihre psychische Verfassung erschüttert

Antikörper mobilisiert Neutrophile gegen Krebs

03.06.2024 Onkologische Immuntherapie Nachrichten

Ein bispezifischer Antikörper formiert gezielt eine Armee neutrophiler Granulozyten gegen Krebszellen. An den Antikörper gekoppeltes TNF-alpha soll die Zellen zudem tief in solide Tumoren hineinführen.

Erhebliches Risiko für Kehlkopfkrebs bei mäßiger Dysplasie

29.05.2024 Larynxkarzinom Nachrichten

Fast ein Viertel der Personen mit mäßig dysplastischen Stimmlippenläsionen entwickelt einen Kehlkopftumor. Solche Personen benötigen daher eine besonders enge ärztliche Überwachung.

15% bedauern gewählte Blasenkrebs-Therapie

29.05.2024 Urothelkarzinom Nachrichten

Ob Patienten und Patientinnen mit neu diagnostiziertem Blasenkrebs ein Jahr später Bedauern über die Therapieentscheidung empfinden, wird einer Studie aus England zufolge von der Radikalität und dem Erfolg des Eingriffs beeinflusst.

Update Onkologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.