Author: Kamel, Ghada Hamed Aly./ Title: Computational Intelligence based<br>Classification Method for Breast Cancer<br>Diagnosis in Mammograms /

Search In this Thesis

العنوان

Computational Intelligence based
Classification Method for Breast Cancer
Diagnosis in Mammograms /

المؤلف

Kamel, Ghada Hamed Aly.

هيئة الاعداد

باحث / غادة حامد علي كامل حامد

مشرف / محمد فهمى طلبه

مناقش / السيد عبد الرحمن دهشان

مناقش / ذكى طه فايد

تاريخ النشر

2021.

عدد الصفحات

147 P. :

اللغة

الإنجليزية

الدرجة

ماجستير

التخصص

Computer Science (miscellaneous)

تاريخ الإجازة

1/1/2021

مكان الإجازة

جامعة عين شمس - كلية الطب - قسم الحسابات العلمية

الفهرس

Only 14 pages are availabe for public view

from

147

from

147

Abstract

With the recent research and development in deep learning since 2012, the emerging of Convolutional Neural Networks (CNNs) in bioinformatics, especially medical imaging, achieved obvious and tremendous improvement. Nowadays, from the raised critical challenges are the cancer detection in breast mammograms followed by the classification of the pathology of the localized lesions. Till now, the evaluation process of the screening mammograms is held by radiologists and physicians. Due to a large number of mammograms screened daily, this mammograms evaluation process becomes very monotonous, tiring, lengthy, costly, and significantly prone to errors. So, in the last two decades, the development of Computer Aided Detection (CAD) systems became very essential for early diagnosis. However, they should be improved for more accurate results which can help to obtain more confidence to the radiologist’s decision.
In this thesis, I present the recently approaches that are machine learning based models developed to detect cancer in breast mammograms and classify them by analyzing them in the form of comparative study and analysis. Also, the mammographic datasets that are publicly available and popular as well are listed in the recent work to facilitate any new experiments and comparisons. It is conducted from the comparative study that the You Only Look Once (YOLO) model is one from the recent efficient and fast object detectors that obtains high accuracy compared with other detectors.
Based on the conducted comparative analysis, an end-to-end computer-aided diagnosis system based on You Only Look Once (YOLO) is proposed. The proposed system first preprocesses the mammograms from their DICOM format to images without losing data. Then, YOLO is utilized to take each mammogram and checks it in one shot to detect any existing lesions. Finally, the localized masses are classified into malignant and benign lesions without any human intervention.
YOLO has three different architectures, and in this thesis, the three versions are trained on breast mammograms for mass detection and classification to compare their performance against each other. I utilized the anchor boxes upgrade in YOLO-V3 but in different manner. In order to achieve high detection accuracy, all anchor boxes used through training YOLO, are updated to sizes related to the masses I need to detect in mammograms, i.e., data related anchors. This is carried on by applying the k-means clustering on the sizes of all masses of the mammograms dataset to cluster them in a specific number of boxes which used later while YOLO training. Using the experimental results, it is proved how the idea of using YOLO-V3 by regenerated anchor boxes has a noticeable impact on the detection of masses and their classification as well. The proposed model is proved its ability to detect most of the challenging cases of masses and classify them correctly comparing with other recent detectors and the earlier versions of YOLO as well.
The existing publicly available datasets that contain fully field digital mammograms (FFDM) with lesions and at the same time contain accurate annotations, represented in only one dataset which is the INbreast. However, the disadvantage of the INbreast is its small number of samples. So, to utilize the good quality supported by the FFDM mammograms with handling the small number of samples issues, I implemented different augmentation approaches of other recent YOLO based studies. Among two commonly used techniques to augment data, it is proved by the results that augmenting the training set only is the fairest and accurate to be applied in the realistic scenarios.
The detected mammograms by the last proposed system have been checked to find out the problem of missing some masses that are already exist in mammograms. The main reason is that the masses exist in mammograms have no fixed sizes or near a specific range of sizes, they may be very small for example of width 10 and height 10 pixels and maybe large for example of width 900 and height 800 pixels. For this wide range I utilized YOLO-V3 model to detect masses through 3 phases-based architecture. First phase is the data preparation to convert DICOM files to images without losing data. Then, they are divided into mammograms with large and small masses representing the input to the second model training phase. The third phase is the model evaluation through two testing levels, first is the large masses checking and the second level is the small masses checking to output the intersected detection results for large and small masses. By this approach, the probability of missing large or small masses become very small. Moreover, this strategy has been proved its successfulness in overcoming the challenge of missing the small sized masses can be exist in breast.
Finally, YOLO is updated to a newer version which is YOLO-V4. In this thesis, the latest two models from YOLO (YOLOV3 and YOLO-V4) have been applied on mammograms. The main objective is to show how the new updates in YOLO-V4 will affect the prediction performance. Among the new updates in YOLO-V4 is the CSPDarknet53, which is a new Backbone that can enhance the learning capability of CNN. Besides the spatial pyramid pooling block which helps in increasing the receptive field and separating out the features of the most significant context. In YOLO-V3, Feature Pyramid Networks (FPN) is used for object detection while YOLO-V4 used PANet for different levels of detection. YOLO-V3 and YOLO-V4 are compared by evaluating their performance in detecting masses. It has been observed by the experimental results that YOLO-V4 outperformed YOLO-V3 by obtaining more accurate detection results and lower number of undetected mammograms, especially when its responsibility set to detect masses whatever their types instead of detecting benign and malignant masses.
To preserve the better accuracy obtained when YOLO is trained to only detect masses however its type, its classification role is replaced by other features extractors like ResNet and Inception. Inception-V3 results in better classification’s performance than ResNet and near accuracy than YOLO’s classifier. For this, I got confirmed from the experimental results that leaving the detection role to YOLO and then replace its features extractors by Inception-V3 to classify the localized objects is much more accurate and better than detect benign and malignant masses by YOLO, i.e., leaving the detection and classification roles to YOLO.