Author: Ghaleb, Moshira Saad El-Din./ Title: Content Based Image Retrieval /

Search In this Thesis

العنوان

Content Based Image Retrieval /

المؤلف

Ghaleb, Moshira Saad El-Din.

هيئة الاعداد

باحث / مشيره سعد الدين ابراهيم غالب

مشرف / محمد فهمي طلبه

مشرف / محمد السعيد عبده غنيمى

مشرف / ياسر هشام دكرورى

تاريخ النشر

2022.

عدد الصفحات

122 P. :

اللغة

الإنجليزية

الدرجة

ماجستير

التخصص

Computer Science (miscellaneous)

تاريخ الإجازة

1/1/2022

مكان الإجازة

جامعة عين شمس - كلية الحاسبات والمعلومات - قسم الحسابات العلمية

الفهرس

Only 14 pages are availabe for public view

from

122

from

122

Abstract

Multimedia became a primary aspect for all different social and business categories. This day, societies figure out using images, videos, and sounds. Productions, marketing, and people’s communication depend on social media using texting and multimedia. Servers hold huge datasets according to the huge numbers of users and their media. The search became a daily manner. Search by text is very easy for the labeled datasets but a search by image is very useful for unlabeled seen. Last few days search by image became a trend for many search engines. Developers built different programs to help users find locations, shops, people, and objects using a single image. This technology is called content-based image retrieval (CBIR). Content refers to the whole image or part of the image. Unfortunately, there is no accurate retrieval accuracy over the huge image datasets. Researchers find in the CBIR topic hot challenging points to propose more accurate models using intelligent methods with the lowest complexity.
In this thesis, we proposed recent approaches using machine learning and deep learning to classify images into categories and retrieve relevant images to the input image from different sizes of datasets. Image classification is a part of computer vision and it splits the datasets into categories to make the retrieve operation easier.. In this study, we utilize various types of algorithms such as; supervised, unsupervised and deep learning. Convolution neural network (CNN) is applied as a main deep learning model. We proposed a model using CNN, model fused CNN with recurrent neural networks like LSTM and GRU, and model fused it with traditional algorithms like Decision tree (DT) and Support Vector Machine (SVM). Also, we used a variety of datasets in sizes and types. The datasets is an important factor to measure the evaluation of the CBIR system, so we used different datasets in types and size. There is many images type like; people, objects, medical and digital numbers.
The first model applied in the small dataset to train and classify the images into ten categories. The model fused SOM+ MLP is used to train the dataset with 100% recognition accuracy. The second proposed approach applied to biomedical images which are the X-rays chest images to classify the images into three categories; infected chest with covid-19 virus, infected chest with other viruses, and normal X-ray chest. The model applied CNN for feature extraction and classification and the accuracies reached 96%, 95%, and 99.5% for the three classes: Covid-19, Pneumonia, and normal chests. The model achieved the highest accuracy in compare to the state of the art techniques for Covied19 classification.
In The third proposed approach, CNN model is applied to two types of images to classify them and evaluate the CBIR approach. The first dataset type is a single object image and the second is a digital numbers dataset image. The models applied on Cifar10 and Mnist datasets. The CNN model achieved 92.9% mAP for Cifar10 and 99.8% for Mnist dataset. The model achieved the highest accuracy compared with the state-of-the-arts on both datasets. The fourth proposed approach is for weather images classification. The proposed technique fused CNN with two different traditional supervised algorithms DT and SVM to classify the weather images into five categories shiny, sunny, fogy, cloudy, and rainy. The study reached 92%, 63%, 94% average accuracies for CNN, CNN+DT, and CNN+SVM models. The models achieved high accuracies in contrast with the state-of-the-art and can help in weather forecasting to reduce transportation accidents.
The last proposed approach is the CBIR approach that used three deep learning models. The three models are similar in the feature extraction phase but use different techniques in the classification phase. The models start with extracting the important features from the images using CNN, classifying them into categories, retrieving the relevant images to the input image from the trained dataset, and finally evaluating the CBIR process using the measurement metrics. The classification for the first model used the fully connected layers but used LSTM and GRU in the second and third models respectively. CNN+GRU achieved the highest results with high complexity reduction compared with the state-of-the-art models. CNN+GRU reached 97%, 91.5%, and 99%, 96%, 87.5%, and 99.9% for Corel1k, Corel8k, Aloi 74K, Cifar-10, Cifar-100, and Mnist dataset respectively.
The study made extensive research on different images to detect the object, classify images, and return the most relevant images. Classification is a very powerful technique that can help in many fields like face recognition, biomedical diagnosis, object detections, and CBIR. The proposed models based on machine learning and deep learning were effectively increased the classification accuracy and the CBIR performance for different datasets types. The study presented four deep learning approaches and one machine learning with the highest accuracies compared with the state-of-the-art models.