Revista Cubana de Ciencias Informáticas

On-line version ISSN 2227-1899

RCCI vol.16 no.4 La Habana Oct.-Dec. 2022  Epub Dec 01, 2022

 

Original article

A transfer learning approach for skin cancer detection

Mohammed Anouar Naoui1* (ORCID: 0000-0003-1653-531X)

1 Computer Science Department, University of El-Oued

ABSTRACT

Nowadays skin cancer is increasing due to several factors, and cancer detection has become an important research field. Several studies have proposed models and algorithms to detect cancer from images, and image classification is an effective tool for solving cancer classification problems.

This paper proposes a transfer learning approach for skin cancer detection. The authors used the transfer learning models ResNet50, VGG16, VGG19, InceptionV3, and MobileNetV2, together with a baseline CNN model. After training these models, the accuracies obtained by InceptionV3 (0.98), ResNet50 (0.97), VGG19 (0.92), and VGG16 (0.89) were the best results compared with the CNN model (0.78). This indicates the importance of transfer learning.

Key words: Skin cancer; Image classification; Deep learning; Transfer learning

Introduction

Skin cancer is the most common human cancer, and its incidence has been increasing constantly for several decades. Exposure to UV light is the leading cause of skin cancer (Sun et al., 2020). In recent years, skin cancer has experienced a worrying increase worldwide. The World Health Organization (WHO) estimates that two to three million benign cases of skin cancer and approximately 132,000 malignant melanomas occur each year worldwide. Cancer is one of the leading causes of death in humans; according to WHO statistics, it is predicted that cancer will be the leading cause of death (13.1 million deaths) by 2030. Among all types of cancer, skin cancer is the most common form of cancer in the United States. Based on projections, 20% of Americans will have skin cancer in their lifetime (Emre Celebi et al., 2007; Kenet & Fitzpatrick, 1994).

The human body is made up of living cells that grow, divide into new cells and die. Cell division is a continuous process in the human body and replaces dead cells. However, abnormal cell growth and uncontrollable cell division are the causes of cancer.

Fig. 1 Skin cancer example. 

The three main types of skin cancer are basal cell carcinoma (BCC), squamous cell carcinoma (SCC) and melanoma. Basal cell carcinoma (BCC) and squamous cell carcinoma (SCC) are also called non-melanoma skin cancer or keratinocyte cancers. Rare types of non-melanoma skin cancer include Merkel cell carcinoma and angiosarcoma. They are treated differently from BCC and SCC. Basal Cell Carcinoma (BCC) starts in the basal cells of the epidermis. It accounts for about 70% of non-melanoma skin cancers. BCC grows slowly over months or years and rarely spreads to other parts of the body. The earlier a BCC is diagnosed, the easier it is to treat. If left untreated, it can grow deeper into the skin and can damage nearby tissues, making treatment more difficult. Having one BCC increases the risk of getting another. It is possible to have more than one BCC at the same time on different parts of the body.

Fig. 2 Basal cell carcinoma (BCC). 

Melanoma is the most lethal kind of skin cancer (Adegun & Viriri, 2020). It starts in the melanocytic cells of the skin and accounts for 1 to 2% of all skin cancers. Although melanoma is a less common type of skin cancer, it is considered the most serious because it grows quickly and is the most likely to spread to other parts of the body, such as the lymph nodes, lungs, liver, brain, and bones, especially if not found early. The earlier melanoma is detected, the more likely it is to be treated successfully. Melanoma is one of the deadliest and fastest-growing types of cancer in the world.

Fig. 3 Melanoma. 

This work is related to our previous work on machine learning and big data (Naoui et al., 2020), in which we proposed a multilayer architecture for integrating machine learning with big data systems.

Skin cancer detection has attracted the attention of many researchers:

Ichim and Popescu (2020) proposed melanoma detection using an objective system based on multiple connected neural networks. The system is composed of two hierarchical levels, subjective and objective: the subjective level extracts the characteristics of the lesions from the data, and the objective level learns from the subjective level with a back-propagation perceptron and makes the final decision (melanoma or not melanoma). Pollastri et al. (2020) proposed data augmentation of skin lesions with generative adversarial networks (GANs), augmenting both the skin lesion images and their segmentations and testing two algorithms, convolutional GAN and Laplacian GAN. Mazoure et al. (2022) presented a web server architecture for uncertainty analysis in deep skin cancer detection (DUNEScan) based on convolutional neural networks. Fu et al. (2022) presented a method for melanoma diagnosis: in the first stage, images are preprocessed; the second stage segments the region of interest based on the kernel fuzzy C-means method; the third stage extracts the main characteristics of the image; finally, the authors proposed a multi-layer perceptron for skin cancer classification. Reis et al. (2022) and Agrahari et al. (2022) proposed deep convolutional neural networks for skin cancer detection; Reis et al. (2022) also address deep learning segmentation. Lakshminarayanan et al. (2022) compared convolutional neural networks, AdaBoost, gradient boosting, and decision tree algorithms for detecting skin cancer. Kadampur and Al Riyaee (2020) presented a tool with which a non-programmer can develop complex deep learning models; it opens up options for flexibility in the design of deep learning classifiers by hinting at general procedures and loop patterns in the development of deep learning models.

Although dermoscopy improves the visual perception of a skin lesion, automatic recognition of melanoma from dermoscopic images remains a difficult task with several challenges. First, the low contrast between skin lesions and normal skin makes it difficult to segment lesion areas accurately. Second, melanoma lesions and normal skin may have a high degree of visual similarity, making it difficult to distinguish melanoma lesions from normal skin. Third, variation in skin conditions across patients, e.g., skin color, natural hair, or veins, produces different appearances of melanoma in terms of color, texture, etc. Early detection is important: it raises life expectancy to 98%, against 17% when the diagnosis is made at later stages (Siegel et al., 2018). Thus, a favorable treatment process allowing early detection of skin cancer is vital for the life of the patient.

The paper is organized as follows: in the next section, Materials and Methods, we present our architecture and the transfer learning models used, MobileNetV2, InceptionV3, VGG19, VGG16, and ResNet50, together with a CNN model. To compare the models, we calculate their accuracy. We discuss our results in the Results and Discussion section and finish with the conclusion.

Materials and Methods

We propose an architecture composed of transfer learning models and a CNN model. The transfer learning models are MobileNetV2, InceptionV3, VGG19, VGG16, and ResNet50; each model returns its accuracy. The data used has two classes, benign and malignant (Figure 4). The development environment is the Python framework; for model development, we used the Scikit-learn, Keras, and TensorFlow libraries. We used Google Colab, a cloud service for the Google data science community that makes it possible to build complex and heavy machine learning and deep learning models without being limited by local machine resources (Bisong, 2019).

Fig. 4 Architecture of the system 

Data used

The data used are skin cancer images in two classes, benign and malignant. The dataset is 164 MB in size and contains 2637 training images and 660 test images of 224x224 pixels, organized in one folder per class (Table 1).

Fig. 5 Example of data used. 

Table 1 shows the contents of the dataset:

Table 1 Number of images in the dataset. 

Type of disease   Number of training images   Number of test images
Benign            1440                        360
Malignant         1197                        300
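
For readers who want to reproduce the setup, the following is a minimal loading sketch in Keras. The folder layout (data/train and data/test with one subfolder per class) is our assumption, since the paper only states that the images come in labeled folders.

```python
import tensorflow as tf

IMG_SIZE = (224, 224)   # image size used throughout the paper
BATCH_SIZE = 64         # batch size from Table 2

# Assumed layout: data/train/{benign,malignant}, data/test/{benign,malignant}
train_ds = tf.keras.utils.image_dataset_from_directory(
    "data/train", label_mode="categorical",   # one-hot labels for 2 classes
    image_size=IMG_SIZE, batch_size=BATCH_SIZE)
test_ds = tf.keras.utils.image_dataset_from_directory(
    "data/test", label_mode="categorical",
    image_size=IMG_SIZE, batch_size=BATCH_SIZE)
```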

Transfer learning models

Transfer learning (TL) is a machine learning (ML) research problem that focuses on storing the knowledge gained while solving one problem and applying it to a different but related problem. It is one of the most common techniques in computer vision and deep learning for transferring knowledge from one domain to another, and it allows users to reuse pre-trained weights from another domain when computational power is limited (West et al., 2007). Creating algorithms that facilitate transfer learning has become a goal for machine learning professionals, who strive to make machine learning as close as possible to human behavior. Machine learning algorithms are typically designed to handle single tasks; transfer learning develops methods for transferring knowledge from one or more source tasks to improve learning on a similar target task. The goal of these transfer learning strategies is to make machine learning as efficient as human learning (Basque, 2004). For image classification, key point detection, segmentation, and object detection, various pre-trained deep transfer learning models are used (Showkat et al., 2022; Ayana et al., 2022).
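
As an illustration of this idea, here is a minimal Keras sketch of the pattern used below: load a backbone pre-trained on ImageNet, freeze its weights, and train only a small new classification head. The head design is our assumption, guided by the average pooling listed in Table 2.

```python
import tensorflow as tf
from tensorflow.keras import layers

# Pre-trained backbone: the ImageNet knowledge is transferred, and the
# original 1000-class classifier is dropped (include_top=False).
base = tf.keras.applications.InceptionV3(
    weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False   # freeze the transferred weights

# Small trainable head for the benign/malignant problem.
model = tf.keras.Sequential([
    base,
    layers.GlobalAveragePooling2D(),         # average pooling, as in Table 2
    layers.Dense(2, activation="softmax"),   # benign vs. malignant
])
```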

MobileNetV2 is a convolutional neural network architecture designed to work well on mobile devices. It is based on an inverted residual structure in which the residual connections are between the bottleneck layers. The intermediate expansion layer uses lightweight depthwise convolutions to filter features as a source of non-linearity. As a whole, the MobileNetV2 architecture contains an initial fully convolutional layer with 32 filters, followed by 19 residual bottleneck layers (Sandler et al., 2018; Falconí et al., 2019).

InceptionV3 is a widely used image recognition model that achieves an accuracy of over 78.1% on the ImageNet dataset. The model is the culmination of many ideas developed by several researchers over the years, based on the original paper "Rethinking the Inception Architecture for Computer Vision" by Szegedy et al. The model itself is composed of symmetric and asymmetric building blocks, including convolutions, average pooling, max pooling, concatenations, dropout, and fully connected layers. Batch normalization is used extensively throughout the model and applied to activation inputs, and the loss is computed via softmax (Wang et al., 2019). VGG19 is a variant of the VGG model that consists of 19 weight layers (16 convolution layers and 3 fully connected layers, plus 5 max-pooling layers and 1 softmax layer). There are other variants of VGG, such as VGG11 and VGG16. VGG19 has 19.6 billion FLOPs (Carvalho et al., 2017).

VGG16 is a convolutional neural network model proposed by K. Simonyan and A. Zisserman of Oxford University in the paper "Very Deep Convolutional Networks for Large-Scale Image Recognition". The model reaches a top-5 test accuracy of 92.7% on ImageNet and was one of the famous models submitted to ILSVRC-2014. It improves on AlexNet by replacing the large kernel-sized filters (11 and 5 in the first and second convolutional layers, respectively) with several 3 × 3 kernel-sized filters one after another. VGG16 was trained for weeks on NVIDIA Titan Black GPUs (Simonyan & Zisserman, 2014; Tammina, 2019). ResNet is one of the most powerful deep neural networks; it achieved outstanding performance in the 2015 ILSVRC classification challenge, generalized well to other recognition tasks, and won first place in ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation in the ILSVRC and COCO 2015 competitions. There are many variants of the ResNet architecture, i.e., the same concept with a different number of layers: ResNet-18, ResNet-34, ResNet-50, ResNet-101, ResNet-110, ResNet-152, ResNet-164, ResNet-1202, etc. The name ResNet followed by a number simply denotes the ResNet architecture with that number of neural network layers (He et al., 2016; Reddy & Juliet, 2019).
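
Since all five backbones are exposed through keras.applications with the same constructor signature, the comparison can be driven by one helper. This is a sketch of the workflow we assume, not the authors' exact code.

```python
import tensorflow as tf
from tensorflow.keras import applications, layers

BACKBONES = {
    "MobileNetV2": applications.MobileNetV2,
    "InceptionV3": applications.InceptionV3,
    "VGG19": applications.VGG19,
    "VGG16": applications.VGG16,
    "ResNet50": applications.ResNet50,
}

def build_transfer_model(name, input_shape=(224, 224, 3), n_classes=2):
    """Frozen pre-trained backbone plus a small trainable softmax head."""
    base = BACKBONES[name](weights="imagenet", include_top=False,
                           input_shape=input_shape)
    base.trainable = False
    return tf.keras.Sequential([
        base,
        layers.GlobalAveragePooling2D(),
        layers.Dense(n_classes, activation="softmax"),
    ])
```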

CNN model

The CNN model has two convolution layers with ReLU activation, two pooling layers, two dropout layers, a Flatten layer, a Dense layer with ReLU activation, and a final Dense layer with softmax activation.
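
A Keras sketch of this baseline follows; the paper does not give filter counts or dropout rates, so those values are our assumptions.

```python
import tensorflow as tf
from tensorflow.keras import layers

cnn = tf.keras.Sequential([
    # two convolution layers with ReLU, each followed by pooling and dropout
    layers.Conv2D(32, (3, 3), activation="relu", input_shape=(224, 224, 3)),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.25),
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.25),
    # flatten, dense ReLU, then the softmax output layer
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(2, activation="softmax"),   # benign vs. malignant
])
```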

Fig. 6 CNN layers. 

Results and discussion

The training images are the data used, and the predicted classes are benign or malignant. To analyze all models, we calculated the following metrics:

  1. Training loss: computed from the errors accumulated over the training phase; a lower loss is better, except when the model over-fits.

  2. Accuracy: the percentage of correctly predicted data out of all predicted data in the training phase.

  3. Validation loss: estimated in the testing phase; computed from the errors accumulated during testing.

  4. Validation accuracy: also estimated during the testing phase; the ratio of correctly predicted new data (data not used in the training step) to all predicted data.
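
In Keras, these four quantities map one-to-one onto the History object returned by model.fit (the training call itself is sketched after Table 2); a minimal example, assuming the model was compiled with metrics=["accuracy"]:

```python
# `history` is the object returned by model.fit (see the training sketch
# after Table 2); each key holds one value per epoch.
training_loss = history.history["loss"]            # metric 1: training loss
train_accuracy = history.history["accuracy"]       # metric 2: training accuracy
validation_loss = history.history["val_loss"]      # metric 3: validation loss
validation_acc = history.history["val_accuracy"]   # metric 4: validation accuracy
```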

Our models are implemented with the parameters in the table below:

Table 2 Parameters used. 

Input shape     Learning rate   Epoch number   Batch size   Pooling layer
(224, 224, 3)   1e-5            50             64           Average function
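
Wiring the Table 2 parameters into training could look like the sketch below, reusing the loading and model-building sketches above. The optimizer is not named in the paper, so Adam is our assumption.

```python
import tensorflow as tf

results = {}
for name in BACKBONES:   # from the build_transfer_model sketch above
    model = build_transfer_model(name)
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),  # Table 2
        loss="categorical_crossentropy",
        metrics=["accuracy"],
    )
    # Batch size 64 was fixed when the datasets were loaded; with tf.data
    # datasets, fit() must not receive a batch_size argument.
    history = model.fit(train_ds, validation_data=test_ds, epochs=50)
    results[name] = history.history["val_accuracy"][-1]
```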

MobileNetV2 registered the following results: at epoch 50, the training accuracy is 0.92, while the validation accuracy stays constant at 0.54 from the first epoch until epoch 50 (Figure 7). The training loss is 0.67 at epoch 1 and 0.18 at epoch 50; the validation loss is 0.69 at the first epoch and 0.70 at epoch 50 (Figure 8).

Fig. 7 MobileNetV2 accuracy model. 

Fig. 8 MobileNetV2 loss model. 

InceptionV3 results

In the training phase the accuracy is 0.98, and in the testing phase the validation accuracy is 0.82 (Figure 9). The training loss is 0.048 and the validation loss is 0.82 (Figure 10).

Fig. 9 InceptionV3 accuracy model. 

Fig. 10 InceptionV3 loss model. 

VGG19 results

For the VGG19 model, the training accuracy is 0.92 and the validation accuracy is 0.84 (Figure 11). The training loss is 0.18 and the validation loss is 0.45 (Figure 12).

Fig. 11 VGG19 accuracy model. 

Fig. 12 VGG19 loss model. 

VGG16 results

The VGG16 model has a training accuracy of 0.89 and a validation accuracy of 0.82 (Figure 13). The training loss is 0.23 and the validation loss is 0.42 (Figure 14).

Fig. 13 VGG16 accuracy model. 

Fig. 14 VGG16 loss model. 

ResNet50 results

In the training phase, at epoch 50, the accuracy is 0.97, and in the testing phase the validation accuracy is 0.78 (Figure 15). The training loss is 0.04 and the validation loss is 0.72 (Figure 16).

Fig. 15 ResNet50 accuracy model. 

Fig. 16 ResNet50 loss model. 

CNN results

In the training phase the accuracy is 0.78 and the validation accuracy is 0.65 (Figure 17). The training loss is 0.61 and the validation loss is 0.25 (Figure 18).

Fig. 17 CNN accuracy model. 

Fig. 18 CNN loss model. 

Results analysis

After 50 epochs, the best training accuracies are obtained, respectively, by InceptionV3, ResNet50, VGG19, VGG16, MobileNetV2, and CNN. These results indicate that the transfer learning models InceptionV3, MobileNetV2, VGG19, and VGG16 outperform the CNN model on skin cancer classification. For validation accuracy, the transfer learning models InceptionV3, ResNet50, VGG19, and VGG16 also obtain better results than the CNN model, except MobileNetV2, whose validation accuracy is constant; MobileNetV2 has an under-fitting problem (Figure 19).

Fig. 19 Bar chart of model accuracy. 

For the training loss, the transfer learning models have the minimum error compared with the CNN model, which means the transfer learning models perform better than the CNN model. Although the CNN model has the minimum validation loss, its validation accuracy is lower than that of the transfer learning models (Figure 20).

Fig. 20 Bar chart of model loss. 
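
The comparison bar charts can be reproduced directly from the per-model numbers reported above; an illustrative matplotlib sketch for the accuracy chart (Figure 19):

```python
import matplotlib.pyplot as plt

models = ["InceptionV3", "ResNet50", "VGG19", "VGG16", "MobileNetV2", "CNN"]
train_acc = [0.98, 0.97, 0.92, 0.89, 0.92, 0.78]  # training accuracies above
val_acc = [0.82, 0.78, 0.84, 0.82, 0.54, 0.65]    # validation accuracies above

x = range(len(models))
plt.bar([i - 0.2 for i in x], train_acc, width=0.4, label="training accuracy")
plt.bar([i + 0.2 for i in x], val_acc, width=0.4, label="validation accuracy")
plt.xticks(list(x), models, rotation=30)
plt.ylabel("accuracy after 50 epochs")
plt.legend()
plt.tight_layout()
plt.show()
```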

The results obtained illustrate that the transfer learning models outperform the baseline. The highest accuracies are those of InceptionV3 (0.98), ResNet50 (0.97), VGG19 (0.92), and VGG16 (0.89), against 0.78 for the CNN model; MobileNetV2 under-fits. We proposed a transfer learning approach for skin cancer detection and tested the models MobileNetV2, InceptionV3, VGG19, VGG16, and ResNet50. This proposal has the following advantages:

  1. ResNet50 and InceptionV3 outperform the baseline CNN in accuracy.

  2. Optimized computation cost: a transfer learning model is already pre-trained, so it does not need the full training time of a CNN trained from scratch.

  3. Mitigates the problem of low contrast between skin lesions and normal skin.

  4. Helps distinguish melanoma lesions from normal skin.

  5. Differentiates in cases of variation in skin conditions, e.g., skin color.

  6. Facilitates early detection of skin cancer.

We can conclude that transfer learning is simple and effective for skin cancer image classification problems. Transfer learning remains a very important area for image recognition.

Conclusion

Malignant melanoma is the main cause of death from skin cancer. Although there are commonly used imaging and diagnostic techniques for skin cancer, such as dermoscopy, automatic recognition remains challenging because of the difficulty of segmenting micro-lesion areas and the similarities between melanoma and non-melanoma lesions. Our work offers a clear advantage to clinicians and patients in the rapid and early detection of this disease. The InceptionV3, ResNet50, VGG19, and VGG16 transfer learning models can detect skin cancer with accuracies of 0.98, 0.97, 0.92, and 0.89, which indicates the importance of transfer learning. In future work we will test other transfer learning methods; on the other hand, we will work on other medical image data.

References

Emre Celebi, M., Alp Aslandogan, Y., Stoecker, W. V., Iyatomi, H., Oka, H., & Chen, X. (2007). Unsupervised border detection in dermoscopy images. Skin Research and Technology, 13(4), 454-462.

Kenet, R. O., & Fitzpatrick, T. B. (1994). Reducing mortality and morbidity of cutaneous melanoma: A six year plan: B) Identifying high and low risk pigmented lesions using epiluminescence microscopy. The Journal of Dermatology, 21(11), 881-884.

Kadampur, M. A., & Al Riyaee, S. (2020). Skin cancer detection: Applying a deep learning based model driven architecture in the cloud for classifying dermal cell images. Informatics in Medicine Unlocked, 18, 100282.

Siegel, R. L., Miller, K. D., & Jemal, A. (2018). Cancer statistics, 2018. CA: A Cancer Journal for Clinicians, 68(1), 7-30.

West, J., Ventura, D., & Warnick, S. (2007). Spring research presentation: A theoretical foundation for inductive transfer. Brigham Young University, College of Physical and Mathematical Sciences, 1(08).

Basque, J. (2004). Le transfert d'apprentissage: qu'en disent les contextualistes. Le transfert des apprentissages: Comprendre pour mieux intervenir, 49-76.

Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., & Chen, L. C. (2018). MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4510-4520).

Wang, B., Yao, Y., Viswanath, B., Zheng, H., & Zhao, B. Y. (2018). With great training comes great vulnerability: Practical attacks against transfer learning. In 27th USENIX Security Symposium (USENIX Security 18) (pp. 1281-1297).

Carvalho, T., De Rezende, E. R., Alves, M. T., Balieiro, F. K., & Sovat, R. B. (2017, December). Exposing computer generated images by eye's region classification via transfer learning of VGG19 CNN. In 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA) (pp. 866-870). IEEE.

Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.

He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770-778).

Adegun, A. A., & Viriri, S. (2020). FCN-based DenseNet framework for automated detection and classification of skin lesions in dermoscopy images. IEEE Access, 8, 150377-150396.

Ichim, L., & Popescu, D. (2020). Melanoma detection using an objective system based on multiple connected neural networks. IEEE Access, 8, 179189-179202.

Pollastri, F., Bolelli, F., Paredes, R., & Grana, C. (2020). Augmenting data with GANs to segment melanoma skin lesions. Multimedia Tools and Applications, 79(21), 15575-15592.

Mazoure, B., Mazoure, A., Bédard, J., & Makarenkov, V. (2022). DUNEScan: a web server for uncertainty estimation in skin cancer detection with deep neural networks. Scientific Reports, 12(1), 1-10.

Fu, Z., An, J., Yang, Q., Yuan, H., Sun, Y., & Ebrahimian, H. (2022). Skin cancer detection using Kernel Fuzzy C-means and Developed Red Fox Optimization algorithm. Biomedical Signal Processing and Control, 71, 103160.

Reis, H. C., Turk, V., Khoshelham, K., & Kaya, S. (2022). InSiNet: a deep convolutional approach to skin cancer detection and segmentation. Medical & Biological Engineering & Computing, 1-20.

Agrahari, P., Agrawal, A., & Subhashini, N. (2022). Skin cancer detection using deep learning. In Futuristic Communication and Network Technologies (pp. 179-190). Springer, Singapore.

Lakshminarayanan, A. R., Bhuvaneshwari, R., Bhuvaneshwari, S., Parthasarathy, S., Jeganathan, S., & Sagayam, K. M. (2022). Skin cancer prediction using machine learning algorithms. In Artificial Intelligence and Technologies (pp. 303-310). Springer, Singapore.

Sun, X., Zhang, N., Yin, C., Zhu, B., & Li, X. (2020). Ultraviolet radiation and melanomagenesis: from mechanism to immunotherapy. Frontiers in Oncology, 10, 951.

Ayana, G., Park, J., Jeong, J. W., & Choe, S. W. (2022). A novel multistage transfer learning for ultrasound breast cancer image classification. Diagnostics, 12(1), 135.

Showkat, S., & Qureshi, S. (2022). Determining the efficacy of transfer learning-based ResNet models in chest X-ray image classification for detecting Covid-19 pneumonia. Chemometrics and Intelligent Laboratory Systems, 104534.

Falconí, L. G., Pérez, M., & Aguilar, W. G. (2019, June). Transfer learning in breast mammogram abnormalities classification with MobileNet and NasNet. In 2019 International Conference on Systems, Signals and Image Processing (IWSSIP) (pp. 109-114). IEEE.

Wang, C., Chen, D., Hao, L., Liu, X., Zeng, Y., Chen, J., & Zhang, G. (2019). Pulmonary image classification based on Inception-v3 transfer learning model. IEEE Access, 7, 146533-146541.

Tammina, S. (2019). Transfer learning using VGG-16 with deep convolutional neural network for classifying images. International Journal of Scientific and Research Publications (IJSRP), 9(10), 143-150.

Reddy, A. S. B., & Juliet, D. S. (2019, April). Transfer learning with ResNet-50 for malaria cell-image classification. In 2019 International Conference on Communication and Signal Processing (ICCSP) (pp. 0945-0949). IEEE.

Naoui, M. A., Lejdel, B., Ayad, M., & Amamra, A. (2020). Using a distributed deep learning algorithm for analyzing big data in smart cities. Smart and Sustainable Built Environment.

Naoui, M. A., Lejdel, B., Ayad, M., & Belkeiri, R. (2020). Integrating deep learning, social networks, and big data for healthcare system. Bio-Algorithms and Med-Systems, 16(1).

Naoui, M. A., Lejdel, B., & Ayad, M. (2020). Using K-means algorithm for regression curve in big data system for business environment. Revista Cubana de Ciencias Informáticas, 14(2), 34-48.

Naoui, M. A., Brahim, L., & Ayad, M. (2020). Integrating IoT devices and deep learning for renewable energy in big data system. UPB Scientific Bulletin, Series C: Electrical Engineering, 82, 251-266.

Bisong, E. (2019). Building machine learning and deep learning models on Google Cloud Platform (pp. 59-64). Berkeley, CA: Apress.

Received: August 21, 2022; Accepted: September 01, 2022

*Corresponding author: manouarn@yahoo.com

The author declares no conflict of interest regarding this work.

No funding was received for this work.

Creative Commons License