Diagnosis and classification of cancer using hybrid model based on ReliefF and convolutional neural network

dc.authorid0000-0002-3752-7354
dc.authorid0000-0002-1488-1502
dc.contributor.authorKılıçarslan, Serhat
dc.contributor.authorAdem, Kemal
dc.contributor.authorÇelik, Mete
dc.date.accessioned2020-02-10T11:54:20Z
dc.date.available2020-02-10T11:54:20Z
dc.date.issued2020
dc.departmentİktisadi ve İdari Bilimler Fakültesi
dc.descriptionAdem, Kemal ( Aksaray, Yazar )
dc.description.abstractMachine learning and deep learning methods aims to discover patterns out of datasets such as, microarray data and medical data. In recent years, the importance of producing microarray data from tissue and cell samples and analyzing these microarray data has increased. Machine learning and deep learning methods have been started to use in the diagnosis and classification of microarray data of cancer diseases. However, it is challenging to analyze microarray data due to the small number of sample size and high number of features of microarray data and in some cases some features may not be relevant with the classification. Because of this reason, studies in the literature focused on developing feature selection/dimension reduction techniques and classification algorithms to improve classification accuracy of the microarray data. This study proposes hybrid methods by using Relief and stacked autoencoder approaches for dimension reduction and support vector machines (SVM) and convolutional neural networks (CNN) for classification. In the study, three microarray datasets of Overian, Leukemia and Central Nervous System (CNS) were used. Ovarian dataset contains 253 samples, 15,154 genes and 2 classes, Leukemia dataset contains 72 samples, 7129 genes, and 2 classes and CNS dataset contains 60 samples, 7129 genes and 2 classes. Among the methods applied to the three microarray data, the best classification accuracy without dimension reduction was observed with SVM as 96.14% for ovarian dataset, 94.83% for leukemia dataset and 65% for CNS dataset. The proposed hybrid method ReliefF + CNN method outperformed other approaches. It gave 98.6%, 99.86% and 83.95% classification accuracy for the datasets of ovarian, leukemia, and CNS datasets, respectively. Results shows that dimension reduction methods improved the classification accuracy of the methods of SVM and CNN.
dc.identifier.endpage-en_US
dc.identifier.issue-en_US
dc.identifier.pmid31991364
dc.identifier.scopusqualityQ1
dc.identifier.startpage-en_US
dc.identifier.urihttps://hdl.handle.net/20.500.12451/7129
dc.identifier.volume137en_US
dc.identifier.wosWOS:000521112300031
dc.identifier.wosqualityQ4
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.indekslendigikaynakPubMed
dc.language.isoen
dc.publisherChurchill Livingstone
dc.relation.ispartofMedical Hypotheses
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectCNN
dc.subjectDeep Learning
dc.subjectMachine Learning
dc.subjectMicroarray
dc.subjectReliefF
dc.subjectSAE
dc.titleDiagnosis and classification of cancer using hybrid model based on ReliefF and convolutional neural network
dc.typeArticle

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
[ X ]
İsim:
kiliçarslan-serhat-2020.pdf
Boyut:
952.45 KB
Biçim:
Adobe Portable Document Format
Açıklama:
Tam Metin / Full Text
Lisans paketi
Listeleniyor 1 - 1 / 1
[ X ]
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: