Analysis and Prediction of Breast Cancer using AzureML Platform

2019 
Nowadays, healthcare sector starts relying on the datasets that are collected by clinics or some organizations to help doctors in predicting and analyzing the patient’s status in early stage. There are many dangerous diseases around the world that people suffer from them, but one of the most dangerous diseases is cancer. Recent research shows that about 12% US women over the course of their life, develop invasive breast cancer. Thus, in this case, the breast cancer (BC) is categorized as a dangerous type among all cancer types. This study focuses on BC by using a well-known dataset titled Breast Cancer Wisconsin (Diagnostic) Data Set. It has 32 attributes and 569 instances. Some of those attributes have missing values and others are not necessary for our work. So, we removed the ID column and any instance that has a missing value. Our aims in this research is analyzing BC dataset and understand its features. Then, we upload it to Microsoft Azure machine learning (AzureML) platform for building our model. We use two classes Decision Jungle and two Classes Decision machine learning algorithms to predicate whether the patient diagnose is Benign or Malignant. We assess the performance of each algorithms in terms of different measures like Accuracy, Precision, Recall, F1 and AUC. The results of our study in this paper show that the accuracy of Decision Jungle is approximately 97%. On the other hand, the accuracy of Decision tree is approximately 95%.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    9
    References
    1
    Citations
    NaN
    KQI
    []