Juxtaposition on Classifiers in Modeling Hepatitis Diagnosis Data

2020 
Machine Learning and Data Mining have been used extensively in the field of medical science. Approximately 2% of the world population, i.e., 3.9 million people are infected by Hepatitis C. This paper is an investigative study on the comparison of classification models—Support Vector Machine, Random Forest Classifier, Decision Tree Classifier, Logistic Regression, and Naive Bayes Classifier—modeling Hepatitis C Data based on various performance measures—Accuracy, Balanced Accuracy, Precision, Recall, F1-Measure, Matthews Correlation Coefficient and many more using R Programming Language. On normalizing the numerical attributes using Z-score Normalization and using the holdout method for the Train Test data split of 80–20%, the result shows that Random Forest outperforms the other classifiers with an accuracy of 90.7%, followed by Support Vector Machine, Logistic Regression, Decision Tree Classifier, and Naive Bayes Classifier.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    18
    References
    0
    Citations
    NaN
    KQI
    []