Hipotiroidi Hastalığı Teşhisinde Sınıflandırma Algoritmalarının Kullanımı

2020 
Disease diagnosis is one of the most important problems encountered in the medical field. Different types of a specific disease and similar symptoms with other diseases make the disease harder to diagnose. For these reasons Hypothyroidism, which is one of the types of thyroid disease, is a disease that decreases patient's quality of life due to the delay in its diagnosis. The purpose of this article is to propose a data mining-based system that will increase the correct diagnosis of hypothyroidism rate by using the question asked to the patients during the diagnosis process, and the test results applied. The other aim is to reduce the complications that may arise from interventional tests used indirectly for diagnosis. For these purposes, it was estimated whether new samples were hypothyroidism by using a data set consisting of 3163 samples in the UCI machine learning database, 151 of which were hypothyroid and the rest without hypothyroidism. In order to deal with the imbalanced class distribution in the data, different sampling techniques were applied to the data set and models to diagnose hypothyroidism with Logistic Regression, K Nearest Neighbor, and Support Vector Machine classifiers were created. With this aspect, the study demonstrated the effect of sampling methods on the diagnosis of hypothyroid disease. Among the developed models, the Logistics Regression classifier, which was trained with the data set applied to the oversampling techniques, gave the highest performance. The best results obtained with this classifier are 97.8% for accuracy rate, 82.26% for F-Score value, 93.2% for area under the curve and 81.8% for Matthews correlation coefficient.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    2
    Citations
    NaN
    KQI
    []