HEART DISEASE PREDICTION USING MACHINE LEARNING
1 Citation · 4 References · 10 Related Papers
Abstract:
Heart disease is a major cause of death worldwide, making early diagnosis and prevention essential. Predictive models have gained significant attention in recent years, with several algorithms being employed to develop them. However, implementing heart disease prediction models poses challenges, including data quality, model accuracy, ethical concerns, and limited data. This project therefore aims to develop a heart disease prediction model and analyse the different algorithms used in disease prediction. To increase predictive accuracy, this study compares six machine learning algorithms: KNN (K-Nearest Neighbour), Decision Tree, Random Forest, Support Vector Machines, Logistic Regression, and Neural Network. Thirteen attributes, including age, sex, and cholesterol, are used, and ensemble methods such as boosting and bagging are applied. The accuracy, recall, F1 score, and precision of each algorithm are calculated to determine the most accurate model. By developing and analysing heart disease prediction models, this study also identifies their limitations and the implications for patient diagnosis and treatment. In conclusion, while heart disease prediction models have the potential to be financially feasible and useful in the future, their current limitations and challenges mean that they cannot be relied upon as the sole means of diagnosis or treatment decisions. Key Words: Heart Diseases, Machine Learning Algorithms, Logistic Regression, Random Forest, Decision Tree.
Topics: Predictive modelling, Boosting, Ensemble Learning
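The six-algorithm comparison described above can be sketched with scikit-learn. The synthetic 13-feature dataset, the train/test split, and all hyperparameters below are assumptions of this example, not the study's actual setup.

```python
# Illustrative sketch: comparing the six classifier families named in the
# abstract on a synthetic 13-attribute dataset (a stand-in for the real
# heart disease data, which is an assumption of this example).
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# 13 attributes, mirroring the age/sex/cholesterol-style feature count.
X, y = make_classification(n_samples=500, n_features=13, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)

models = {
    "KNN": KNeighborsClassifier(),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(random_state=0),
    "SVM": SVC(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Neural Network": MLPClassifier(max_iter=1000, random_state=0),
}

# Score each model on the four metrics the abstract names.
scores = {}
for name, model in models.items():
    model.fit(X_tr, y_tr)
    pred = model.predict(X_te)
    scores[name] = {
        "accuracy": accuracy_score(y_te, pred),
        "precision": precision_score(y_te, pred),
        "recall": recall_score(y_te, pred),
        "f1": f1_score(y_te, pred),
    }

for name, s in scores.items():
    print(f"{name}: acc={s['accuracy']:.3f} f1={s['f1']:.3f}")
```

The most accurate model would then be selected by comparing the four metrics per algorithm.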
This research aimed to predict smartphone prices using two supervised machine learning algorithms: Decision Tree and Random Forest Regression. Data was collected from the Indian e-commerce website Flipkart using Python libraries such as Beautiful Soup and Selenium, and was cleaned and pre-processed for analysis. The results showed that the Decision Tree algorithm achieved an R^2 of 89.3%, while the Random Forest Regression model achieved an R^2 of 82.8%. The study offers a method for accurately predicting smartphone prices that could help manufacturers determine the cost of their products and ultimately benefit the entire smartphone market. Key Words: Smartphone, Price Prediction, Machine Learning, Decision Tree, Random Forest Regression.
Topics: Python, Supervised Learning
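The regression setup above can be sketched as follows; a synthetic regression dataset stands in for the scraped Flipkart data, which is an assumption of this example.

```python
# Hypothetical sketch: Decision Tree and Random Forest regression scored
# with R^2, as in the price-prediction abstract. The dataset is synthetic.
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import r2_score

X, y = make_regression(n_samples=400, n_features=8, noise=10.0, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

r2 = {}
for name, model in [("Decision Tree", DecisionTreeRegressor(random_state=0)),
                    ("Random Forest", RandomForestRegressor(random_state=0))]:
    model.fit(X_tr, y_tr)
    r2[name] = r2_score(y_te, model.predict(X_te))
    print(f"{name}: R^2 = {r2[name]:.3f}")
```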
In this study, a breast cancer prediction model is proposed using a decision tree and adaptive boosting (AdaBoost), and an extensive experimental evaluation of its predictive performance is conducted. The study uses a breast cancer dataset collected from the Kaggle data repository. The dataset consists of 569 observations, of which 212 (37.25%) are benign (breast cancer negative) and 62.74% are malignant (breast cancer positive). This class distribution shows that the dataset is highly imbalanced, so a learning algorithm such as a decision tree is biased toward the benign observations and performs poorly when predicting the malignant ones. To improve the decision tree's performance on the malignant observations, a boosting algorithm, adaptive boosting, is employed. Finally, the predictive performance of the decision tree and adaptive boosting is analysed. The analysis on the Kaggle breast cancer dataset shows that adaptive boosting achieves 92.53% accuracy versus 88.80% for the decision tree; overall, the AdaBoost algorithm performed better than the decision tree.
Topics: Boosting, AdaBoost, Gradient boosting, Decision tree model, Alternating decision tree, Tree (set theory)
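A minimal sketch of the comparison in the abstract: a plain decision tree versus AdaBoost on the scikit-learn copy of the Wisconsin breast cancer data, assumed here as a stand-in for the Kaggle repository version; the split and estimator count are also assumptions.

```python
# Compare a single decision tree against AdaBoost on the breast cancer data.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import AdaBoostClassifier
from sklearn.metrics import accuracy_score

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

tree = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)
ada = AdaBoostClassifier(n_estimators=100, random_state=0).fit(X_tr, y_tr)

tree_acc = accuracy_score(y_te, tree.predict(X_te))
ada_acc = accuracy_score(y_te, ada.predict(X_te))
print(f"decision tree: {tree_acc:.3f}  adaboost: {ada_acc:.3f}")
```

AdaBoost reweights training examples after each round, so later trees concentrate on the observations (here, the minority class) that earlier trees misclassified.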
Signal prediction has attracted increasing attention from the data mining and machine learning communities. A decision stump is a one-level decision tree that classifies instances by sorting them based on feature values. Boosting is a powerful ensemble method that can significantly improve prediction performance. In this paper, boosting and the decision stump algorithm are combined to analyse and predict signal data. An experimental evaluation on a public signal dataset shows that the boosting and decision-stump-based algorithm clearly improves signal prediction performance.
Topics: Boosting, Gradient boosting, SIGNAL (programming language)
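The decision stump the abstract describes, a one-level tree that thresholds a single feature, can be written in a few lines of plain Python. The tiny dataset below is invented purely for demonstration.

```python
# Illustrative pure-Python decision stump: exhaustively search for the
# (feature, threshold, label assignment) with the fewest training errors.
def fit_stump(X, y):
    """Return (feature, threshold, left_label, right_label) minimising errors."""
    best, best_err = None, float("inf")
    for f in range(len(X[0])):
        for thresh in sorted({row[f] for row in X}):
            for left, right in ((0, 1), (1, 0)):
                preds = [left if row[f] <= thresh else right for row in X]
                err = sum(p != t for p, t in zip(preds, y))
                if err < best_err:
                    best_err, best = err, (f, thresh, left, right)
    return best

def predict_stump(stump, row):
    f, thresh, left, right = stump
    return left if row[f] <= thresh else right

# Invented toy data: feature 0 separates the classes at a threshold of 3.0.
X = [[1.0, 5.0], [2.0, 4.0], [3.0, 7.0], [8.0, 1.0], [9.0, 2.0]]
y = [0, 0, 0, 1, 1]
stump = fit_stump(X, y)
print([predict_stump(stump, row) for row in X])  # → [0, 0, 0, 1, 1]
```

Boosting then combines many such weak stumps, reweighting the data between rounds, which is what gives the combined model its improved prediction performance.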
Ensemble learning techniques have achieved state-of-the-art performance in diverse machine learning applications by combining the predictions from two or more base models. This paper presents a concise overview of ensemble learning, covering the three main ensemble methods (bagging, boosting, and stacking) from their early development to the recent state-of-the-art algorithms. The study focuses on the widely used ensemble algorithms, including random forest, adaptive boosting (AdaBoost), gradient boosting, extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), and categorical boosting (CatBoost). An attempt is made to concisely cover their mathematical and algorithmic representations, which is lacking in the existing literature and would be beneficial to machine learning researchers and practitioners.
Topics: Boosting, Gradient boosting, AdaBoost, Ensemble Learning, Categorical variable
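Gradient boosting, one of the algorithm families surveyed above, can be sketched with scikit-learn's implementation; the dataset and hyperparameters below are assumptions of this example.

```python
# Minimal gradient boosting sketch: shallow trees are fitted sequentially,
# each one correcting the errors of the ensemble built so far.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score

X, y = make_classification(n_samples=600, n_features=20, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=1)

gbm = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1,
                                 max_depth=3, random_state=1)
gbm.fit(X_tr, y_tr)
acc = accuracy_score(y_te, gbm.predict(X_te))
print(f"gradient boosting accuracy: {acc:.3f}")
```

XGBoost, LightGBM, and CatBoost implement the same additive-trees idea with different regularisation, histogram, and categorical-feature tricks.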
Ensembles play a major role in machine learning: they can improve on the performance of a single model by combining two or more models, often producing promising results. There are several ensemble techniques, such as bagging, boosting, and stacking, each of which performs in its own way and produces its own results. In this work, the different ensembling techniques are explored and tested on a sample dataset. The results vary in performance across the chosen data points. Keywords: ensemble, stacking, boosting, bagging, ensemble learners
Topics: Boosting, Ensemble Learning, Ensemble forecasting
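The three ensembling styles the abstract compares can be sketched side by side; the synthetic dataset and the particular base learners chosen below are assumptions of this example.

```python
# Bagging, boosting, and stacking compared with 5-fold cross-validation.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.ensemble import (BaggingClassifier, AdaBoostClassifier,
                              StackingClassifier)
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=400, n_features=10, random_state=2)

ensembles = {
    "bagging": BaggingClassifier(random_state=2),
    "boosting": AdaBoostClassifier(random_state=2),
    # Stacking: a meta-learner combines the base models' predictions.
    "stacking": StackingClassifier(
        estimators=[("tree", DecisionTreeClassifier(random_state=2)),
                    ("knn", KNeighborsClassifier())],
        final_estimator=LogisticRegression()),
}

results = {name: cross_val_score(model, X, y, cv=5).mean()
           for name, model in ensembles.items()}
print(results)
```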
Ensemble learning is a popular classification method where many individual simple learners contribute to a final prediction. Constructing an ensemble of learners has been shown to often improve prediction accuracy over a single learner. Bagging and boosting are the most common ensemble methods, each with distinct advantages. While boosting methods are typically very tunable with numerous parameters, to date, the type of flexibility this allows has been missing for general bagging ensembles. In this paper, we propose a new tunable weighted bagged ensemble methodology, resulting in a very flexible method for classification. We explore the impact tunable weighting has on the votes of each learner in an ensemble and compare the results with pure bagging and the best known bagged ensemble method, namely, the random forest.
Topics: Boosting, Ensemble Learning, Ensemble forecasting, Bootstrap aggregating
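One way such tunable weighted voting could look, not the paper's exact method, is to weight each bagged learner's vote by its out-of-bag accuracy raised to a tunable exponent; with the exponent at zero, every vote counts equally and pure bagging is recovered. Everything below is an illustrative assumption.

```python
# Illustrative weighted bagging: bootstrap-trained trees vote with weights
# derived from their own out-of-bag accuracy, tuned by the exponent alpha.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=300, n_features=10, random_state=3)

def weighted_bagging_predict(X_train, y_train, X_test, n_learners=25, alpha=2.0):
    votes = np.zeros((len(X_test), 2))
    n = len(X_train)
    for _ in range(n_learners):
        idx = rng.integers(0, n, size=n)          # bootstrap sample
        oob = np.setdiff1d(np.arange(n), idx)     # out-of-bag rows
        tree = DecisionTreeClassifier().fit(X_train[idx], y_train[idx])
        w = 1.0
        if len(oob) > 0:
            w = tree.score(X_train[oob], y_train[oob]) ** alpha
        for i, p in enumerate(tree.predict(X_test)):
            votes[i, p] += w                      # weighted vote
    return votes.argmax(axis=1)

pred = weighted_bagging_predict(X[:200], y[:200], X[200:])
print("weighted-bagging accuracy:", (pred == y[200:]).mean())
```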
Using a decision support system (DSS) that classifies various cancers helps clinicians and researchers make better decisions that can aid early cancer diagnosis, thereby reducing the chance of incorrect diagnosis. This work therefore aimed at designing a classification model that can accurately predict 5 different cancer types across 20 cancer exomes, using the mutations identified from whole-exome cancer analysis. Initially, a basic model was designed using supervised machine learning classification algorithms such as K-nearest neighbour (KNN), support vector machine (SVM), decision tree, naïve Bayes, and random forest (RF), among which decision tree and random forest performed better in terms of preliminary model accuracy. However, output predictions were incorrect due to low training scores, so 16 essential features were selected for model improvement using 2 approaches, and all imbalanced datasets were balanced using SMOTE. In the first approach, all features from the 20 cancer exome datasets were trained and models were designed using decision tree and random forest. On the balanced datasets, the decision tree model showed an accuracy of 77%, while the RF model improved to 82% and predicted all 5 cancer types correctly. The area under the curve for the RF model was closer to 1 than for the decision tree model. In the second approach, 15 datasets were used for training and 5 for testing; however, only 2 cancer types were predicted correctly. To cross-validate the RF model, a Matthews correlation coefficient (MCC) test was performed. For the first approach, the MCC test and MCC cross-validation scores were 0.7796 and 0.9356 respectively; for the second approach, the MCC was 0.9365, corroborating the accuracy of the designed model. The model was successfully deployed using Streamlit as a web application for easy use. This study presents insights that allow easy cancer classification.
Topics: Tree (set theory)
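The evaluation pipeline described above can be sketched with scikit-learn. The real study balances classes with SMOTE (from the third-party imbalanced-learn package); here `class_weight="balanced"` stands in so the example needs only scikit-learn, and the imbalanced 5-class synthetic dataset is an assumption standing in for the cancer exome data.

```python
# Simplified sketch: random forest on imbalanced multi-class data,
# validated with the Matthews correlation coefficient (MCC).
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import matthews_corrcoef

# Imbalanced data with 5 classes, standing in for the 5 cancer types.
X, y = make_classification(n_samples=600, n_features=16, n_informative=8,
                           n_classes=5, weights=[0.4, 0.3, 0.15, 0.1, 0.05],
                           random_state=4)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=4)

rf = RandomForestClassifier(class_weight="balanced", random_state=4)
rf.fit(X_tr, y_tr)
mcc = matthews_corrcoef(y_te, rf.predict(X_te))
print(f"MCC: {mcc:.3f}")
```

MCC ranges from -1 to 1 and, unlike plain accuracy, stays informative when the classes are heavily imbalanced, which is why the study uses it for validation.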
Ensemble learning is a popular and intensively studied field in machine learning and pattern recognition for increasing classification performance. Random Forest is important for giving fast and effective results; Rotation Forest, on the other hand, can achieve better performance than Random Forest. In this study, we present a meta-ensemble classifier, called Random Rotation Forest, that combines the advantages of the two classifiers (Rotation Forest and Random Forest). In the experimental studies, we use three base learners (J48, REPTree, and Random Forest) and two meta-learners (Bagging and Rotation Forest) for ensemble classification on five datasets from the UCI Machine Learning Repository. The experimental results indicate that Random Rotation Forest gives promising results compared with the base learners and bagging ensemble approaches in terms of accuracy, AUC, precision, and recall. Our method can be used for image/pattern recognition and machine learning problems.
Topics: Ensemble Learning, C4.5 algorithm, Bootstrap aggregating
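The rotation idea behind Rotation Forest can be sketched in simplified form: each tree trains on features rotated by a PCA fitted to its own bootstrap sample, and the trees then vote. This is an illustrative simplification, not the exact Rotation Forest algorithm (which applies PCA to disjoint feature subsets), and the dataset choice is an assumption.

```python
# Simplified rotation-forest-style ensemble: per-learner PCA rotation
# plus bootstrap sampling, followed by majority voting.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(5)
X, y = load_breast_cancer(return_X_y=True)
perm = rng.permutation(len(X))              # shuffle before splitting
X, y = X[perm], y[perm]
X_tr, y_tr, X_te, y_te = X[:450], y[:450], X[450:], y[450:]

learners = []
for _ in range(15):
    idx = rng.integers(0, len(X_tr), size=len(X_tr))  # bootstrap sample
    pca = PCA().fit(X_tr[idx])                        # per-learner rotation
    tree = DecisionTreeClassifier().fit(pca.transform(X_tr[idx]), y_tr[idx])
    learners.append((pca, tree))

# Average the trees' 0/1 votes in their rotated spaces, take the majority.
votes = np.mean([tree.predict(pca.transform(X_te)) for pca, tree in learners],
                axis=0)
pred = (votes >= 0.5).astype(int)
print("rotation-ensemble accuracy:", (pred == y_te).mean())
```

Rotating the feature space gives each tree a different view of the data, increasing diversity among the learners, which is the property the abstract credits for Rotation Forest's edge over plain bagging.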