Machine learning and artificial intelligence have been found useful in various disciplines during the course of their development, especially in the enormous increasing data in recent years. It can be more reliable for making better and faster decisions for disease predictions. So, machine learning algorithms are increasingly finding their application to predict various diseases. Constructing a model can also help us visualize and analyze diseases to improve reporting consistency and accuracy. This article has investigated how to detect heart disease by applying various machine learning algorithms. The study in this article has shown a two-step process. The heart disease dataset is first prepared into a required format for running through machine learning algorithms. Medical records and other information about patients are gathered from the UCI repository. The heart disease dataset is then used to determine whether or not the patients have heart disease. Secondly, Many valuable results are shown in this article. The accuracy rate of the machine learning algorithms, such as Logistic Regression, Support vector machine, K-Nearest-Neighbors, Random Forest, and Gradient Boosting Classifier, are validated through the confusion matrix. Current findings suggest that the Logistic Regression algorithm gives a high accuracy rate of 95% compared to other algorithms. It also shows high accuracy for f1-score, recall, and precision than the other four different algorithms. However, increasing the accuracy rates to approximately 97% to 100% of the machine learning algorithms is the future study and challenging part of this research.
Published in | American Journal of Computer Science and Technology (Volume 5, Issue 3) |
DOI | 10.11648/j.ajcst.20220503.11 |
Page(s) | 146-154 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2022. Published by Science Publishing Group |
Machine Learning, Artificial Intelligence, Heart Disease, Linear Regression, Support Vector Machine, K-Nearest-Neighbors, Random Forest, Decision Tree, Gradient Boosting
[1] | Wikipedia contributors. (2022, June 22). Machine learning. In Wikipedia, The Free Encyclopedia. Retrieved 06:31, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Machine_learning&oldid=1094363111. |
[2] | Victor Chang, Vallabhanent Rupa Bhavani, Ariel Qianwen Xu, MA Hossain. An artificial intellegence model for heart disease detection using machine learning. Healthcare Analytics, volume 2, November 2022, 100016. https://doi.org/10.1016/j.health.2022.100016. |
[3] | Ghumbre, S. U., & Ghatol, A. A. (2012). Heart disease diagnosis using machine learning algorithm. In Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012 (pp. 217-225). Springer, Berlin, Heidelberg. |
[4] | Rohit Bharti, Aditya Khamparia, Mohammed Shabaz, Gaurav Dhiman, Sagar pande, and Parneet Singh. Prediction of Heart Disease Using a combination of Machine Learning and Deep learning. Hindawi Computational Intelligence and Neuroscience, Volume 2021, Article ID 8387680, 11 pages. https://doi.org/10.1155/2021/8387680. |
[5] | Khaled Mohamed Almustafa. Prediction of heart disease and classifiers sensitivity analysis. Almustafa BMC Bioinfirmatics (2020) 21: 278. https://doi.org/10.1186/s12859-020-03626-y. |
[6] | Jee S H, Jang Y, Oh D J, Oh B H, Lee S H, Park S W & Yun Y D (2014), A coronary heart disease prediction model. The Korean Heart Study. BMJ open, 4 (5), e005025. |
[7] | Mai Shouman, Tim Turner, and Rob Stocker. Applying k-Nearest Neighbour in diagnosis heart disease patients.. International Journal of Information and Education Technology, vol. 2, No. 3, June 2012. |
[8] | Ganna A, Magnusson P K, Pedersen N L, de Faire U, Reilly M, Arnlov J & Ingelsson E (2013). Multilocus genetic risk scores for coronary heart disease prediction. Arteriosclerosis, thrombosis, and vascular biology, 33 (9), 2267-72. |
[9] | Raihan M, Mondal S, More A, Sagor M O F, Sikder G, Majumder M A & Ghosh K (2016, December). Smartphone based ischeme heart disease (heart attact) risk prediction using clinical data and data mining approaches, a prototype design. 19th International conference on Computer and Information Technology (ICCIT) (pp. 299-303). IEEE. |
[10] | Acharya U R, Fujita H, Oh S L, Hagiwara Y, Tan J H & Adam M (2017). Application of deep convolutional neural network for automated detection of myocardial infarction using ECG signals. Information Sciences, 415, 190-8. |
[11] | Takci H (2018). Improvement of heart attack prediction by the feature selection methods. Turkish Journal of Electrical Engineering & Computer Sciences, 26 (1), 1-10. |
[12] | Brown N, Young T, Gray D, Skene A M & Hampton J R (1997). Inpatient deaths from acute myocardial infarction, 1982-92: analysis of data in the Nottingham heart attack register, BMJ, 315 (7101), 159-64. |
[13] | Soni J, Ansari U, Sharman D & Soni S (2011). Predictive data mining for medical diagnosis: an overview of heart disease prediction. International Journal of Computer Applications, 17 (8), 43-8. |
[14] | Bashir S, Qamar U & Javed M Y (2014, November). An ensemble-based decision support framework for intelligent heart disease diagnosis. International Conference on Information Sociaty (i-Sociaty 2014) (pp. 259-64). IEEE. |
[15] | Ordonez C (2006). Associate rule discovery with the train and test approach for heart disease prediction. IEEE Transaction on Information Technology in Biomedicine, 10 (2), 334-43. |
[16] | Wikipedia contributors. (2022, June 21). Logistic regression. In Wikipedia, The Free Encyclopedia. Retrieved 06:36, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Logistic_regression&oldid=1094256072. |
[17] | Wikipedia contributors. (2022, June 1). Linear regression. In Wikipedia, The Free Encyclopedia. Retrieved 06:39, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Linear_regression&oldid=1091044459. |
[18] | Wikipedia contributors. (2022, June 4). K-nearest neighbors algorithm. In Wikipedia, The Free Encyclopedia. Retrieved 06:40, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=K-nearest_neighbors_algorithm&oldid=1091525121. |
[19] | Wikipedia contributors. (2022, June 20). Random forest. In Wikipedia, The Free Encyclopedia. Retrieved 06:41, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Random_forest&oldid=1094130824. |
[20] | Wikipedia contributors. (2022, June 15). Decision tree learning. In Wikipedia, The Free Encyclopedia. Retrieved 06:42, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Decision_tree_learning&oldid=1093316444. |
[21] | Wikipedia contributors. (2022, June 24). Gradient boosting. In Wikipedia, The Free Encyclopedia. Retrieved 06:43, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Gradient_boosting&oldid=1094845596. |
[22] | nikki2398. (02 Sep, 2020). ML–Gradient Boosting. https://www.geeksforgeeks.org/ml-gradient-boosting/. |
[23] | Wikipedia contributors. (2022, June 20). Support-vector machine. In Wikipedia, The Free Encyclopedia. Retrieved 06:51, June 26, 2022, from https://en.wikipedia.org/w/index.php?title=Support-vector_machine&oldid=1094109362. |
APA Style
Mohammed Khalid Hossen. (2022). Heart Disease Prediction Using Machine Learning Techniques. American Journal of Computer Science and Technology, 5(3), 146-154. https://doi.org/10.11648/j.ajcst.20220503.11
ACS Style
Mohammed Khalid Hossen. Heart Disease Prediction Using Machine Learning Techniques. Am. J. Comput. Sci. Technol. 2022, 5(3), 146-154. doi: 10.11648/j.ajcst.20220503.11
@article{10.11648/j.ajcst.20220503.11, author = {Mohammed Khalid Hossen}, title = {Heart Disease Prediction Using Machine Learning Techniques}, journal = {American Journal of Computer Science and Technology}, volume = {5}, number = {3}, pages = {146-154}, doi = {10.11648/j.ajcst.20220503.11}, url = {https://doi.org/10.11648/j.ajcst.20220503.11}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajcst.20220503.11}, abstract = {Machine learning and artificial intelligence have been found useful in various disciplines during the course of their development, especially in the enormous increasing data in recent years. It can be more reliable for making better and faster decisions for disease predictions. So, machine learning algorithms are increasingly finding their application to predict various diseases. Constructing a model can also help us visualize and analyze diseases to improve reporting consistency and accuracy. This article has investigated how to detect heart disease by applying various machine learning algorithms. The study in this article has shown a two-step process. The heart disease dataset is first prepared into a required format for running through machine learning algorithms. Medical records and other information about patients are gathered from the UCI repository. The heart disease dataset is then used to determine whether or not the patients have heart disease. Secondly, Many valuable results are shown in this article. The accuracy rate of the machine learning algorithms, such as Logistic Regression, Support vector machine, K-Nearest-Neighbors, Random Forest, and Gradient Boosting Classifier, are validated through the confusion matrix. Current findings suggest that the Logistic Regression algorithm gives a high accuracy rate of 95% compared to other algorithms. It also shows high accuracy for f1-score, recall, and precision than the other four different algorithms. However, increasing the accuracy rates to approximately 97% to 100% of the machine learning algorithms is the future study and challenging part of this research.}, year = {2022} }
TY - JOUR T1 - Heart Disease Prediction Using Machine Learning Techniques AU - Mohammed Khalid Hossen Y1 - 2022/07/20 PY - 2022 N1 - https://doi.org/10.11648/j.ajcst.20220503.11 DO - 10.11648/j.ajcst.20220503.11 T2 - American Journal of Computer Science and Technology JF - American Journal of Computer Science and Technology JO - American Journal of Computer Science and Technology SP - 146 EP - 154 PB - Science Publishing Group SN - 2640-012X UR - https://doi.org/10.11648/j.ajcst.20220503.11 AB - Machine learning and artificial intelligence have been found useful in various disciplines during the course of their development, especially in the enormous increasing data in recent years. It can be more reliable for making better and faster decisions for disease predictions. So, machine learning algorithms are increasingly finding their application to predict various diseases. Constructing a model can also help us visualize and analyze diseases to improve reporting consistency and accuracy. This article has investigated how to detect heart disease by applying various machine learning algorithms. The study in this article has shown a two-step process. The heart disease dataset is first prepared into a required format for running through machine learning algorithms. Medical records and other information about patients are gathered from the UCI repository. The heart disease dataset is then used to determine whether or not the patients have heart disease. Secondly, Many valuable results are shown in this article. The accuracy rate of the machine learning algorithms, such as Logistic Regression, Support vector machine, K-Nearest-Neighbors, Random Forest, and Gradient Boosting Classifier, are validated through the confusion matrix. Current findings suggest that the Logistic Regression algorithm gives a high accuracy rate of 95% compared to other algorithms. It also shows high accuracy for f1-score, recall, and precision than the other four different algorithms. However, increasing the accuracy rates to approximately 97% to 100% of the machine learning algorithms is the future study and challenging part of this research. VL - 5 IS - 3 ER -