A Model for Coronary Heart Disease Prediction Using Data Mining Classification Techniques

Dominic Obwogi Makumba
Wilson Cheruiyot
Kennedy Ogada

Abstract

Heart disease is one of the leading causes of death in the world. Its early prediction and diagnosis are therefore vital in the medical field, since they can facilitate timely treatment, reduce health costs, and reduce the deaths it causes. Most patients cannot afford the cost of treating the disease, and clinical decisions are often based on doctors' intuition and experience rather than on the knowledge-rich information hidden in stored data. A model that predicts heart disease using data mining classification techniques reduces medical errors, decreases unwanted practice variation, enhances patient well-being, and improves patient outcomes. The model was developed to support decision making in heart disease prediction based on data mining techniques. Experiments were performed with the model using three techniques, and their prediction accuracy was noted. The data mining methods used were decision tree, naïve Bayes, and KNN (K-Nearest Neighbors), together with the WEKA API (Waikato Environment for Knowledge Analysis application programming interface). The model predicts the likelihood of heart disease using a larger set of input medical attributes. Until now, 13 attributes have been used to predict the likelihood of a patient getting heart disease: blood pressure, sex, age, cholesterol, and blood sugar, among other factors such as genetic factors, sedentary behavior, socio-economic status, and race.
This study added two more attributes: obesity and smoking. A set of 740 records with medical attributes was obtained from a publicly available heart disease database in a machine learning repository. Patterns significant to heart attack prediction were extracted from the data, which was divided into two sets: one of 296 records used for training and another of 444 records used for testing. The percentage accuracy of each data mining classification technique applied was used as the standard performance measure. Performance was compared by computing the confusion matrix, from which precision, recall, and accuracy are derived. The complete system model provided high performance and accuracy. A comparison of the prediction capability of the proposed techniques and the existing one is presented. The model assists clinicians in predicting the survival rate of an individual patient so that future medication can be planned. Consequently, patients, their families, and relatives can plan for treatment preferences and budget accordingly.
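
The evaluation step described in the abstract, deriving accuracy, precision, and recall from a confusion matrix, can be illustrated with a minimal Python sketch. This is not the authors' WEKA-based implementation; the function names, the 0/1 label encoding (1 = heart disease present), and the sample labels are assumptions made only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Counts for a binary classifier (hypothetical helper type)."""
    tp: int  # disease present, predicted present
    fp: int  # disease absent, predicted present
    fn: int  # disease present, predicted absent
    tn: int  # disease absent, predicted absent


def confusion_matrix(actual, predicted, positive=1):
    """Tally the four outcome counts from paired label sequences."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == positive:
            if a == positive:
                tp += 1
            else:
                fp += 1
        else:
            if a == positive:
                fn += 1
            else:
                tn += 1
    return ConfusionMatrix(tp, fp, fn, tn)


def metrics(cm):
    """Return accuracy, precision, and recall; 0.0 when a denominator is zero."""
    total = cm.tp + cm.fp + cm.fn + cm.tn
    predicted_pos = cm.tp + cm.fp
    actual_pos = cm.tp + cm.fn
    return {
        "accuracy": (cm.tp + cm.tn) / total if total else 0.0,
        "precision": cm.tp / predicted_pos if predicted_pos else 0.0,
        "recall": cm.tp / actual_pos if actual_pos else 0.0,
    }


if __name__ == "__main__":
    # Made-up labels for eight test records (assumption, not study data).
    actual = [1, 1, 1, 0, 0, 0, 1, 0]
    predicted = [1, 1, 0, 0, 0, 1, 1, 0]
    cm = confusion_matrix(actual, predicted)
    print(cm)
    print(metrics(cm))
```

In a comparison such as the one the abstract describes, the same three measures would be computed on the test records for each classifier, so that the techniques can be ranked on a common basis.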

Keywords:
WEKA API, decision tree, naïve Bayes, KNN, cardiovascular disease, KDD

Article Details

How to Cite
Makumba, D., Cheruiyot, W., & Ogada, K. (2019). A Model for Coronary Heart Disease Prediction Using Data Mining Classification Techniques. Asian Journal of Research in Computer Science, 3(4), 1-19. https://doi.org/10.9734/ajrcos/2019/v3i430098
Section
Minireview Article
