Abstract
Cardiovascular disease has become one of the world's major causes of death. Accurate and timely diagnosis is of crucial importance. We constructed an intelligent diagnostic framework for prediction of heart disease, using the Cleveland Heart disease dataset. We have used three machine learning approaches, Decision Tree (DT), K-Nearest Neighbor (KNN), and Random Forest (RF) in combination with different sets of features. We have applied the three techniques to the full set of features, to a set of ten features selected by "Pearson's Correlation"technique and to a set of six features selected by the Relief algorithm. Results were evaluated based on accuracy, precision, sensitivity, and several other indices. The best results were obtained with the combination of the RF classifier and the features selected by Relief achieving an accuracy of 98.36%. This could even further be improved by employing a 5-fold Cross Validation (CV) approach, resulting in an accuracy of 99.337%.
Original language | English |
---|---|
Title of host publication | ICISDM 2021 - 5th International Conference on Information System and Data Mining |
Place of Publication | New York |
Publisher | Association for Computing Machinery (ACM) |
Pages | 14-20 |
Number of pages | 7 |
Volume | 1 |
ISBN (Electronic) | 9781450389549 |
DOIs | |
Publication status | Published - 2021 |
Event | 5th International Conference on Information System and Data Mining, ICISDM 2021 - Virtual, Online, United States Duration: 27 May 2021 → 29 May 2021 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 5th International Conference on Information System and Data Mining, ICISDM 2021 |
---|---|
Country/Territory | United States |
City | Virtual, Online |
Period | 27/05/21 → 29/05/21 |
Bibliographical note
Publisher Copyright:© 2021 ACM.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.