Medical Datasets Training and Enhancing Disease Prediction Accuracy
DOI:
https://doi.org/10.31185/wjps.441Keywords:
Data mining, medical dataset, Covid-19, and classification.Abstract
Data mining is an effective method that uses sophisticated tools and strategies to sift through massive databases in search of useful patterns. Its usefulness extends to many fields, including medicine. We used AdaBoost, Logistic Regression (LR), K-Nearest Neighbors (KNN), and Random Forest (RF) as predictive models in our investigation (ADaB). Evaluating utilizing methods like cross-validation and random sampling allowed us to concentrate on improving accuracy. Medical datasets were used to construct the following: Heart Disease (HD) dataset, Breast Cancer Wisconsin (BCW) dataset, and Covid-19 dataset. We set out to improve the accuracy of disease predictions and push the boundaries of medical data analysis. After completing the assessment procedure with the training data, the performance measurements showed that the highest accuracy was achieved with LR 83% for HD, KNN 97%, and LR 98% for Covid-19.
References
Shanbehzadeh, M., Kazemi-Arpanahi, H., Ghalibaf, M. B., & Orooji, A. (2022). Performance evaluation of machine learning for breast cancer diagnosis: A case study. Informatics in Medicine Unlocked,
Zhang Z, Zixuan Li(2022)Evaluation Methods for Breast Cancer Prediction in Machine Learning Field. SHS Web of Conferences 144, 03010
Sanjay Patidar , Deepak Kumar , Dheeraj Rukwal (2022)Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction
Masood, F., Masood, J., Zahir, H., Driss, K., Mehmood, N., & Farooq, H. (2022). Novel Approach to Evaluate Classification Algorithms and Feature Selection Filter Algorithms Using Medical Data. Journal of Computational and Cognitive Engineering, 2(1), 57–67.
Ibomoiye Domor Mienye , Yanxia Sun, Professor , Zenghui Wang, Professor b(2020) An improved ensemble learning approach for the prediction of heart disease risk
Masood, F., Masood, J., Zahir, H., Driss, K., Mehmood, N., & Farooq, H. (2022). Novel Approach to Evaluate Classification Algorithms and Feature Selection Filter Algorithms Using Medical Data. Journal of Computational and Cognitive Engineering, 2(1), 57–67.
Ravichandran, B.D., Keikhosrokiani, P. Classification of Covid-19 misinformation on social media based on neuro-fuzzy and neural network: A systematic review. Neural Comput & Applic 35, 699–717 (2023)
Sinaga, Kristina P., and Miin-Shen Yang. "Unsupervised K-means clustering algorithm." IEEE access 8 (2020): 80716-80727.
Senthilkumar, D., and S. Paulraj. "Prediction of low-birth-weight infants and its risk factors using data mining techniques." Proceedings of the 2015 international conference on industrial engineering and operations management. 2015.
Khamis, Hassan Shee, Kipruto W. Cheruiyot, and Stephen Kimani. "Application of k-nearest neighbour classification in medical data mining." International Journal of Information and Communication Technology Research 4.4
(2014).
Vezhnevets, Alexander, and Vladimir Vezhnevets. "Modest AdaBoost-teaching AdaBoost to generalize better." Graphicon. Vol. 12. No. 5. 2005.
Maalouf, Maher. "Logistic regression in data analysis: an overview." International Journal of Data Analysis Techniques and Strategies 3.3 (2011): 28299
Hilbe, Joseph M.” Logistic regression models.” Chapman and hall/CRC, 2009
Han, J., Kamber, M. & Pei, J. “Mining frequent patterns, associations, And correlations. Data Mining: Concepts and Techniques”, (2nd Ed., pp. 227-283) San Francisco, USA: Morgan Kaufmann Publishers,2006.
Shelly, Gary B., and J. Thomas. "Cashman, and Harry J." Rosenblatt, “Systems Analysis and Design Six Edition,” Course Technology (2006).
Yun Xu and Royston Goodacre,” On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning”, Journal of Analysis and Testing.
Garg, Satvik. "Drug recommendation system based on sentiment analysis of drug reviews using machine learning." 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence). IEEE, 2021.
Makridakis, Spyros. "Accuracy measures: theoretical and practical concerns." International journal of forecasting 9.4 (1993): 527-529.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Asraa Mounaf Almousawy, Enas Fadhil Abdullah, Fatima Mohamad Abbas

This work is licensed under a Creative Commons Attribution 4.0 International License.