Comparative Study on Early Stage Diabete Detection by Using Machine Learning Methods

dc.contributor.authorCherifi, Dalila
dc.contributor.authorDjellouli, Seyyid Ahmed
dc.contributor.authorRiabi, Hanane
dc.contributor.authorHamadouche, Mohamed
dc.date.accessioned2024-05-05T13:10:22Z
dc.date.available2024-05-05T13:10:22Z
dc.date.issued2023
dc.description.abstractThis paper introduces an innovative approach to diabetes prediction, leveraging machine learning algorithms. The study is dedicated to elevating the precision of medical examinations through the application of machine learning to electronic health records (EHRs). In our investigation of the Pima Indian dataset, we employed two distinct strategies-imputation data and, notably, the novel filtered data approach-to address missing values. Subsequently, we rigorously evaluated six supervised machine learning models, encompassing Logistic Regression, Random Forest, K-Nearest Neighbor, Support Vector Machine, XGBoost, and Cat Boost. Metrics including accuracy, precision, sensitivity, specificity, and stability were meticulously assessed. Encouragingly, we achieved a commendable 98% accuracy with the Random Forest classifier using the imputation data strategy. However, our groundbreaking contribution lies in the filtered data approach, where we achieved an equally promising 84% accuracy using the XGBoost classifier. This pivotal finding unequivocally establishes the superiority of the filtered data methodology, signifying a significant leap towards enhancing patient risk scoring systems and foreseeing the onset of disease.en_US
dc.identifier.uri10.1109/ICNAS59892.2023.10330477
dc.identifier.urihttps://ieeexplore.ieee.org/document/10330477
dc.identifier.urihttps://dspace.univ-boumerdes.dz/handle/123456789/13879
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.relation.ispartofseries2023 International Conference on Networking and Advanced Systems (ICNAS), Algiers, Algeria(2023);pp. 1-6
dc.subjectDiabetic Detectionen_US
dc.subjectMachine Learning Methodsen_US
dc.subjectLogistic Regressionen_US
dc.subjectRandom Foresten_US
dc.subjectK-Nearest Neighbouren_US
dc.subjectSupport Vector Machineen_US
dc.subjectXGBoosten_US
dc.subjectCat Boosten_US
dc.titleComparative Study on Early Stage Diabete Detection by Using Machine Learning Methodsen_US
dc.typeBooken_US

Files