Article ; Online: An unsupervised cluster-based feature grouping model for early diabetes detection
Healthcare Analytics, Vol 2, Iss , Pp 100112- (2022)
2022
Abstract: Diabetes mellitus is often a hyperglycemic condition that poses a substantial threat to human health. Early diabetes detection decreases morbidity and mortality. Due to the scarcity of labeled data and the presence of oddities in diabetes datasets, it is ...
Abstract | Diabetes mellitus is often a hyperglycemic condition that poses a substantial threat to human health. Early diabetes detection decreases morbidity and mortality. Due to the scarcity of labeled data and the presence of oddities in diabetes datasets, it is exceedingly difficult to develop a trustworthy and accurate diabetes prognosis. The dataset and groupings of the features using the elbow and silhouette methods have been clustered using K-means. Various machine learning approaches have also been applied to the cluster-based dataset to predict diabetes. We propose an unsupervised cluster-based feature grouping model for early diabetes identification using an open-source dataset containing the data of 520 diabetic patients. On the cluster-based dataset and the complete dataset, the maximum Accuracy (ACC) is 99.57% and 99.03%, respectively. The best Precision, Recall, minimum mean squared error (MSE), maximum mean squared error (MSE), and F1-Score of 1.000 are obtained from multi-layer perceptron (MLP), random forest (RF), and k-Nearest Neighbors (KNN), 0.984 from random forest (RF) and support vector machine (SVM), 0.010 from RF, 0.067 from KNN, and 99.20% from RF, respectively. A comparison table displays the anticipated outcomes and highlights the aspects of this research that are most likely to occur as intended. The preprocessed data and codes are available on the GitHub repository to https://github.com/mhashiq/Early-stage-diabetes-risk-prediction. |
---|---|
Keywords | Supervised learning ; Predictive analytics ; Feature ranking ; Elbow method ; Silhouette method ; Diabetics prediction ; Computer applications to medicine. Medical informatics ; R858-859.7 |
Subject code | 006 ; 310 |
Language | English |
Publishing date | 2022-11-01T00:00:00Z |
Publisher | Elsevier |
Document type | Article ; Online |
Database | BASE - Bielefeld Academic Search Engine (life sciences selection) |
Full text online
More links
Kategorien
Inter-library loan at ZB MED
Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.