Cover Image for System.Linq.Enumerable+EnumerablePartition`1[System.Char]

Prediction of Diabetic Retinopathy Using Health Records With Machine Learning Classifiers and Data Science

OAI: oai:igi-global.com:299959 DOI: 10.4018/IJRQEH.299959
Published by: IGI Global

Abstract

Diabetes is a rapidly spreading disease. When the pancreas produces insufficient insulin or the body cannot utilise it effectively. Diabetic Retinopathy (DR) and blindness are two major issues for diabetics. Diabetes patients increase the amount of data collected about DR. To extract important information and undiscovered knowledge from data, data mining techniques are required. DM is necessary in DR to improve society's health. Our study focuses on the early detection of Diabetic Retinopathy using patient information. DM approaches are used to extract information from these numeric records. The dataset was used to forecast DR using logistic regression, KNN, SVM, bagged tree, and boosted tree classifiers. Two cross-validations are used to find the best features and avoid overfitting. Our dataset includes 900 diabetes patients. The boosted tree produced the best classification accuracy (90.1%) with 10% hold-out validation. KNN also achieved 88.9% accuracy, which is impressive. As a result, our research suggests that bagged trees and KNN are good classifiers for DR.