Cover Image for System.Linq.Enumerable+EnumerablePartition`1[System.Char]

AI for Health-Related Data Modeling

OAI: oai:igi-global.com:300780 DOI: 10.4018/IJISMD.300780
Published by: IGI Global

Abstract

Data modeling of health-related data from Data Center (DC) has positive effects for health monitoring, disease prevention, and healthcare research. However, health-related data has the characteristics of huge, high-dimensional, and non-normalized, which are not beneficial to direct analysis, so data needs to be preprocessed before data modeling. This paper focuses on the features of health-related data, and outlier detection during data preprocessing is studied. Meanwhile, we propose an improved algorithm for health-related data based outlier detection. The experimental results reveal that the proposed outlier detection algorithm has a smaller running time, and more outliers are detected compared to three baselines. In addition, local importance based random forest feature selection algorithm is proposed to measure the importance of each feature. The experimental results indicate that the proposed algorithm can select optimal feature subset to apply health-related data.