عرض بسيط للتسجيلة

المؤلفRen, Lijuan
المؤلفSeklouli, Aicha Sekhari
المؤلفZhang, Haiqing
المؤلفWang, Tao
المؤلفBouras, Abdelaziz
تاريخ الإتاحة2023-04-09T08:34:50Z
تاريخ النشر2023
اسم المنشورInformation Systems
المصدرScopus
معرّف المصادر الموحدhttp://dx.doi.org/10.1016/j.is.2022.102122
معرّف المصادر الموحدhttp://hdl.handle.net/10576/41755
الملخصAs the application of information technology in the medical field is resulting in a large amount of medical data. As early withdrawal and refusal of participants, there are a lot of missing values in medical data. Although various processing methods for missing values have been proposed, few methods for those medical data with characteristics of imbalance and mixed-type data. In this work, we proposed an adaptive Laplacian weight random forest, called ALWRF. In ALWRF, feature weights were adjusted dynamically when model constructing, which increases selection probabilities of features with low Laplacian score and high importance. Meanwhile, a random operator is introduced to increase the diversity of trees. Furthermore, we proposed an imputation method based on SMOTE-NC oversampling technology and the ALWRF method for imbalanced and mixed-type data, called SncALWRFI. Meanwhile, Bayesian optimization and cross-validation were employed to search optimal parameters. The experimental results showed that the ALWRF method outperforms random forest and Bayesian optimized random forest in terms of classification and regression accuracy. Further, in the experiment for missing values, the SncALWRFI showed the best imputation accuracy, and it performed high imputation effectiveness in public datasets with characteristics of imbalanced and mixed-type. 2022 Elsevier Ltd
راعي المشروعAuthor Lijuan REN is grateful for the support of China Scholarship Council (CSC). In addition, this research is supported by the Sichuan Science and Technology Program of China (No. 2021YFH0107 ). All the authors are thankful to their respective universities for their support: University of Lyon, Chengdu University of Information Technology and Qatar University.
اللغةen
الناشرElsevier
الموضوعImbalanced
Imputation
Missing values
Mixed-type
Random forest
العنوانAn adaptive Laplacian weight random forest imputation for imbalance and mixed-type data
النوعArticle
رقم المجلد111


الملفات في هذه التسجيلة

الملفاتالحجمالصيغةالعرض

لا توجد ملفات لها صلة بهذه التسجيلة.

هذه التسجيلة تظهر في المجموعات التالية

عرض بسيط للتسجيلة