A Novel Class Noise Detection Method for High-Dimensional Data in Industrial Informatics
Author | Guan, Donghai |
Author | Chen, Kai |
Author | Han, Guangjie |
Author | Huang, Shuqiang |
Author | Yuan, Weiwei |
Author | Guizani, Mohsen |
Author | Shu, Lei |
Available date | 2022-11-08T09:44:22Z |
Publication Date | 2021-03-01 |
Publication Name | IEEE Transactions on Industrial Informatics |
Identifier | http://dx.doi.org/10.1109/TII.2020.3012658 |
Citation | Guan, D., Chen, K., Han, G., Huang, S., Yuan, W., Guizani, M., & Shu, L. (2020). A novel class noise detection method for high-dimensional data in industrial informatics. IEEE Transactions on Industrial Informatics, 17(3), 2181-2190. |
ISSN | 15513203 |
Abstract | The data in industrial informatics may be high-dimensional and mislabeled. Irrelevant or noisy features pose a significant challenge to the detection of high-dimensional mislabeling. The traditional method usually adopts a two-step solution, first finding the relevant subspace and then using it for mislabeling detection. This two-step method struggles to provide the optimal mislabeling detection performance, since it separates the procedures of feature selection and label error detection. To solve this problem, in this article, we integrate the two steps and propose a sequential ensemble noise filter (SENF). In the SENF, relevant features are selected and used to generate a noise score for each instance. Continuously, these noise scores guide feature selection in the regression learning. Thus, the SENF falls in the scope of sequential ensemble learning. We evaluate our approach on several benchmark datasets with high dimensionality and much label noise. It is shown that the SENF is significantly better than other existing label noise detection methods. |
Sponsor | This work was supported in part by the National Key Research, and Development Program under Grant 2017YFE0125300 and Grant 2018YFB1702700, in part by the National Natural Science Foundation of China under Grant 61672284 and Grant 61772233, in part by the Key Research and Development Program of Jiangsu Province under Grant BE2019012 and Grant BE2019648, and in part by the project of Shenzhen Science and Technology Innovation Committee under Grant JCYJ20190809145407809. |
Language | en |
Subject | High dimension industrial informatics noise filtering |
Type | Article |
Pagination | 2181-2190 |
Issue Number | 3 |
Volume Number | 17 |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |
This item appears in the following Collection(s)
-
Computer Science & Engineering [2402 items ]