Complementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning
Author | Li, Depeng |
Author | Zeng, Zhigang |
Author | Dai, Wei |
Author | Suganthan, Ponnuthurai Nagaratnam |
Available date | 2025-05-08T11:11:45Z |
Publication Date | 2025 |
Publication Name | IEEE Transactions on Knowledge and Data Engineering |
Identifier | http://dx.doi.org/10.1109/TKDE.2025.3550809 |
Citation | Li, D., Zeng, Z., Dai, W., & Suganthan, P. N. (2025). Complementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning. IEEE Transactions on Knowledge and Data Engineering. |
ISSN | 1041-4347 |
Abstract | In the scenario of class-incremental learning (CIL), deep neural networks have to adapt their model parameters to non-stationary data distributions, e.g., the emergence of new classes over time. To mitigate the catastrophic forgetting phenomenon, typical CIL methods either cumulatively store exemplars of old classes for retraining model parameters from scratch or progressively expand model size as new classes arrive, which, however, compromises their practical value due to little attention paid to parameter efficiency. In this paper, we contribute a novel solution, effective control of the parameters of a well-trained model, by the synergy between two complementary learning subnetworks. Specifically, we integrate one plastic feature extractor and one analytical feed-forward classifier into a unified framework amenable to streaming data. In each CIL session, it achieves non-overwritten parameter updates in a cost-effective manner, neither revisiting old task data nor extending previously learned networks; Instead, it accommodates new tasks by attaching a tiny set of declarative parameters to its backbone, in which only one matrix per task or one vector per class is kept for knowledge retention. Experimental results on a variety of task sequences demonstrate that our method achieves competitive results against state-of-the-art CIL approaches, especially in accuracy gain, knowledge transfer, training efficiency, and task-order robustness. Furthermore, a graceful forgetting implementation on previously learned trivial tasks is empirically investigated to make its non-growing backbone (i.e., a model with limited network capacity) suffice to train on more incoming tasks. |
Language | en |
Publisher | Institute of Electrical and Electronics Engineers Inc. (IEEE) |
Subject | class-incremental learning complementary learning system Non-stationary data streaming data modeling |
Type | Article |
Files in this item
This item appears in the following Collection(s)
-
Information Intelligence [98 items ]