Complementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning

Li, Depeng; Zeng, Zhigang; Dai, Wei; Suganthan, Ponnuthurai Nagaratnam

Author	Li, Depeng
Author	Zeng, Zhigang
Author	Dai, Wei
Author	Suganthan, Ponnuthurai Nagaratnam
Available date	2025-05-08T11:11:45Z
Publication Date	2025
Publication Name	IEEE Transactions on Knowledge and Data Engineering
Identifier	http://dx.doi.org/10.1109/TKDE.2025.3550809
Citation	Li, D., Zeng, Z., Dai, W., & Suganthan, P. N. (2025). Complementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning. IEEE Transactions on Knowledge and Data Engineering.
ISSN	1041-4347
URI	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105000289812&origin=inward
URI	http://hdl.handle.net/10576/64812
Abstract	In the scenario of class-incremental learning (CIL), deep neural networks have to adapt their model parameters to non-stationary data distributions, e.g., the emergence of new classes over time. To mitigate the catastrophic forgetting phenomenon, typical CIL methods either cumulatively store exemplars of old classes for retraining model parameters from scratch or progressively expand model size as new classes arrive, which, however, compromises their practical value due to little attention paid to parameter efficiency. In this paper, we contribute a novel solution, effective control of the parameters of a well-trained model, by the synergy between two complementary learning subnetworks. Specifically, we integrate one plastic feature extractor and one analytical feed-forward classifier into a unified framework amenable to streaming data. In each CIL session, it achieves non-overwritten parameter updates in a cost-effective manner, neither revisiting old task data nor extending previously learned networks. Instead, it accommodates new tasks by attaching a tiny set of declarative parameters to its backbone, in which only one matrix per task or one vector per class is kept for knowledge retention. Experimental results on a variety of task sequences demonstrate that our method achieves competitive results against state-of-the-art CIL approaches, especially in accuracy gain, knowledge transfer, training efficiency, and task-order robustness. Furthermore, a graceful forgetting implementation on previously learned trivial tasks is empirically investigated to make its non-growing backbone (i.e., a model with limited network capacity) suffice to train on more incoming tasks.
Language	en
Publisher	Institute of Electrical and Electronics Engineers Inc. (IEEE)
Subject	class-incremental learning; complementary learning system; non-stationary data; streaming data modeling
Title	Complementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning
Type	Article
dc.accessType	Full Text

Files in this item

Name:: Complementary_Learning_Subnetw ...
Size:: 3.884Mb
Format:: PDF

This item appears in the following Collection(s)

Information Intelligence [109 items]
