Show simple item record

AuthorLi, Depeng
AuthorZeng, Zhigang
AuthorDai, Wei
AuthorSuganthan, Ponnuthurai Nagaratnam
Available date2025-05-08T11:11:45Z
Publication Date2025
Publication NameIEEE Transactions on Knowledge and Data Engineering
Identifierhttp://dx.doi.org/10.1109/TKDE.2025.3550809
CitationLi, D., Zeng, Z., Dai, W., & Suganthan, P. N. (2025). Complementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning. IEEE Transactions on Knowledge and Data Engineering.
ISSN1041-4347
URIhttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105000289812&origin=inward
URIhttp://hdl.handle.net/10576/64812
AbstractIn the scenario of class-incremental learning (CIL), deep neural networks have to adapt their model parameters to non-stationary data distributions, e.g., the emergence of new classes over time. To mitigate the catastrophic forgetting phenomenon, typical CIL methods either cumulatively store exemplars of old classes for retraining model parameters from scratch or progressively expand model size as new classes arrive, which, however, compromises their practical value due to little attention paid to parameter efficiency. In this paper, we contribute a novel solution, effective control of the parameters of a well-trained model, by the synergy between two complementary learning subnetworks. Specifically, we integrate one plastic feature extractor and one analytical feed-forward classifier into a unified framework amenable to streaming data. In each CIL session, it achieves non-overwritten parameter updates in a cost-effective manner, neither revisiting old task data nor extending previously learned networks; Instead, it accommodates new tasks by attaching a tiny set of declarative parameters to its backbone, in which only one matrix per task or one vector per class is kept for knowledge retention. Experimental results on a variety of task sequences demonstrate that our method achieves competitive results against state-of-the-art CIL approaches, especially in accuracy gain, knowledge transfer, training efficiency, and task-order robustness. Furthermore, a graceful forgetting implementation on previously learned trivial tasks is empirically investigated to make its non-growing backbone (i.e., a model with limited network capacity) suffice to train on more incoming tasks.
Languageen
PublisherInstitute of Electrical and Electronics Engineers Inc. (IEEE)
Subjectclass-incremental learning
complementary learning system
Non-stationary data
streaming data modeling
TitleComplementary Learning Subnetworks towards Parameter-Efficient Class-Incremental Learning
TypeArticle
dc.accessType Full Text


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record