Skip to Main Content (Press Enter)

Logo UNIBG
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIBG

|

UNI-FIND

unibg.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

Weighted Ensemble Models Are Strong Continual Learners

Contributo in Atti di convegno
Data di Pubblicazione:
2025
Citazione:
(2025). Weighted Ensemble Models Are Strong Continual Learners . Retrieved from https://hdl.handle.net/10446/311030
Abstract:
In this work, we study the problem of continual learning (CL) where the goal is to learn a model on a sequence of tasks, under the assumption that the data from the previous tasks becomes unavailable while learning on the current task data. CL is essentially a balancing act between learning on the new task (i.e. plasticity) and maintaining the performance on the previously learned concepts (i.e. stability). To address the stability-plasticity trade-off, we propose to perform weight-ensembling of the model parameters of the previous and current tasks. This weighted-ensembled model, which we call Continual Model Averaging (or CoMA), attains high accuracy on the current task by leveraging plasticity, while not deviating too far from the previous weight configuration, ensuring stability. We also propose an improved variant of CoMA, named Continual Fisher-weighted Model Averaging (or CoFiMA), that selectively weighs each parameter in the weights ensemble by leveraging the Fisher information of the weights of the model. Both variants are conceptually simple, easy to implement, and effective in attaining state-of-the-art performance on several standard CL benchmarks. Code is available at: https://github.com/IemProg/CoFiMA.
Tipologia CRIS:
1.4.01 Contributi in atti di convegno - Conference presentations
Keywords:
Continual Learning; Model Averaging
Elenco autori:
Marouf, I. E.; Roy, Subhankar; Tartaglione, E.; Lathuilière, S.
Autori di Ateneo:
ROY Subhankar
Link alla scheda completa:
https://aisberg.unibg.it/handle/10446/311030
Titolo del libro:
In European Conference on Computer Vision (pp. 306-324). Cham: Springer Nature Switzerland.
Pubblicato in:
LECTURE NOTES IN COMPUTER SCIENCE
Series
  • Ricerca

Ricerca

Settori


Settore IINF-05/A - Sistemi di elaborazione delle informazioni
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 25.11.5.0