Baselines for Video CIL

In order to evaluate our benchmark, we extend four methods from image domain. The first two are EWC and MAS. These are regularized methods that penalize changes to the most relevant parameters for the previous tasks. The other two, iCaRL and BIC, are memory-based methods that select and store samples of the current task into a memory buffer for future replay. Consistent with image benchmarks, the memory-based approaches significantly outperform the regularized methods.

ModelNum. TaskKineticsActivityNet-TrimUCF101
Mem. Video InstancesMem. Frame CapacityAccBWFMem. Video InstancesMem. Frame CapacityAccBWFMem. Video InstancesMem. Frame CapacityAccBWF
EWC10NoneNone5.81%16.05%NoneNone4.02%5.32%NoneNone9.51%98.94%
MAS10NoneNone7.81%10.12%NoneNone8.11%0.18%NoneNone10.89%11.11%
EWC20NoneNone2.95%32.70%NoneNone1.28%3.77%NoneNone4.71%92.12%
MAS20NoneNone4.25%5.54%NoneNone4.61%0.1%NoneNone5.90%5.31%
Naive1080002 × 10630.14%41.30%400015.5 × 10647.20%20.64%20203.69 × 10591.42%7.43%
iCaRL1080002 × 10632.04%38.74%400015.5 × 10648.53%19.72%20203.69 × 10580.97%18.11%
BiC1080002 × 10627.90%51.96%400015.5 × 10651.96%24.27%20203.69 × 10578.16%18.49%
Naive2080002 × 10623.47%48.05%400015.5 × 10640.78%23.18%20203.69 × 10587.40%10.96%
iCaRL2080002 × 10626.73%42.25%400015.5 × 10643.33%21.57%20203.69 × 10576.59%21.83%
BiC2080002 × 10623.06%58.97%400015.5 × 10646.53%15.95%20203.69 × 10570.69%24.90%