Datasets

vCLIMB is a novel benchmark devised to evaluate Class Incremental Learning in video domain to promote and facilitate research in this area. Our benchmark includes three well-known video datasets: UCF101, Kinetics, and ActivityNet.

Our Benchmark

This table shows the main attributes of the Continual Learning Scenarios we propose in our benchmark.

  • We create two sets of tasks, one with ten tasks and the other with twenty. The last one is more challenging because it has a longer series of tasks. Therefore, it offers more chances to forget.
  • We do not leverage pretraining of the same data distribution to isolate the forgetting problem.
  • For the first time, we propose evaluating a more realistic and challenging scenario using untrimmed videos.
  • The video instances contain a temporal dimension size that could show large variability. To favor fair comparisons between methods and datasets, we define the working memory size of the rehearsal methods in terms of stored frames.
SetTasksClasses Per TaskAvg. Frames Per VideoUntrimmed Videos
vCLIMB UCF1011010183
vCLIMB UCF101205183
vCLIMB Kinetics1040250
vCLIMB Kinetics2020250
vCLIMB ActivityNet-Untrim10203542
vCLIMB ActivityNet-Untrim20103542
vCLIMB ActivityNet-Trim10203879
vCLIMB ActivityNet-Trim20103879