Crossover Learning for Fast Online Video Instance Segmentation
Modeling temporal visual context across frames is critical for video instance segmentation (VIS) and other video understanding tasks....

