XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
We develop a multi-store memory model to untie accuracy with memory consumption -- achieving good results in both short and long videos.
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Rethinks mask tracking as an image correspondence problem and uses L2 similarity to encourage diversified voting -- simpler, better, and more efficient.
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion
A more user-friendly and efficient paradigm of iVOS in which interactions and propagations are decoupled, with the user’s intention captured by a novel difference-aware fusion module.
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Refines segmentations (4K and beyond) in a class-agnostic manner without using any high-resolution training data through a set of carefully designed cascade operations.
Icons from Icons8