XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model We develop a multi-store memory model to untie accuracy with memory consumption -- achieving good results in both short and long videos. |
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation Rethinks mask tracking as an image correspondence problem and uses L2 similarity to encourage diversified voting -- simpler, better, and more efficient. |
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion A more user-friendly and efficient paradigm of iVOS in which interactions and propagations are decoupled, with the user’s intention captured by a novel difference-aware fusion module. |
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement Refines segmentations (4K and beyond) in a class-agnostic manner without using any high-resolution training data through a set of carefully designed cascade operations. |
Icons from Icons8