Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation Rethinks mask tracking as an image correspondence problem and uses L2 similarity to encourage diversified voting -- simpler, better, and more efficient. |