profile picture
Ho Kei (Rex) Cheng

I am currently a Ph.D. candidate at the University of Illinois Urbana-Champaign, advised by Alexander Schwing. Before that, I was at The Hong Kong University of Science and Technology, advised by Yu-Wing Tai and Chi Keung Tang.

I am recently working on algorithms for videos, including segmentation, tracking, editing, and generation. I have interned at Adobe Research (open-world video segmentation) and Kaiber (diffusion models for videos). I will intern at Sony AI in Tokyo this summer.

[GitHub] | [Google Scholar] | [CV]

Research (hover over videos to play)
CVPR 2024 Highlight
Project page / code / arXiv / pdf
Equipped with an object transformer that integrates pixel-level features and object-level features for efficient and robust video object segmentation in challenging scenarios. Used by iMotions and Annolid.
ICCV 2023
Project page / code / arXiv / pdf
Open-world video segmentation achieved by combining universal image segmentation with temporal propagation. Easy to extend.
Ho Kei Cheng, Alexander Schwing.
ECCV 2022
Project page / code / arXiv / pdf
We look at video object segmentation from a memory perspective and design a pipeline that models both short-term and long-term dependencies effectively. Used by supervisely and Track-Anything.
Ho Kei Cheng, Yu-Wing Tai, Chi Keung Tang.
NeurIPS 2021
Project page / code / arXiv / pdf
We devise a new, simple, and effective way of modeling correspondences between pixels from different frames. Used by Trioscope and BURST.
Ho Kei Cheng, Yu-Wing Tai, Chi Keung Tang.
CVPR 2021
Project page / code / arXiv / pdf
We decouple the problem of interactive video segmentation into single-frame interaction and temporal propagation, showing that this works better by a large margin. Used by Sieve.
CVPR 2020
Project page / code / arXiv / pdf / pypi
We train an iterative refinement network that generalizes to high-quality/high-resolution (4K+) segmentation with just low-resolution (<500 pixels per side) data.

Invited Talks
Professional Activities