I am a Ph.D. candidate at the University of Illinois Urbana-Champaign, advised by
I work on visual understanding, focusing on videos. My past research includes video object tracking, segmentation, and multimodal-conditioned video-to-audio synthesis. I have interned at Adobe Research (open-world video segmentation), Kaiber (videos diffusion models), and Sony AI (multimodal flow matching models).