Weili Xu

       Email | GitHub

I am a sophomore in Computer Engineering, currently pursuing a dual degree from the University of Illinois Urbana-Champaign and Zhejiang University.

I am fortunate to collaborate with Wenhao Chai and Enxin Song on efficient long video understanding. We built LongVidRWKV, a hybrid MLLM that efficiently handles hour-long videos on a single consumer GPU while achieving performance comparable to its Transformer counterparts on multiple video understanding benchmarks such as MLVU, MovieChat-1k, and VDC.

I’m interested in various aspects of machine learning and computer systems:

  • Hardware-aware efficient algorithms
  • Exploiting sparsity for training and inference acceleration
  • Applications of multi-modal (video, audio, text, etc.) long-context modeling

news

Jun 25, 2025 One paper accepted by ICCV 2025, see you in Hawaii!
Mar 31, 2025 One paper accepted by the second CVPR workshop on Efficient Large Vision Models
Jan 21, 2025 Started working as a Teaching Assistant for ECE 220 Computer Systems & Programming with Prof. Ujjal Bhowmik

selected publications

  1. ICCV 2025
    Bringing RNNs Back to Efficient Open-Ended Video Understanding
    Weili Xu, Enxin Song, Wenhao Chai, and 3 more authors
    To appear at International Conference on Computer Vision (ICCV) 2025, Oct 2025