Weili Xu

weili_pic_34.jpg

       Email | GitHub

I am a sophomore in Computer Engineering, currently pursuing a dual degree from University of Illinois Urbana-Champaign and Zhejiang University.

I am fortunate to collaborate with Wenhao Chai and Enxin Song, working on Efficient Long Video Understanding. We built AuroraLong, a hybrid MLLM that efficiently handles hour-long videos on a single consumer GPU while achieving comparable performance to its Transformer counterparts on multiple video understanding benchmarks such as MLVU, MovieChat-1k and VDC.

I’m interested in various aspects of machine learning and computer systems:

  • Hardware-aware efficient algorithms
  • Exploiting sparsity for training and inference acceleration
  • Applications of multi-modal (video, audio, text, etc.) long-context modeling

news

Jun 25, 2025 One paper accepted by ICCV 2025, see you in Hawaii!
Mar 31, 2025 One paper accepted by the second CVPR workshop on Efficient Large Vision Models
Jan 21, 2025 Start to work as Teaching Assistant for ECE 220 Computer Systems & Programming with Prof. Ujjal Bhowmik

selected publications

  1. AuroraLong
    AuroraLong-preview.png
    Bringing RNNs Back to Efficient Open-Ended Video Understanding
    Weili XuEnxin SongWenhao Chai, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2025