Weili Xu

weili_pic_34.jpg

       Email | GitHub

Seeking research opportunities in MLsys
Check out my CV here

I am a third-year undergraduate in Computer Engineering, currently pursuing a dual degree from University of Illinois Urbana-Champaign and Zhejiang University.

I am fortunate to collaborate with Wenhao Chai and Enxin Song, working on Efficient Long Video Understanding. We built AuroraLong, a hybrid MLLM that efficiently handles hour-long videos on a single consumer GPU while achieving comparable performance to its Transformer counterparts on multiple video understanding benchmarks such as MLVU, MovieChat-1k and VDC.

I’m interested in various aspects of machine learning and computer systems:

  • Hardware-aware efficient algorithms
  • Exploiting sparsity for training and inference acceleration
  • Applications of multi-modal (video, audio, text, etc.) long-context modeling

news

Oct 20, 2025 Video-MMLU is granted Outstanding Paper Awared by ICCV 2025 Workshop on Knowledge-Intensive Multimodal Reasoning!
Jul 11, 2025 One paper accepted by ICCV 2025 Findings
Jun 25, 2025 One paper accepted by ICCV 2025, see you in Hawaii!
Mar 31, 2025 One paper accepted by the second CVPR workshop on Efficient Large Vision Models

selected publications

  1. AuroraLong
    AuroraLong-preview.png
    Bringing RNNs Back to Efficient Open-Ended Video Understanding
    Weili XuEnxin SongWenhao Chai, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2025
  2. Video-MMLU
    video-mmlu-prev.png
    Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark
    Enxin SongWenhao ChaiWeili Xu, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, Oct 2025