Weili Xu
Seeking research opportunities in MLsys
Check out my CV here
I am a research intern at Together AI, as well as a rising senior undergraduate in Computer Engineering. I’m currently pursuing a dual degree from University of Illinois Urbana-Champaign and Zhejiang University.
I’m interested in various aspects of machine learning and computer systems:
- Efficient sequence modeling algorithms with hardware-aware implementation
- Heterogeneous runtime optimization for agentic workloads
- Long-context modeling for multi-modal (text, video, audio, etc.) applications
My journey into MLSys research began with AuroraLong, a hybrid multimodal LLM I built that unlocked hour long video understanding on consumer GPUs, which lead to a first-author paper accepted at ICCV 2025. This steered my focus toward system-driven modeling, where we co-design architecture and infrastructure to bridge the gap between fantastic algorithms and the rapid iteration that scales them.
news
| Jun 29, 2026 | ThunderAgent is accepted as a Spotlight paper at ICML 2026 and is integrated into NVIDIA Dynamo and SkyRL. |
|---|---|
| May 18, 2026 | Started internship at Together AI, see you in SF! |
| Oct 20, 2025 | Video-MMLU is granted Outstanding Paper Award by ICCV 2025 Workshop on Knowledge-Intensive Multimodal Reasoning! |