I am a third-year Ph.D. student in Computer Science at UCLA, where I am fortunate to be advised by Prof. Harry Xu.
My research focuses on machine learning systems, where I design and build high-performance, cost-effective solutions by leveraging application semantics. My current work includes developing scalable and efficient systems for large language model (LLM) serving (Prism, ConServe) and video querying (VQPy).
Before my PhD, I spent four years at Intel as an AI frameworks engineer, building open-source big data and AI systems optimized for Intel CPUs in data centers.