Me
Junkai Zhang (张峻恺)
CS PHD Student @ UCLA

About

I'm now a fourth-year Ph.D. student in the Computer Science Department at UCLA. I am fortunate to be advised by Professor Wei Wang. I am broadly interested in Large Language Model post-training. I received my B.S. degree in Mathematics from Tsinghua University in 2022, with a minor in Philosophy

Education


Ph.D.           Sep. 2022 - Present
                     University of California, Los Angeles (UCLA), Los Angeles, CA, U.S.
                     Ph.D. student in Computer Science
B.S.              Aug. 2018 - Jun. 2022
                     Tsinghua University (THU), Beijing, China.
                     B.S. in Mathematics, Minor in Philosophy

Professional Experience

Oct. 2025 -- Present
Google, Playa Vista, CA, U.S.
Student Researcher
Jun. 2025 -- Sep. 2025
Scale AI, San Francisco, CA, U.S.
Post-training Research Intern

Publications & Manuscripts

Selected Works

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
In Submission, available on arXiv
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
[Paper] [Code] [Model]
In Submission, available on arXiv

AI for Science

MatSciBench: Benchmarking the Reasoning Ability of LLM in Material Science
In Submission, available on arXiv
MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
[Paper] [Demo]
NAACL 2025 - Demo Track
Protein Large Language Models: A Comprehensive Survey
[Paper]
EMNLP 2025 Findings
MetamatBench: Integrating Heterogeneous Data, Computational Tools, and Visual Interface for Metamaterial Discovery
[Paper] [Code]
KDD 2025 Datasets and Benchmarks Track
Neural network-assisted personalized handwriting analysis for Parkinson's disease diagnostics
[Paper]
Nature Chemical Engineering
Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems
[Paper]
ACM Web Conference (WWW) 2024

Reinforcement Learning

Uncertainty-Aware Reward-Free Exploration with General Function Approximation
[Paper] [Code]
International Conference on Machine Learning (ICML) 2024
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
[Paper]
International Conference on Machine Learning (ICML) 2023

Generative Models

Fast Sampling via De-randomization for Discrete Diffusion Models
[Paper] [Code]
Advances in Neural Information Processing Systems (NeurIPS) 2024

Optimization Methods

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
[Paper]
Advances in Neural Information Processing Systems (NeurIPS) 2023

Teaching

Academic Services

Conference Reviewer: ICML [2025], ICLR [2024-2026], NeurIPS [2025], NAACL [2025], AAAI [2024-2025], AISTATS [2024]
Top