Hi there! I am a 4-th year undergraduate student at Nanjing University, majoring in Software
Engineering. Currently I'm working as a Research Assistant at XLANG
Lab (as part of the HKU
NLP
Group) with Prof. Tao Yu. Before that,
I was honoured to work with Prof. Shujian
Huang
at
Nanjing University.
My research interests lie in ML and NLP. Nowadays, I am working on Computer Use Multimodal
LLM agents and Code Generation.
Here is the problem I am thinking about: How to design smarter GUI agents
with reinforcement learning (RL) and large language models
(LLM)?
I am looking for a Ph.D. position starting in 2025 Fall. Please feel free to reach out!
OSWorld🖥️: A unified, real computer env for multimodal agents to evaluate open-ended computer tasks
with arbitrary apps and interfaces on Ubuntu, Windows, & macOS.
Spider2-V is a multimodal agent benchmark spanning across the entire data science and engineering
workflow.
Education
Nanjing University 2021.09 - 2025.07 (Expected)
B.E. in Software Engineering GPA: 91.60 / 100.0 (4.58/5.00)
Overall Academic Ranking: 1/259
The Hong Kong University of Science and Technology 2024.01 - 2024.05
Exchange Student with a full scholarship
Academic Experiences
XLANG Lab @ HKU
Research Intern         2023.08 - present
Conducted research on various topics including executable language grounding, tool usage, code
generation and multimodal LLMs.
Advisor: Prof. Tao Yu
Services
Reviewer: ICLR 2025
Honors & Awards
Merit Student of Jiangsu Province (top 1% in Academics), 2024
National Scholarship (Top 0.2% nationwide, the highest honor for
undergraduates in China, 8,000 ¥), 2022