Hi there! I am a third-year undergraduate student at Nanjing University, majoring in Software
Engineering. Currently, I'm working as a Research Assistant at the XLANG Lab (part of the HKU NLP
Group) with Prof. Tao Yu. Before that, I worked as a research intern with Prof. Shujian Huang at
Nanjing University.
My research interests lie in ML and NLP. Currently, I am working on benchmarking multimodal
language models as agents.
Here is the problem I am thinking about: how can we design smarter GUI agents with reinforcement
learning (RL) and large language models (LLMs)?
OSWorld🖥️: A unified, real computer environment for evaluating multimodal agents on open-ended
computer tasks with arbitrary apps and interfaces on Ubuntu, Windows, and macOS.
Spider2-V: A multimodal agent benchmark spanning the entire data science and engineering
workflow.
Education
Nanjing University 2021.09 - 2025.07 (Expected)
B.E. in Software Engineering GPA: 91.60 / 100.0 (4.58/5.00)
Overall Academic Ranking: 1/259
The Hong Kong University of Science and Technology 2024.01 - 2024.05
Exchange Student with a full scholarship
Academic Experiences
XLANG Lab @ HKU
Research Intern 2023.08 - present
Conducted research on various topics including executable language grounding, tool usage, code
generation and multimodal LLMs.
Advisor: Prof. Tao Yu
Services
Reviewer: ICLR 2025
Honors & Awards
Merit Student of Jiangsu Province, 2024
National Scholarship (Top 0.2% nationwide), 2022
Kwok Shek Pik Yung Scholarship Special Award (RMB 10,000, awarded to top 3 students in the
department), 2023
Best Paper Award at China Conference on Machine Translation, 2023