Jixuan Chen

Hi there! I am a third-year undergraduate student at Nanjing University, majoring in Software Engineering. Currently I'm working as a Research Assistant at the XLANG Lab (as part of the HKU NLP Group) with Prof. Tao Yu. Before that, I have worked as a research intern with Prof. Shujian Huang at Nanjing University.

My research interests lie in ML and NLP. Nowadays, I am working on benchmarking Multimodal Language Models as agents.

Here is the problem I am thinking about: How to design smarter GUI agents with reinforcement learning (RL) and large language models (LLM)?

Email  /  Semantic Scholar  /  Google Scholar  /  Twitter  /  Github

profile photo

Publication

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu
Website / Paper / Slides / Data Viewer / Code GitHub stars
NeurIPS'24 D&B

OSWorld🖥️: A unified, real computer env for multimodal agents to evaluate open-ended computer tasks with arbitrary apps and interfaces on Ubuntu, Windows, & macOS.

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Yuchen Mao, Wenjing Hu, Tianbao Xie, Hongsheng Xu, Danyang Zhang, Sida Wang, Ruoxi Sun, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu Victor Zhong, Lu Chen, Kai Yu, Tao Yu
arXiv, 2024
Website / Paper / Data Viewer / Code GitHub stars
NeurIPS'24 D&B, Spotlight Presentation

Spider2-V is a multimodal agent benchmark spanning across the entire data science and engineering workflow.

Education

Nanjing University
2021.09 - 2025.07 (Expected)

B.E. in Software Engineering
GPA: 91.60 / 100.0 (4.58/5.00)
Overall Academic Ranking: 1/259
The Hong Kong University of Science and Technology
2024.01 - 2024.05

Exchange Student with a full scholarship

Academic Experiences

XLANG Lab @ HKU

Research Intern         2023.08 - present
Conducted research on various topics including executable language grounding, tool usage, code generation and multimodal LLMs.
Advisor: Prof. Tao Yu

Services

  • Reviewer: ICLR 2025

Honors & Awards

  • Merit Student of Jiangsu Province, 2024
  • National Scholarship (Top 0.2% nationwide), 2022
  • Kwok Shek Pik Yung Scholarship Special Award (RMB 10,000, awarded to top 3 students in the department), 2023
  • Best Paper Award at China Conference on Machine Translation, 2023
  • Outstanding Student, Nanjing University, 2022, 2023, 2024

Miscellanea

  • President, College Student Council; Advanced Individual in Student Union Organization (2022)
  • I really enjoy Chinese calligraphy (Gr. 6) and Ping Pong 🏓.

Template courtesy: Jon Barron.