Xinyuan Wang

Hi! I am Xinyuan Wang (王心远).

me.png

I am a Ph.D. student at XLANG Lab, the University of Hong Kong, supervised by Prof. Tao Yu.

I am now working on agentic foundation models, especially computer-use agent models (Kimi K2.5, OpenCUA and Kimi-VL), agent evaluation (Computer Agent Arena, OSWorld-Verified), and agent data synthesis (Jedi, VideoAgentTrek). At UCSD, I worked on automatic LLM prompt optimization (PromptAgent) and LLM Reasoning (LLM Reasoners). I also worked in Prof. Zhuowen Tu’s group, exploring how to improve diffusion models’ conceptual performance.

News

Jan 31, 2026 Kimi K2.5 is released! Ranked #1 on OSWorld leaderboard — the strongest open agentic model. Happy to be part of the team!
Jan 26, 2026 Computer Agent Arena and VideoAgentTrek are accepted by ICLR 2026!
Oct 11, 2025 🎉 OpenCUA received the Best Paper Award at the COLM AIA Workshop!
Sep 19, 2025 OpenCUA and Jedi are accepted by NeurIPS as Spotlight paper!
Sep 19, 2025 OpenCUA is accepted by COLM 2025 Workshop AIA as Oral paper!

Selected Publications

  1. kimik25.png
    Kimi K2.5: Visual Agentic Intelligence
    Kimi Team
    2026
  2. videoagenttrek.png
    VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos
    Dunjie Lu, Yiheng Xu, Junli Wang, Haoyuan Wu, Xinyuan Wang, and 10 more authors
    2025
  3. opencua_main_fig.png
    Opencua: Open foundations for computer-use agents
    Xinyuan Wang, Bowen Wang, Dunjie Lu, Junlin Yang, Tianbao Xie, and 6 more authors
    NeurIPS 2025 (Spotlight), COLM 2025 Workshop AIA (Best Paper), 2025
  4. jedi.png
    Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
    Tianbao Xie, Jiaqi Deng, Xiaochuan Li, Junlin Yang, Haoyuan Wu, and 6 more authors
    NeurIPS 2025 (Spotlight), 2025
  5. osworld-verified.png
    Introducing OSWorld-Verified
    Tianbao Xie, Mengqi Yuan, Danyang Zhang, Xinzhuang Xiong, Zhennan Shen, and 12 more authors
    xlang.ai, Jul 2025
  6. kimivl.png
    Kimi-vl technical report
    Kimi Team, Angang Du, Bohong Yin, Bowei Xing, Bowen Qu, and 6 more authors
    arXiv preprint arXiv:2504.07491, Jul 2025
  7. arena.png
    Computer Agent Arena: Compare & Test Computer Use Agents on Crowdsourced Real-World Tasks
    Bowen Wang, Xinyuan Wang, Jiaqi Deng, Tianbao Xie, Ryan Li, and 11 more authors
    Jul 2025
  8. llm_reasoners_preview.png
    LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
    Shibo Hao, Yi Gu, Haotian Luo, Tianyang Liu, Xiyan Shao, and 6 more authors
    COLM 2024, Jul 2024
  9. promptagent_header.png
    PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
    Xinyuan Wang, Chenxi Li, Zhen Wang, Fan Bai, Haotian Luo, and 4 more authors
    ICLR 2024, Jul 2024
  10. medicalbert.png
    Reduce the medical burden: An automatic medical triage system using text classification BERT based on Transformer structure
    Xinyuan Wang, Make Tao, Runpu Wang, and Likui Zhang
    In 2021 2nd International Conference on Big Data & Artificial Intelligence & Software Engineering (ICBASE), Jul 2021