Yuanli Wang (王远立)

I'm a Ph.D. candidate at Complex Analytics & Scalable Processing (CASP) Research Lab at Boston University, and I am working with Vasiliki (Vasia) Kalavri.

My research interests are in Agentic AI systems and applications (Coding Agents, Web Agents). I also work on distributed systems and data stream processing systems.

Email: yuanliw at bu dot edu  /  CV  /  Github  /  Google Scholar

profile photo

Selected Publications
  • Agentic AI systems and applications
  • DeployBench: Benchmarking LLM Agents for Research Artifact Deployment
    Yuanli Wang, Yaoyao Qian, Yue Zhang, Hanhan Zhou, Jindan Huang, Tianfu Fu, Qiuyang Mang, Huanzhi Mao, Wenhao Chai, Wendong Fan, Liqiang Jing
    Preprint [Code]
    SkillsBench: Benchmarking how well agent skills work across diverse tasks
    The SkillsBench Team
    Preprint [Website]
    Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
    The Terminal-Bench Team
    ICLR 2026 [Website]
    WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation
    Yaoyao Qian, Yuanli Wang, Jinda Zhang, Yun Zong, Meixu Chen, Hanhan Zhou, Jindan Huang, Yifan Zeng, Xinyu Hu, Chan Hee Song, Danqing Zhang
    NeurIPS 2025 Workshop on Multi-Turn Interactions in Large Language Models [Demo]
  • Distributed systems
  • CAPSys: Contention-aware task placement for data stream processing
    Yuanli Wang, Lei Huang, Zikun Wang, Vasiliki Kalavri, Ibrahim Matta.
    EuroSys 2025 [code] [poster] [Tech report]
    The Non-Expert Tax: Quantifying the cost of auto-scaling in Cloud-based data stream analytics
    Yuanli Wang, Baiqing Lyu, Vasiliki Kalavri
    BiDEDE’22, co-located with SIGMOD’22 [slides] [poster]
    A New Benchmark Harness for Systematic and Robust Evaluation of Streaming State Stores
    Esmail Asyabi, Yuanli Wang, John Liagouris, Vasiliki Kalavri, Azer Bestavros
    EuroSys 2022 [code] [slides]
    HACCS: Heterogeneity-Aware Clustered Client Selection for Accelerated Federated Learning
    Joel Wolfrath, Nikhil Sreekumar, Dhruv Kumar, Yuanli Wang, Abhishek Chandra
    IPDPS 2022 [code]
    Accelerated Training via Device Similarity in Federated Learning
    Yuanli Wang, Joel Wolfrath, Nikhil Sreekumar, Dhruv Kumar, Abhishek Chandra
    EdgeSys 2021, co-located with EuroSys 2021 [talk]

    Work Experiences
  • Megagon Labs , Research Intern , 05/2025 - 08/2025
  • LLM based multi-agent system for decision making under uncertainty.

  • Apple , AIML - Data Processing Platform Intern , 05/2023 - 08/2023
  • Integrated OpenLineage framework with Flink to track data lineage for Apple’s AIML data processing platform.

  • PingCAP , Database Engineer Intern , 05/2019 - 08/2019
  • Worked on AutoTiKV project from scratch: use machine learning to tune database under user-specific workloads.


    Invited Talks
  • Impact of Scheduling for Terminal Agent Workloads on Unified-Memory Workstations. 2026 North East AI Agents Day
  • Adaptive Data Stream Processing on Hybrid Clouds. 2026 New England Systems Day
  • CAPSys: Contention-aware task placement for data stream processing. Tufts University, Oct.2024
  • Towards a cost-efficient and QoS-aware self-managed stream processing system. Meta, July.2022

  • Professional Services
  • Reviewer for Journals: Future Generation Computer Systems, Internet of Things Journal
  • Reviewer / Program Committee for Conferences: NeurIPS 2026, ICLR 2026, ICASSP 2026, ACM CAIS 2026 (Demo Track), IJCAI 2026 (Demo Track), ICLR 2025 , ICASSP 2025 , IJCNN 2025 , NAACL 2025 (Demo Track) , IJCAI 2025 (Demo Track), ICDCS 2024 (sub-reviewer) , EuroSys 2022 (Shadow PC) , IMC 2022 (Shadow PC)
  • Reviewer / Program Committee for Workshops: DL4C @ ICML 2026, SEA @ NeurIPS 2025, DL4C @ NeurIPS 2025, DL4C @ ICLR 2025, LLM Reason and Plan @ ICLR 2025, SCI-FM @ ICLR 2025, Multi-Agent AI in the Real World @ AAAI 2025 , ML and Compression @ NeurIPS 2024
  • Artifact Evaluation Committee: SIGCOMM 2021, SOSP 2021, MLSys 2023

  • Teaching
  • Teaching Fellow, CAS CS 551 Streaming and Event-driven Systems, Boston University, Spring 2024
  • Teaching Fellow, CAS CS 210 Computer Systems, Boston University, Fall 2022
  • Teaching Assistant, CSCI 5105 Distributed Systems, University of Minnesota, Spring 2021
  • Teaching Assistant, CSCI 5103 Operating Systems, University of Minnesota, Fall 2020

  • Honors and Awards
  • OpenAI Researcher Access Program
  • Rank 16/183 in 2018 ACM-ICPC North Central North America Regional Contest
  • Bronze Medal, 2015 China Collegiate Programming Contest

  • Personal
    My reading notes of system papers.
    I love traveling and collecting old computer hardware. Here are the albums of my photography and collections.


    Using template from jonbarron.