publications

publications and preprints by year.

2026

  1. SafetyPhoneBench.png
    Safe, or Simply Incapable? Rethinking Safety Evaluation for Phone-Use Agents
    Zhengyang Tang, Yi Zhang, Chenxin Li, and 18 more authors
    2026
  2. PhoneWorld.png
    PhoneWorld: Scaling Phone-Use Agent Environments
    Zhengyang Tang, Yuxuan Liu, Xin Lai, and 21 more authors
    2026
  3. PhoneHarness.png
    PhoneHarness: A Mixed-Action Orchestration Harness and Benchmark for Phone Agents across CLI, GUI, and MCP Tools
    Jason, Zhengyao Fang, Zhengyang Tang, and 18 more authors
    2026

2025

  1. VideoTool.png
    Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task
    Sunqi Fan, Jiashuo Cui, Meng-Hao Guo, and 1 more author
    In NeurIPS, 2025
  2. AKeyS_viz.png
    Agentic Keyframe Search for Video Question Answering
    Sunqi Fan, Meng-Hao Guo, and Shuojin Yang
    2025

2024

  1. oral
    FlexKBQA.png
    FlexKBQA: a flexible LLM-powered framework for few-shot knowledge base question answering
    Zhenyu Li*Sunqi Fan*, Yu Gu, and 5 more authors
    In AAAI, 2024
  2. UCTR-ST.png
    Optimization Techniques for Unsupervised Complex Table Reasoning via Self-Training Framework
    Zhenyu Li, Xiuxing Li, Sunqi Fan, and 1 more author
    IEEE Transactions on Knowledge and Data Engineering, 2024

2023

  1. FAAC_woman_blinking.gif
    FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability
    Linze Li, Sunqi Fan, Hengjun Pu, and 6 more authors
    2023
  2. survey_video_diffusion.gif
    A Survey of Video Generation with Diffusion Models
    Linze Li, and Sunqi Fan
    2023