PABU: Progress-Aware Belief Update for Efficient LLM Agents

North Carolina State University
A Preprint

Abstract

Large Language Model (LLM) agents commonly condition actions on full action--observation histories, which introduce task-irrelevant information that easily leads to redundant actions and higher inference cost. We propose Progress-Aware Belief Update (PABU), a belief-state framework that compactly represents an agent's state by explicitly modeling task progress and selectively retaining past actions and observations. At each step, the agent predicts its relative progress since the previous round and decides whether the newly encountered interaction should be stored, conditioning future decisions only on the retained subset. Across eight environments in the AgentGym benchmark, and using identical training trajectories, PABU achieves an 81.0% task completion rate, outperforming previous State of the art (SoTA) models with full-history belief by 23.9%. Additionally, PABU's progress-oriented action selection improves efficiency, reducing the average number of interaction steps to 9.5, corresponding to a 26.9% reduction. Ablation studies show that both explicit progress prediction and selective retention are necessary for robust belief learning and performance gains.

Methodologies

Experiments

First research result visualization

Main Result: Evaluating task completion and efficiency on the AgentGym suite.

For each method, we report the task completion rate (↑ succ %, higher is better) and the number of interaction steps (↓ step #, lower is better). Aggregated performance across eight tasks is summarized in the ∑ column, and the best-performing model is highlighted in bold.

BibTeX

@misc{jiang2026pabuprogressawarebeliefupdate,
      title={PABU: Progress-Aware Belief Update for Efficient LLM Agents}, 
      author={Haitao Jiang and Lin Ge and Hengrui Cai and Rui Song},
      year={2026},
      eprint={2602.09138},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2602.09138}, 
}