Google researchers, in collaboration with the University of California, Santa Barbara, have introduced a novel framework aimed at enhancing the efficiency of artificial intelligence (AI) agents in utilizing computing power and tools. This development comes as a response to the increasing complexities associated with agentic AI, particularly the challenge of scaling the use of external tools without incurring significant costs or latency, according to a recent paper published on arXiv.
The focus of this research is particularly pertinent in practical applications such as web search and document analysis, where the volume of external actions can substantially influence the depth of an agent’s inquiry. Each tool call tends to increase the context window, ramp up token consumption, and incur added API costs, which can quickly become burdensome for organizations.
Interestingly, the findings indicate that merely allocating a larger budget to an AI agent does not guarantee improved performance. The researchers point out that many agents often lack awareness of their available resources. This leads to scenarios where an agent may pursue a singular line of inquiry for an extended period, expending numerous tool calls on a seemingly relevant path before realizing that it has hit a dead end. Consequently, additional computing resources are consumed without yielding any meaningful improvement in the quality of results.
To address this inefficiency, the researchers have devised the Budget Tracker, a straightforward module that provides continuous updates to the agent regarding its remaining budget. This tool operates entirely at the prompt level and requires no retraining. By receiving explicit signals about resource consumption, the agent is better positioned to adjust its strategy in real time. In Google’s implementation, the Budget Tracker also incorporates guidelines that suggest appropriate behaviors based on varying budget levels.
Initial experiments utilizing search agents following a ReAct-like method demonstrate the effectiveness of this approach. The paper reports that the Budget Tracker can lead to a reduction of over 40 percent in search calls and nearly 20 percent in browse calls, resulting in a total cost reduction of more than 30 percent. Notably, performance improves even at higher budgets, an area where traditional agents often encounter stagnation.
In addition to this lightweight solution, the paper elaborates on a more comprehensive framework dubbed Budget Aware Test-time Scaling, or BATS. This innovative framework integrates planning, verification, and budget awareness into a cohesive iterative process. The agent is thereby equipped to dynamically adapt its behavior based on the remaining budget, enabling it to make informed decisions about whether to continue exploring or to alter its course of action.
Tests conducted on benchmarks such as BrowseComp and HLE-Search, using Gemini 2.5 Pro as the underlying model, reveal that BATS achieves improved accuracy at a lower cost compared to existing methodologies. This development not only has implications for the operational efficiency of AI agents but could also reshape how organizations allocate resources for AI applications moving forward.
As AI technology continues to evolve, the integration of budget awareness and strategic resource management may become essential for maximizing performance while controlling costs. This research underscores a pivotal step toward making AI agents not just more intelligent, but also more resource-efficient, setting the stage for more sustainable AI practices in the future.
For further details on this research, visit arXiv, or explore Google’s initiatives at Google.
See also
CHROs Lead AI Integration: Transforming Workforce Dynamics and Driving Business Strategy
Traffic Surge Causes CloudFront Connection Failures, Disrupts Major App Access
Chai Discovery Secures $130M to Propel AI-Driven Drug Development, Valued at $1.3B
DeepSeek Launches DeepSeek-R1 LLM; Mistral AI Unveils 1,000 WPS Assistant at AI Action Summit



















































