Learning vs. Optimizing Bidders in Budgeted Auctions

2026-04-09Computer Science and Game Theory

Computer Science and Game Theory
AI summary

The authors study how budget limits change how two agents interact repeatedly in auctions. They extend a classic game theory idea called Stackelberg equilibrium to include budgets, showing that strategies must be split into several different phases instead of one fixed plan. They also show that when the learner uses a common simple control method (a Proportional controller) to adjust bids, the strategic agent can't do better than this new budget-aware equilibrium. Their analysis provides evidence that this simple bidding adjustment method is surprisingly strong against manipulation in budget-limited settings.

Stackelberg equilibriumsecond-price auctionsbudget constraintsBayesian settinglearning algorithmsgame theoryProportional controllerPID controllerstrategic interactiontime-multiplexing
Authors
Giannis Fikioris, Balasubramanian Sivan, Éva Tardos
Abstract
The study of repeated interactions between a learner and a utility-maximizing optimizer has yielded deep insights into the manipulability of learning algorithms. However, existing literature primarily focuses on independent, unlinked rounds, largely ignoring the ubiquitous practical reality of budget constraints. In this paper, we study this interaction in repeated second-price auctions in a Bayesian setting between a learning agent and a strategic agent, both subject to strict budget constraints, showing that such cross-round constraints fundamentally alter the strategic landscape. First, we generalize the classic Stackelberg equilibrium to the Budgeted Stackelberg Equilibrium. We prove that an optimizer's optimal strategy in a budgeted setting requires time-multiplexing; for a $k$-dimensional budget constraint, the optimal strategy strictly decomposes into up to $k+1$ distinct phases, with each phase employing a possibly unique mixed strategy (the case of $k=0$ recovers the classic Stackelberg equilibrium where the optimizer repeatedly uses a single mixed strategy). Second, we address the intriguing question of non-manipulability. We prove that when the learner employs a standard Proportional controller (the "P" of the PID-controller) to pace their bids, the optimizer's utility is upper bounded by their objective value in the Budgeted Stackelberg Equilibrium baseline. By bounding the dynamics of the PID controller via a novel analysis, our results establish that this widely used control-theoretic heuristic is actually strategically robust.