Abstract: Diverse advertiser demands (brand effects or immediate outcomes) lead to distinct selling patterns (pre-agreed volumes with an under-delivery penalty, or per-auction competition) and pricing patterns (fixed prices or varying bids) in guaranteed delivery (GD) and real-time bidding (RTB) advertising. This necessitates fair impression allocation that unifies the two markets to promote ad content diversity and overall revenue. Existing approaches often deprive RTB ads of equal exposure opportunities by prioritizing GD ads, and coarse-grained methods fall short in the face of three challenges: 1) ambiguous reward due to the varied objectives and constraints of GD fulfillment and RTB utility, which hinders measuring each allocation's contribution to the global interests; 2) intensified competition from the coexistence of GD and RTB ads, which complicates their mutual relationships; 3) policy degradation caused by evolving user traffic and bid landscape, which requires adaptivity to distribution shifts. We propose LIBRA, a generative-adversarial framework that unifies GD and RTB ads through request-level modeling. To guide the generative allocator, we solve a convex optimization problem on historical data to derive hindsight-optimal allocations that balance fairness and utility. We then train a discriminator to distinguish the generated actions from these solved demonstrations of the latent expert policy, providing an integrated reward that aligns LIBRA with the optimal fair policy. LIBRA employs a self-attention encoder to capture the competitive relations among the varying number of candidate ads per allocation. Further, it enhances the discriminator with an information-bottleneck-based summarizer to guard against overfitting to irrelevant distractors in the ad environment. LIBRA adopts a decoupled structure, where the offline discriminator is continuously fine-tuned on newly arriving allocations and periodically guides updates of the online allocation policy to accommodate online dynamics.
LIBRA has been deployed on the Tencent advertising system for over four months, with extensive experiments conducted. Online A/B tests demonstrate significant lifts in ad income (3.17%), overall click-through rate (1.56%), and cost-per-mille (3.20%), contributing a daily revenue increase of hundreds of thousands of RMB.
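
The adversarial rewarding idea described above can be illustrated with a minimal sketch. This is not LIBRA's implementation: the two-dimensional features (hypothetical GD and RTB exposure shares), the plain logistic discriminator, and the reward form -log(1 - D), which is common in generative adversarial imitation learning, are all assumptions chosen for brevity. The sketch only shows how a discriminator trained to separate expert demonstrations from generated allocations can yield a scalar reward.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_discriminator(expert, generated, lr=0.5, epochs=200):
    """Fit a logistic discriminator D(x) = P(x is an expert demonstration).

    `expert` and `generated` are lists of equal-length feature vectors.
    Assumption: a linear model stands in for the paper's neural discriminator.
    """
    dim = len(expert[0])
    w, b = [0.0] * dim, 0.0
    data = [(x, 1.0) for x in expert] + [(x, 0.0) for x in generated]
    n = len(data)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in data:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
            err = p - y  # gradient of the cross-entropy loss w.r.t. the logit
            for i in range(dim):
                grad_w[i] += err * x[i]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def adversarial_reward(w, b, x, eps=1e-8):
    """Reward -log(1 - D(x)): larger when x looks like an expert allocation."""
    d = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
    return -math.log(1.0 - d + eps)

# Hypothetical features: (GD exposure share, RTB exposure share) per allocation.
expert_demos = [[0.50, 0.50], [0.55, 0.45], [0.45, 0.55]]  # balanced, "fair"
generated = [[0.90, 0.10], [0.85, 0.15], [0.95, 0.05]]     # GD-prioritizing

w, b = train_discriminator(expert_demos, generated)
r_fair = adversarial_reward(w, b, [0.50, 0.50])
r_skewed = adversarial_reward(w, b, [0.90, 0.10])
```

In this toy setup the balanced allocation receives a higher reward than the GD-heavy one, which is the signal an allocator would then be trained to maximize.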

Follow the LIBRA: Guiding Fair Policy for Unified Impression Allocation Via Adversarial Rewarding

Libra: A Stateful Layer-4 Load Balancer with Fair Load Distribution

A Unified Guaranteed Impression Allocation Framework for Online Display Advertising

Impression Allocation and Policy Search in Display Advertising

CONFLUX: A Request-level Fusion Framework for Impression Allocation Via Cascade Distillation

A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising

Ad-load Balancing via Off-policy Learning in a Content Marketplace

Online Allocation Rules in Display Advertising

Optimizing Trade-offs Among Stakeholders in Real-Time Bidding by Incorporating Multimedia Metrics

Towards Fairness in Personalized Ads Using Impression Variance Aware Reinforcement Learning

Truthful Auctions for Automated Bidding in Online Advertising

Deep Automated Mechanism Design for Integrating Ad Auction and Allocation in Feed

An Effective Budget Management Framework for Real-Time Bidding in Online Advertising

Know in AdVance: Linear-Complexity Forecasting of Ad Campaign Performance with Evolving User Interest

AIGB: Generative Auto-bidding via Conditional Diffusion Modeling

Bidding Strategies for Proportional Representation in Advertisement Campaigns

Adaptive Risk-Aware Bidding with Budget Constraint in Display Advertising

A Unified Framework for Campaign Performance Forecasting in Online Display Advertising

A dynamic pricing model for unifying programmatic guarantee and real-time bidding in display advertising

Risk-aware dynamic reserve prices of programmatic guarantee in display advertising