

What it does

The hard-constraint filter (lib/arbitration/constraints.ts) drops offers that fully violate a budget, inventory, or frequency cap. Lagrangian arbitration runs after that filter and softly penalizes the survivors that are close to a cap, so traffic naturally rotates toward less-saturated offers. The penalty is computed from the dual variables (shadow prices λ_k) of a relaxed resource-allocation problem.
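The penalty has the standard Lagrangian-relaxation form. A sketch, assuming c_{ik} is candidate i's unit consumption of resource k and b_k is the remaining cap (these symbols are illustrative, not taken from the codebase):

```latex
% Relaxed resource-allocation problem (illustrative):
%   maximize    \sum_i \mathrm{baseScore}_i \, x_i
%   subject to  \sum_i c_{ik} \, x_i \le b_k   for each resource k.
% Dual ascent yields shadow prices \lambda_k \ge 0, and each surviving
% candidate is re-scored by its dual-priced resource consumption:
\mathrm{adjustedScore}_i \;=\; \mathrm{baseScore}_i \;-\; \sum_{k} \lambda_k \, c_{ik}
```

Offers with ample slack have λ_k ≈ 0, so their scores pass through unchanged; offers near a cap carry a large λ_k on the binding constraint and get pushed down.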

When it adds value

| Scenario | Hard filter alone | Lagrangian on top |
| --- | --- | --- |
| All caps with ample slack | All offers eligible, ranked by baseScore | Same — λ ≈ 0, scores unchanged |
| One offer at 95% of daily cap | Eligible until cap hits, then drops | Slight score penalty as it approaches the cap → other offers rotate in |
| Cross-offer / channel quotas | Not modeled | Modeled (W12, hot-path wire shipped) — see "Cross-offer constraints" below |
W12 ships the cross-offer wire on the realtime /api/v1/recommend hot path: the rank node now combines per-offer Lagrangian constraints (budget + inventory) with cross-offer constraints (channel_quota, portfolio_budget, category_cap) in a single solve. The batch path (lib/batch-executor.ts) still uses the per-offer-only applyLagrangianAdjustment helper from W5.1; the hot-path realtime wire is the new applyRealtimeArbitration helper at lib/arbitration/realtime-wire.ts. Both share the same tenantSettings.aiAnalyzerSettings.arbitration.lagrangianEnabled flag and noOp/solverFailed semantics.

Configuration

The flag lives inside aiAnalyzerSettings:
```json
{
  "aiAnalyzerSettings": {
    "arbitration": {
      "lagrangianEnabled": true
    }
  }
}
```
Default: off for every tenant. There is no global env-var kill switch yet — flipping the tenant flag back to false is the incident-response path.
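A minimal sketch of a default-off flag read consistent with the JSON above. The helper and interface names are illustrative, not the real codebase API:

```typescript
// Illustrative only: strict === true means any missing, null, or
// string-encoded value leaves the wire OFF, matching the default-off contract.
interface TenantSettings {
  aiAnalyzerSettings?: { arbitration?: { lagrangianEnabled?: unknown } };
}

function isLagrangianEnabled(settings: TenantSettings | null | undefined): boolean {
  return settings?.aiAnalyzerSettings?.arbitration?.lagrangianEnabled === true;
}
```

The strict `=== true` check doubles as cheap input validation: a tenant row with `"lagrangianEnabled": "true"` (string) stays off rather than half-enabling the wire.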

What gets logged

Per batch run with the flag on, lib/batch-executor.ts emits a single aggregated log line at completion:
```
INFO lagrangian.applied {
  runId, tenantId,
  attempted, noOp, solverFailed, nonConverged,
  defaultedCostOffersCount, malformedConfigOffersCount,
  defaultedCostOffers: [<offerIds>],
  malformedConfigOffers: [<offerIds>]
}
```
| Field | Meaning |
| --- | --- |
| attempted | Customers where Lagrangian was invoked (post-contact-policy survivors > 0) |
| noOp | Calls that returned without modeling any constraint (no offers had budget/inventory) |
| solverFailed | Calls where the solver threw and we fell back to base scores |
| nonConverged | Calls where the solver ran but did not converge within tolerance |
| defaultedCostOffers | Offers with budget configured but no per-impression cost — solver ran against a synthetic 1¢ unit cost |
| malformedConfigOffers | Offers whose budget or inventory JSON had wrong types (string-encoded numbers, NaN, negatives) |
If solverFailed > 0 or the tenant-settings lookup itself failed, the line is logged at error level instead of info. When the flag is off, nothing is logged for Lagrangian — no volume noise on tenants that haven’t opted in.

How the wire works

  1. The batch executor reads tenantSettings.aiAnalyzerSettings once per run to check whether Lagrangian arbitration is enabled.
  2. On every customer iteration, after the hard-constraint filter and before ranking, the Lagrangian adjustment runs against the surviving candidates.
  3. The helper builds one Lagrangian constraint per offer with non-zero remaining budget AND/OR non-zero remaining inventory.
  4. solveLagrangian runs dual ascent (200 iterations max, tolerance 1e-4, step size 0.1).
  5. Returned adjustedScores overwrite candidate.score for ranking.
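The loop in step 4 can be sketched as a toy dual-ascent solver. This is a hedged illustration of the technique, not the real `solveLagrangian`: the constraint shape, the greedy single-pick primal step, and the subgradient update are assumptions; only the parameters (200 iterations, tolerance 1e-4, step 0.1) come from the text above.

```typescript
// Toy dual-ascent sketch; illustrative only, not the codebase implementation.
interface ToyConstraint {
  rhs: number;    // remaining budget/inventory (the cap's right-hand side)
  cost: number[]; // cost[i] = resource consumed if candidate i is picked
}

function toySolveLagrangian(
  baseScores: number[],
  constraints: ToyConstraint[],
  maxIter = 200,
  tol = 1e-4,
  step = 0.1,
) {
  let lambda = constraints.map(() => 0);
  let adjusted = baseScores.slice();
  for (let iter = 1; iter <= maxIter; iter++) {
    // Penalize each candidate by its dual-priced resource consumption.
    adjusted = baseScores.map(
      (s, i) => s - constraints.reduce((acc, c, k) => acc + lambda[k] * c.cost[i], 0),
    );
    // Greedy primal step: this toy picks the single best candidate.
    const pick = adjusted.indexOf(Math.max(...adjusted));
    // Projected subgradient ascent: raise lambda_k when the pick overshoots
    // the cap, decay toward 0 (clamped at 0) when there is slack.
    const next = lambda.map((l, k) =>
      Math.max(0, l + step * (constraints[k].cost[pick] - constraints[k].rhs)),
    );
    const delta = Math.max(0, ...next.map((l, k) => Math.abs(l - lambda[k])));
    lambda = next;
    if (delta < tol) return { adjusted, lambda, converged: true, iterations: iter };
  }
  return { adjusted, lambda, converged: false, iterations: maxIter };
}
```

With ample slack the duals stay at zero and scores pass through unchanged, matching the "λ ≈ 0, scores unchanged" row above; the real helper additionally reports the noOp/solverFailed outcomes described below.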

Result shape

```typescript
interface ApplyLagrangianResult {
  adjustedScores: Record<string, number>;
  shadowPrices: Record<string, number>;
  usage: Record<string, { used: number; rhs: number; slack: number }>;
  iterations: number;
  converged: boolean;
  /** Legitimate skip — no constraints to model. NOT set when the solver crashed. */
  noOp: boolean;
  /** Solver threw; baseScores were preserved. Distinct from noOp on purpose. */
  solverFailed: boolean;
  candidateCount: number;
  constraintCount: number;
  defaultedCostOfferIds: string[];
  malformedConfigOfferIds: string[];
}
```
noOp and solverFailed are deliberately separate. A “solver crashed” result is not a “no-op” — callers that conflate them will silently mask production failures.
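A hedged sketch of caller-side handling that keeps the two outcomes distinct. The `classify` helper is hypothetical; only the two flag names come from the interface above:

```typescript
// Illustrative only: the shape mirrors the two flags on ApplyLagrangianResult.
type Outcome = { noOp: boolean; solverFailed: boolean };

// The failure branch comes first on purpose: folding solverFailed into noOp
// would silently mask production failures behind "nothing to model".
function classify(r: Outcome): "solverFailed" | "noOp" | "applied" {
  if (r.solverFailed) return "solverFailed"; // solver crashed, base scores kept
  if (r.noOp) return "noOp";                 // legitimate skip, nothing to model
  return "applied";
}
```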

Benchmark

platform/perf/lagrangian-vs-weighted.ts compares the Lagrangian path against a baseline weighted-composite ranker on 1000 synthetic customers × 10 offers (5 inventory-constrained, 5 unconstrained). Run:
```sh
cd platform
npx tsx perf/lagrangian-vs-weighted.ts
```
The first baseline shipped 2026-04-28 at platform/perf/baselines/2026-04-28-lagrangian-bench.json with the following honest result:
```json
{
  "delta": {
    "revenueDeltaPct": 0,
    "giniDelta": 0,
    "latencyOverheadMs": 70.16
  }
}
```
Why zero deltas: the synthetic fixture only models per-offer constraints. Both paths reach the same selection because the hard filter already correctly handles "stock = 0" before the Lagrangian re-rank is consulted. The latency overhead (70.16 ms across the 1000-customer run, ~70 µs per decision) is the cost of the dual-ascent loop on 5 binding constraints. This is a baseline, not a victory lap. The Lagrangian wire's structural advantage will show when the offer model is extended to include channel-quota / portfolio / cross-offer constraints — those constraints couple multiple offers, and the hard filter cannot correctly handle them in a single pass.

Operational checklist

  • Enable for one tenant first via the aiAnalyzerSettings.arbitration.lagrangianEnabled flag.
  • Watch the per-run lagrangian.applied log line. Specifically:
    • solverFailed > 0 → check the error logs for the per-customer Lagrangian adjustment and file an issue.
    • nonConverged > 0 → likely fine on small batches; investigate if it stays > 5 % of attempts.
    • malformedConfigOffersCount > 0 → fix the offer’s budget/inventory JSON shape (numbers, not strings).
    • defaultedCostOffersCount > 0 → solver ran but per-impression cost was unknown; results trustworthy only directionally.
  • If a regression appears, flip the flag back to false. The wire is designed so that flag-off behavior is bit-identical to pre-W5.1.

Cross-offer constraints (W12, realtime hot path)

A dedicated cross-offer-constraint table stores rules that span multiple offers. Three rule types ship today:
| ruleType | Cap meaning | Cost vector |
| --- | --- | --- |
| channel_quota | At most N picks per channel in a window | 1 per pick of an offer whose channel is in config.channels |
| portfolio_budget | At most $X spend across config.offerIds | costPerAction for offers in scope, 0 otherwise |
| category_cap | At most M picks from offers in config.categories | 1 per pick in the category, 0 otherwise |
Per-pick semantics. Cross-offer cost rows are assessed per candidate (per creative-pick), matching the cross-offer.ts test contract. Two creatives of the same offer each pay the cost — the intent is “n picks toward the cap when n creatives are picked.” A “count by offer” rule type is on the roadmap but not in the W12 scope.
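The three cost vectors above can be sketched as one row per rule over the candidate list. This is an illustration of the per-pick semantics, not the real schema: the `Candidate` fields and function names are assumptions.

```typescript
// Illustrative only: candidate fields are assumed, not the real data model.
interface Candidate {
  offerId: string;
  channel: string;
  category: string;
  costPerAction: number;
}

// channel_quota: per-pick semantics — every candidate (creative) whose
// channel matches pays 1, so two creatives of the same offer pay twice.
function channelQuotaRow(cands: Candidate[], channels: string[]): number[] {
  return cands.map((c) => (channels.includes(c.channel) ? 1 : 0));
}

// portfolio_budget: costPerAction for offers in scope, 0 otherwise.
function portfolioBudgetRow(cands: Candidate[], offerIds: string[]): number[] {
  return cands.map((c) => (offerIds.includes(c.offerId) ? c.costPerAction : 0));
}

// category_cap: 1 per pick in the category, 0 otherwise.
function categoryCapRow(cands: Candidate[], categories: string[]): number[] {
  return cands.map((c) => (categories.includes(c.category) ? 1 : 0));
}
```

Note how two creatives of the same offer each contribute to the channel row, which is exactly the "n picks toward the cap when n creatives are picked" contract described above.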

Hot-path wire log

Per realtime recommend call with the flag on, lib/pipeline-runner.ts emits a single realtime arbitration applied log line at the rank node:
```
INFO realtime arbitration applied {
  tenantId, customerId,
  candidateCount,
  perOfferConstraintCount,
  crossOfferConstraintCount,
  noOp, solverFailed, converged, iterations,
  defaultedCostOfferIds, malformedConfigOfferIds
}
```
Field semantics match the batch-path lagrangian.applied line, with two additions: perOfferConstraintCount and crossOfferConstraintCount let you distinguish “no constraints to model” from “only cross-offer constraints active.”

Failure modes (fail-CLOSED)

| Condition | Behavior |
| --- | --- |
| tenantSettings lookup fails | Wire stays OFF; error-level log surfaces the lookup failure. |
| Cross-offer rows lookup fails | loadCrossOfferConstraints already returns [] on Prisma error and logs at warn; wire continues with per-offer constraints only. |
| Solver throws | solverFailed=true, adjustedScores === baseScores, error-level log. Ranking proceeds with original scores. |
| Wire helper itself throws | Outer try/catch in pipeline-runner falls through with original scores; error-level log surfaces the bug. |
| Flag off | Wire is fully skipped — no settings read on the hot path. |
The realtime hot path NEVER returns a 5xx because of the wire — the worst case is degraded mode where scores are unchanged.
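A minimal sketch of that degraded-mode guard, assuming an injected arbitration callback; the function name and shapes here are illustrative, not the pipeline-runner code:

```typescript
// Illustrative only: mirrors the documented fail-closed behavior — the wire
// may degrade scores to the originals but must never surface an error.
type ArbResult = { adjustedScores: Record<string, number>; solverFailed: boolean };

function rankWithGuard(
  baseScores: Record<string, number>,
  arbitrate: () => ArbResult,
): Record<string, number> {
  try {
    const r = arbitrate();
    // solverFailed=true means base scores were preserved; fall back explicitly.
    return r.solverFailed ? baseScores : r.adjustedScores;
  } catch {
    // Degraded mode: never propagate, ranking proceeds with original scores.
    // (A real implementation would also emit the error-level log here.)
    return baseScores;
  }
}
```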

Roadmap

  • W5.2 ships the EXP3-IX online-weights / budget-pacing / goal-seek flags. They live in the same aiAnalyzerSettings.arbitration.* namespace.
  • A future iteration may add a “count by offer” cross-offer rule variant for tenants who want offer-pick semantics rather than creative-pick semantics. Today’s workaround is to limit candidates to one creative per offer upstream of the rank node.