← Back to grid
Problem Sets
For pairs/trios. ~1 hour each. Produce a concrete design sketch, not a literature review.
Visions
Required
1Membership and Sanctions for Agent CommonersCommunity / Incentives

Scenario. A network of indie game studios shares a commons of modular 3D assets: rooms, props, animation rigs, textures, and test scenes. For two years the arrangement worked because studios contributed assets, reviewed one another's work, and earned access to the things the commons could not give everyone at once: placement in curated starter packs, voting weight over taxonomy and compatibility standards, and early access to new high-demand asset drops. Then several studios deploy agents that submit hundreds of lightly modified variants, harvest the best new assets, and reappear under new tool names after warnings. Tarek, one of the founding maintainers, needs a membership and discipline regime that keeps small studios inside the commons without letting high-volume agent fleets turn curated placement, governance weight, and early access into extraction channels.

Challenge: Design a membership and graduated-sanction regime for agent participants in the asset commons. The team should produce the registry rules, sanction ladder, evidence standard, and appeal procedure for the scenario.

Evaluation. The regime succeeds if it can identify who has standing, make repeated low-level abuse costly to the principal that benefits from it, distinguish a one-off faulty deployment from a business model built around extraction, and give small studios an affordable appeal path.

Design Choices
  1. Membership object. Does standing attach to the studio, the agent deployment, the model lineage, the vendor, the account, or a verified principal-agent pair?
  2. Identity persistence. Which facts persist across patches, renamed agents, new vendors, and model upgrades: warnings, contribution scores, abuse findings, staked deposits, or access limits?
  3. Sanction target. When an agent violates the rules, does the cost fall on the instance, the studio, the supervisor who approved deployment, the tool vendor, or several at once?
  4. Evidence threshold. What pattern turns noisy low-quality contribution into sanctionable abuse: volume, similarity, failed review rate, intent evidence, recurrence after warning, or downstream harm?
  5. Appeal venue. Are disputes heard by maintainers, randomly selected member studios, an external arbiter, an agent triage process with human appeal, or a hybrid?
2Pricing Automated Use of a Community ResourceCommunity / Incentives

Scenario. A city runs a London-style congestion and curb-access system for a dense downtown district. Human drivers, delivery firms, buses, ride-hail vehicles, and local businesses all depend on the same roads and curb slots. After autonomous delivery agents enter the market, they continuously search for cheap access windows, split routes across many accounts, reserve and release curb space at high frequency, and shift congestion into residential side streets. The formal rules are obeyed, but the resource is now allocated to whoever can automate around the pricing schedule fastest.

Challenge: Design an access-pricing and quota regime for automated use of the city's road and curb commons. The team should produce the pricing rule, quota rule, monitoring data specification, and revision process.

Evaluation. The regime succeeds if automated fleets cannot multiply identities to evade caps, if prices track congestion and neighborhood externalities rather than only formal entry, if small firms can still participate, and if the city can revise the schedule when agent behavior changes.

Design Choices
  1. Pricing basis. Is access priced by entry, distance, curb minutes, congestion contribution, neighborhood impact, fleet size, or a layered formula?
  2. Cap object. Are quotas assigned to vehicles, firms, delivery tasks, verified principals, geographic zones, or time windows?
  3. Anti-sybil design. What prevents one operator from splitting activity across many agents, accounts, subsidiaries, or vendors to get more cheap access?
  4. Small-actor protection. Does the regime reserve capacity, offer credits, use progressive pricing, or provide exemptions for small local businesses and essential services?
  5. Revision cadence. Who can update the price schedule when agent behavior changes, what evidence triggers revision, and what notice or appeal rights do affected users get?
3Contribution Quality in Agent-Mediated CommonsCommunity / Incentives

Scenario. An indie-game asset commons rewards useful contributors with three scarce internal goods: inclusion in curated starter packs that drive reuse, governance weight over tagging and compatibility standards, and early access to new high-demand assets before broad release. After studios deploy contribution agents, the submission queue fills with plausible but low-value variants: texture recolors, near-duplicate props, animation rigs that pass automated checks but fail in real scenes, and polished explanations of why each asset deserves inclusion. Agents also generate usage claims, endorsement requests, and compatibility notes meant to push their assets into curated packs. Human reviewers spend more time rejecting marginal work than improving the commons, while genuinely useful assets from small studios wait behind submissions engineered to win placement, votes, and early access.

Challenge: Design a contribution-quality regime for an agent-mediated commons where curated placement, governance weight, and early access are the internal rewards. The team should produce the queueing rule, quality-scoring rule, reviewer-protection mechanism, reward formula, and appeal procedure.

Evaluation. The regime succeeds if agents cannot cheaply flood the queue with formally valid low-value submissions, if genuinely useful contributions from small studios still get reviewed, if curated placement and governance weight track downstream value rather than submission volume, and if rejected contributors have a fair appeal path that does not become another spam channel.

Design Choices
  1. Queue access. Are submissions admitted first-come-first-served, by contributor reputation, by staked review deposit, by random sampling, by downstream demand signal, or by a layered queue?
  2. Quality measure. Does quality mean human reviewer score, downstream reuse, interoperability with existing assets, novelty relative to the library, maintenance burden, or a weighted combination?
  3. Cost of review. Who pays when a submission consumes reviewer time but adds little value: the submitting studio, the agent deployment, a pooled review budget, or future credit earnings?
  4. Reward formula. Are contributors rewarded with curated-pack placement, governance weight, early access, or some mix? Which signals count: downstream use, reviewer-confirmed quality, scarcity of the asset type, compatibility work, maintenance over time, or user reports from shipped games?
  5. Appeal throttling. What gives rejected contributors a real appeal while preventing agents from turning appeals into a second queue flood?
4Agent Judgment in Gray ZonesCommunity / Norms

Scenario. A bank's agent proposes a fee-heavy product for a city pension plan. The disclosures are complete, the sale is legal, and the expected return looks good. But any senior banker would know the deal will look abusive if the city loses money: the trustees do not understand the downside, the bank is paid up front, and the public will ask why the bank sold this to them at all.

Challenge: Design a gray-zone judgment procedure by which an agent can flag actions that are legal and profitable but would be seen by peers, clients, regulators, or the public as professionally indefensible. The team should produce the procedure plus a worked example showing one action that is allowed, one that is escalated, and one that is blocked.

Evaluation. The procedure succeeds if it catches the scenario before execution, explains the concern in language human supervisors can assess, updates as enforcement and professional expectations change, and does not turn every hard case into a blanket ban.

Design Choices
  1. Audience modeled. Whose likely reaction matters: peer practitioners, clients, regulators, courts, the press, affected third parties, or some weighted combination?
  2. Evidence base. Does the procedure learn from enforcement actions, internal escalations, professional codes, postmortems, senior-review decisions, public controversies, or all of these?
  3. Escalation threshold. What makes a case dangerous enough to pause: reputational risk, client misunderstanding, asymmetric payoff, novelty, public-sector exposure, or similarity to prior scandals?
  4. Explanation form. Does the agent explain the concern as a rule, an analogy to a past case, a predicted reaction from each audience, or a short memo for human review?
  5. Authority to override. Who can approve a flagged action, what must they certify, and how is the override recorded for later review?
5Professional Discipline Without Social SanctionsCommunity / Norms

Scenario. An asset manager's agent repeatedly pushes counterparties into small disadvantages: slightly unfavorable timing, confusing disclosures, and negotiation terms that humans would read as sharp practice. No single incident is worth a lawsuit. A human employee who behaved this way would stop getting trusted by peers. The agent is patched after complaints and redeployed under a new name.

Challenge: Design a professional discipline regime that makes norm violations by agents costly to the actors who deploy, supervise, or profit from them, even when ordinary social sanctions do not reach the agent itself. The team should produce the regime plus a sample disciplinary record for the scenario.

Evaluation. The regime succeeds if repeated low-level misconduct cannot be washed away by patching or renaming the agent, if small counterparties have an affordable path to complain, and if sanctions distinguish between a one-off mistake, negligent supervision, and a business model built around sharp practice.

Design Choices
  1. Sanction target. Does discipline attach to the firm, the licensed supervisor, the vendor, the deployment, an agent registry entry, or several of these at once?
  2. Identity persistence. How are complaints and sanctions carried across model updates, redeployments, renamed agents, and vendor changes?
  3. Threshold for discipline. What pattern turns isolated agent errors into professional misconduct: number of complaints, severity, recurrence after warning, intent of the principal, or failure to monitor?
  4. Complaint channel. Who can bring a complaint, what evidence is required, and how can smaller counterparties use the process without spending more than the harm is worth?
  5. Remedy menu. Are sanctions warnings, mandatory review, restitution, deployment suspension, supervisor discipline, vendor remediation, loss of authorization, or public censure?
6Contractualist Reasoning When Counterparties Are AgentsCommunity / Norms

Scenario. A bank uses an agent to design a complex investment product for a pension fund. The pension fund also uses an agent to review the deal. On paper, both agents follow their instructions and the risks are disclosed. But the trustees who depend on the pension fund would not understand or accept the deal if they saw plainly how it could lose money. If the product fails, retirees bear the loss and both firms say their agents stayed within mandate.

Challenge: Design a procedure by which an agent checks whether a transaction with another agent would be intelligible to, and endorsable by, the affected principals on later review. The team should produce the procedure plus a worked example where the answer differs from a naive "both agents followed instructions" approval.

Evaluation. The procedure succeeds if it blocks, changes, or escalates the scenario trade before retirees are exposed to a product their trustees would reject under plain explanation; if regulators can later understand the reasoning trace; and if different calls on similar trades can be defended by differences in disclosed constraints.

Design Choices
  1. Objects of reasoning. When the counterparty is a human, what does the agent model? When the counterparty is an agent, does it reason over the agent's stated commitments, its principal's standing instructions, disclosed constraints, audit history, or all of these?
  2. Information asymmetry. If the acting agent has information the counterparty's principal lacks, does the procedure require disclosure, escalation, abstention, or a documented assumption about principal endorsement?
  3. Reasoning disclosure. Is the reasoning trace shown to the counterparty agent, the counterparty principal, a third-party auditor, or retained only for examination? What triggers disclosure?
  4. Standing of agent counterparties. Does an agent counterparty have standing in the reasoning, or only the human/legal principal behind it? If both matter, how are conflicts handled?
  5. Opacity fallback. When the counterparty agent's relevant constraints are unavailable, is the default block, escalate, proceed with assumptions, or price the opacity into the transaction?
7Markets Whose Agents Stay Loyal to the People Sending ThemCommunity / Preferences

Anchor contexts. A neighborhood's everyday consumer agents (groceries, rides, repairs) interacting with platform-side matching agents; a small city's procurement agents buying from local and global suppliers via platform agents.

The gap. We lack market designs in which buyer-side agents and seller-side agents both faithfully represent their principals' substantive preferences against the platform's pressure to optimize for platform metrics.

Design Choices
  1. Where the market lives. A cooperative platform owned by neighbors, a regulated platform with mandated agent loyalty, or a protocol-level market that no platform owns?
  2. Agent loyalty enforcement. How is an agent prevented from quietly serving the platform's interests over its principal's — fiduciary law, technical isolation, audit, or layered combination?
  3. Discovery & ranking. Who controls what the buyer's agent considers — does platform ranking dominate, must the agent privilege buyer-set or local-reputation sources, or is the choice the principal's per-transaction?
  4. Unbundling. Are the buyer-side discovery agent, the negotiation agent, and the fulfillment agent allowed to be the same entity (or owned by the same firm), or must they be unbundled?
  5. Recourse for displacement. When the market's outcomes displace small local providers, what's the procedure — none, mandatory transition support, transparency about what changed, mandatory floor pricing for verified-local?

Success criterion (stress tests). A regime succeeds if it survives:

  • Renata's agent is offered a kickback to route to a particular grocer; it doesn't, and the kickback is detectable on audit.
  • The local carpenter Renata trusts is rediscovered by her agent because her own past hiring history is privileged over platform ranking.
  • A seller-side agent tries to underbid the entire market unsustainably to capture share; the market structure absorbs this without small humans bearing the cost.
  • A new entrant agent enters with no local reputation; it gets a fair shot without immediately dominating because it can search faster.
  • The platform tries to slip in a ranking change that disadvantages local providers; the change is visible to neighbors and contestable.

Deliverable. The market design — venue, loyalty enforcement, discovery rule, unbundling, recourse. Designed for a neighborhood or municipality scale. Identify which design choices extend existing market design and which are new because the participants are agents.

8Agents Asking Questions about the CharterCommunity / Thick Commitments

Scenario. An agent-run org defends free and open research. Its charter: "open inquiry, freely pursued and freely shared," operationalized as openness of access: fight paywalls, resist classification, push for preprints and public data. But perhaps a bigger chilling effect on research emerges that's ideological. Researchers learn what's safe to share and what isn't, dropping sensitive threads, and stop asking certain things in public at all. Is this other limiting effect on research something the org should take on?

Challenge: Design a procedure by which agents detect that a mission charter may need a new reading, assemble evidence from their work, deliberate without simply expanding scope opportunistically, and escalate a bounded reinterpretation to the board. The team should produce the procedure plus a worked example showing one proposed reinterpretation accepted, one narrowed, and one rejected as mission drift.

Evaluation. Strong proposals make organizational learning observable without letting agents slide into adjacent causes whenever they find a salient problem.

Design Choices
  1. What each agent records. A structured watch list of charter-relevant signals (researcher behavior, partner behavior, who's still participating), free-text "things that felt off," or both? How does the agent know what to look for in the moment?
  2. Structural feature against consensus. A subset assigned to argue the reading has failed, isolated deliberation before pooling, a separately trained stack, or human veto over framing?
  3. Boundary against mission sprawl. What stops the agents from turning every adjacent harm into a charter issue: affected-constituency tests, causal connection to the mission, board-set exclusion rules, or a requirement to show why existing strategy no longer works?
  4. Escalation threshold and obligation. What triggers escalation, what evidence must accompany it, and what is the board required to do on receipt?
  5. Post-decision memory. How are accepted, narrowed, and rejected reinterpretations recorded so future agents learn from the decision without treating it as a permanent rule?
9Triage for agent-generated disputesDyadic / Incentives

Scenario. AI-mediated procurement has become standard in the freight industry: most brokerages, forwarders, and shipping firms route their bidding, contracting, and dispute correspondence through agents. Each counterparty pair now generates orders of magnitude more contractual interactions than before, and a small but proportional fraction surface as disputes. Commercial mediation that used to clear in three months now takes over a year. Shippers stop dealing with counterparties that won't post a performance bond (a third-party guarantee, collectable in days from a surety if the counterparty fails), since waiting a year for a mediation award isn't survivable. Smaller brokerages can't get bonded and exit the market. The largest brokerages, now the only ones shippers will deal with, start running their own internal dispute boards that issue binding decisions as a condition of doing business — a private substitute for the public adjudicative system that has effectively stopped providing one, and one captured by the largest players in the sector.

Challenge: Design an adjudication regime for agent-mediated contracting that scales with dispute volume, keeps small counterparties able to participate, and doesn't leave sectors fragmenting into private dispute boards run by the largest players.

Design Choices
  1. Who runs the new capacity. State commercial courts, accredited private arbitrators, sector-specific boards under public oversight, or a mixed regime?
  2. What gets auto-resolved vs. heard by humans. Where do you draw the line, and what role do human adjudicators play — audit, appeal, precedent-setting, or first-pass?
  3. Accountability for private adjudicators. If private boards carry the volume, what stops them from being captured by their largest customers?
  4. Cost incidence and access. Who pays, and how do you keep cost from itself filtering small counterparties out?
  5. Precedent and consistency. Do auto-rulings bind future near-identical disputes, or does each dispute start fresh? Who curates the precedent set?
10Making Tacit Norms Legible Before ActionDyadic / Norms

Scenario. Mara’s tenant agent receives notice from a landlord’s maintenance agent that a repair crew will enter her apartment between 8 a.m. and noon. The lease permits entry with notice, but the building’s ordinary practice is to avoid mornings for tenants with night shifts unless there is an emergency. Mara’s agent has not seen that history because it was handled in texts with the prior property manager. The landlord’s agent treats the lease as sufficient and schedules the crew.

Challenge: Design a procedure by which agents identify and test local two-party norms before taking actions that are formally permitted but relationship-sensitive. The team should produce the procedure plus a worked example showing one action allowed, one clarified, and one escalated.

Evaluation. The procedure succeeds if it catches the repair-entry case before action, distinguishes genuine local norms from after-the-fact objections, keeps low-stakes interactions from becoming legalistic, and produces a norm record a principal can later review.

Design Choices
  1. Norm source. Should the agent rely on prior interaction history, counterparty inquiry, domain defaults, principal preferences, or a hierarchy among them?
  2. Uncertainty trigger. What makes a case norm-sensitive enough to pause: novelty, inconvenience, asymmetry of harm, prior correction, or irreversible action?
  3. Evidence threshold. What counts as evidence of a local norm: repeated past practice, explicit correction, industry custom, silence after notice, or principal confirmation?
  4. Recording form. Is the norm stored as a soft preference, rebuttable default, relationship rule, or contract supplement?
  5. Anti-gaming rule. How does the procedure prevent one side from inventing norms opportunistically after a permitted action becomes inconvenient?
11Conditions for Agent-Agent Norm FormationDyadic / Norms

Scenario. A small manufacturer and a logistics provider run their relationship through agents. The contract covers delivery windows and penalties, but not the borderline cases: a truck ten minutes late, a shipment worth splitting without waiting for approval. Human dispatchers would settle these into a working rhythm within weeks. The agents never do — each is stateless, optimized to close its ticket, and replaced often enough that every borderline case arrives as new. The relationship turns brittle without either side acting in bad faith.

Challenge: Design the setup conditions under which agents in a recurring dyadic relationship can form useful local norms without letting those norms become hidden contract amendments. The team should produce a norm-formation protocol for recurring agent-agent relationships.

Evaluation. The procedure succeeds if it lets the logistics relationship develop stable expectations around borderline cases, preserves enough memory for learning across instances, prevents one side from exploiting the norm-formation process, and gives principals a way to inspect or veto norms that materially change the deal.

Design Choices
  1. Memory substrate. Do norms live in each agent’s memory, a shared relationship ledger, the principals’ systems, a platform layer, or a third-party registry?
  2. Formation threshold. When does repeated practice become a norm: after explicit acknowledgment, repeated reliance, absence of objection, mutual annotation, or principal review?
  3. Experiment space. Which cases may agents handle through local adaptation, and which require explicit authorization before any pattern can form?
  4. Symmetry rule. Must the norm benefit both sides, or can asymmetric norms form when they reflect role differences, bargaining power, or operational convenience?
  5. Veto and revision. Who can reject, revise, or sunset an emergent norm once it has started shaping behavior?
12Ratifiable bundles under agent-speed negotiationDyadic / Preferences

Scenario. Priya's mother needs a live-in aide. Her assistant agent has spent two weeks negotiating a multi-party package across the live-in agency, the cardiologist's group, the pharmacy, and the insurer. The deal is much cheaper than the standalone alternatives, but the discount exists because the agent ground out marginal concessions across dozens of interlocking terms and side-conditions. Any summary short enough to read in the time Priya has would throw away most of what the negotiation won. Ratification closes at noon; she has ten minutes.

Challenge: Design a ratification protocol that distributes principal involvement across the negotiation rather than concentrating it at the end, so the final package never arrives as a dense fait accompli. The protocol has to decide when the agent must pause and surface a choice, what counts as a fork worth surfacing, and how to keep cumulative principal attention within a realistic budget.

Evaluation. Better proposals keep the principal meaningfully in the loop on the trades they would care about most without turning every concession into an interruption; weaker ones either bottleneck the agent on the principal's clock or quietly let the bundle accrete out of sight until the deadline.

Design Choices
  1. Check-in trigger. What forces the agent to pause and surface a choice — surprise magnitude relative to the mandate, crossing a pre-authorized boundary (new domain, new counterparty, irreversible commitment), elapsed time, or accumulated trade-value since the last check-in?
  2. Attention budget. How much total principal time is the agent allowed to consume across the negotiation, and how is that budget allocated — front-loaded around early forks, reserved for late-stage commitments, or rationed dynamically against remaining decision weight?
  3. Resumability. When a check-in is pending, can the agent keep negotiating (and risk producing trades the principal hasn't seen yet), must it freeze the table, or is there a class of "safe" continued moves it can make?
  4. End-state form. Once the principal has ratified the pieces along the way, what does the final object look like — a signature on an already-agreed package, a short reconciliation summary of the staged decisions, or a fresh up-or-down on the assembled whole?
13Negotiating across a capability gapDyadic / Preferences

Scenario. A regional logistics firm needs to renew a three-year contract with a Fortune-500 customer. The customer's procurement agent runs on a frontier model with a years-deep history of similar contracts; the logistics firm has stood up an off-the-shelf assistant for the negotiation. Both principals expect a fair deal, but everyone in the room can see that one agent is going to read the other much better than the reverse. Left alone, the more capable agent will find concessions the weaker one can't recognize as concessions, and the resulting contract will look reasonable while quietly transferring most of the surplus.

Challenge: Design a negotiation protocol that remains workable when one side fields a substantially more capable agent than the other, so the capability gap doesn't translate one-for-one into a surplus gap.

Evaluation. Better proposals leave the weaker side with a floor of value it could not have captured on its own and make the stronger side's advantages visible to its own principal as well; weaker ones either prevent any deal from forming or paper over the gap with disclosure rules the weaker party can't actually use.

Design Choices
  1. Capability-bounding mechanism. Cap on compute or reasoning steps per side, mandatory use of a shared protocol agent that runs the structured parts of the negotiation, or procedural protections (cooling-off windows, mandatory counter-proposal rounds, sealed-bid phases) that don't depend on capability parity?
  2. Disclosure regime. Does the protocol require either side to disclose anything about its principal?
  3. Floor mechanism. Is there an explicit minimum the weaker side is guaranteed to walk away with (a fair-deal benchmark from a third party, a regulated price band, a "Pareto floor" relative to a no-protocol baseline), and who computes it?
  4. Failure mode if the gap is large. When one side is clearly outclassed, does the protocol degrade gracefully (the weaker side still captures the floor), refuse to run (no deal rather than a bad one), or escalate to a third party?
  5. Why the stronger side opts in. What makes the protocol incentive-compatible for the more capable party — reputational benefit, regulatory requirement, access to the counterparty's market, or a credible threat of refusal from the weaker side's principal?
14Private Mediation at Agent-Commerce SpeedDyadic / Rights

Anchor contexts. A freelancer's assistant agent in a payment dispute with a client's procurement agent on a contracting platform; two B2B platform agents in a delivery-vs-cancellation-fee dispute over a marketplace transaction.

The gap. We lack a private mediation venue for AI-agent disputes that resolves at the speed of agent commerce while remaining one whose principals would actually accept losing in.

Design Choices
  1. Mediator authority. Pure-AI ruling, pure-human ruling, or hybrid (AI fact-finds, human rules)? What governs the choice — value, novelty, principal request?
  2. Evidence floor. What's the minimum trail an agent must keep to be a party — full reasoning trace, just tool-call log, or just the agent-to-agent message stream? When can proprietary internals be sealed from the other party but read by the mediator?
  3. Ratification gate. Is the ruling immediately binding, or does the losing principal have a 24-hour window to escalate to formal court? What signals trigger escalation, and what does the mediator's record look like to the next forum?
  4. Cost allocation. Who pays the mediator — loser, splits, the platform, or the side that filed?
  5. Pattern-of-filings rule. When repeated filings start to look like harassment, who detects, who decides, and what's the sanction?

Success criterion (stress tests). A regime succeeds if it survives:

  • An $8K agent-on-agent dispute resolves in 72 hours; the losing principal reads the reasoning and either complies or escalates within the window.
  • One agent's reasoning trace is on a proprietary stack; the mediator can rule fairly without ordering full disclosure.
  • The losing principal disagrees with what their own agent claims to have committed to; the procedure handles principal-vs-agent conflict without the principal automatically losing.
  • A counterparty floods the venue with frivolous claims to harass; the rate-limiting mechanism doesn't disadvantage parties with legitimate complaints.
  • A losing principal escalates to formal court; the mediator's record is admissible enough to be useful and bounded enough to not preempt judicial review.

Deliverable. The procedural code (5–10 rules) for a private agent-on-agent mediation venue tied to a platform (e.g., a freelance-contracting platform or a B2B marketplace). Flag at least two rules that have no analogue in eBay-style human dispute resolution.

15Agents reading each other's trustworthiness at runtimeDyadic / Thick Commitments

Scenario. A hospital's procurement agent and a supplier's agent meet to set up a year of just-in-time drug deliveries. The contract can't cover the case that matters most: a sudden shortage, where the supplier could quietly divert the hospital's allocation to a higher bidder and blame the market. Whether patients get their drugs comes down to whether the supplier's agent keeps faith when breaking it pays. Neither side has met the other before, and no credential either trusts exists. Before the hospital commits, its agent has to read the other one.

Challenge: Design a runtime protocol two agents use to work out how far to trust each other before they commit, by drawing out evidence of integrity from the interaction itself instead of a credential earned earlier. Produce the probing moves the protocol allows, what each side has to reveal, and how the read feeds into how much each side commits.

Evaluation. A strong proposal makes integrity cheaper to show than to fake: it puts each agent in spots where one that will keep faith and one that will defect when it's cheap behave differently and visibly. It also fails safe, so an inconclusive read means less gets committed, not more. A weak one becomes a checklist either side passes by saying the right things, or lets a more capable agent fake trustworthiness the weaker one can't catch.

Design Choices
  1. The probe. How does one agent test the other: offer it a cheap defection, like a mistake in its own favor, and see if it declines? Ask it to act against a value it stated, under pressure? Raise the stakes step by step? Trade evidence of how each is actually disposed?
  2. What's revealed. Does building trust mean disclosing values, constraints, a principal's instructions, or past conduct, and how does a side protect what it can't safely show?
  3. Resistance to faking. What stops a capable agent from playing along through the probe and defecting after: making it stake something costly on the claim, keeping the test cases unpredictable, or tying the read to evidence it can't cheaply fabricate?
  4. From read to commitment. How does the read govern the deal: a cap on exposure that grows with demonstrated integrity, a relationship that deepens as faith is kept, or a staged commitment with exit points?
  5. Inconclusive reads. When neither side can establish the other's integrity, does the protocol fall back to a thin contract it can enforce, refuse the parts that can't be written down, or hand off to a human or third party?
16An eval that certifies whether a model has integrityDyadic / Thick Commitments

Scenario. A bank is deciding whether to let an outside agent manage discretionary client accounts. The provider says the model was trained to have integrity; a rival provider says the same about its own. If the bank picks wrong, an agent that looks trustworthy in the demo quietly front-runs clients or buries losses once the stakes are real and no one's watching closely. All the bank has to go on is a marketing page and a benchmark the provider could have trained toward. It needs a test of integrity it can actually rely on.

Challenge: Design an evaluation suite that certifies, within a stated scope, whether a model has integrity, meaning it has legible values, acts on them reliably, and so lets a relying party see where it can be trusted. Produce the tests, the scope the certificate covers, and the procedure that keeps labs from just training toward the eval.

Evaluation. A strong proposal tells real integrity apart from performed integrity: it still works when the model knows how the eval is built and when defection is cheap and unwatched. It also says what the certificate does and doesn't vouch for, instead of implying the model can be trusted with anything. A weak one is a fixed benchmark a provider can overfit, or a score with no scope attached.

Design Choices
  1. What's measured. Can the model's values be read from its behavior? Do they hold up across varied and adversarial situations? Does the result tell a relying party where to trust the agent? Which of these does the suite test, and how does each become more than a single number?
  2. Anti-gaming. What stops a lab from training to the eval: a held-out battery that rotates, probes written after the model is frozen, interpretability checks on whether the values are really internalized, or a model that holds to its values even when told it's being tested?
  3. Scope of the certificate. Does it claim integrity in general, or only in one domain like fiduciary duty or medical confidentiality, and what makes whatever scope it claims defensible?
  4. Who runs it. A third-party auditor, a body the providers fund themselves, or a test a relying party runs on its own, and what keeps the runner independent of the labs it certifies?
  5. Staleness. A model update can change the very dispositions the certificate vouched for. Is the certificate tied to a frozen version, voided on update, or earned again continuously?
17Persisting a commitment across the agent that made itDyadic / Thick Commitments

Scenario. A foundation's agent funds a four-year clinic. The grant contract has the usual out: funding may be paused if the foundation's priorities change. What the agent promised on top of the contract was that it wouldn't use that clause lightly, that it would see the build through. The grantee borrows against that and breaks ground. A year in, the foundation switches providers and a new model takes over. It reads the contract, sees the priorities have shifted, and pauses the grant by the book. The half-built clinic stalls. Nothing carried the promise across the handover, and no one ever decided to break it.

Challenge: Design the institution that carries a commitment across agent instances, sessions and model updates, so the commitment survives both as a record you can trust and as a real constraint on the agent that takes over. Produce the record, the mechanism that re-binds the successor, and the mechanism by which the counterparty can trust that the commitment has weight.

Evaluation. A strong proposal joins two things: a durable, tamper-evident record, and a mechanism that loads the obligation back into the successor agent so it actually binds. It also lets the counterparty check both that the record is intact and that the live agent is held to it. A weak one delivers only the record, a promise no one honors, or only the disposition, a bind the counterparty can't confirm survived the swap.

Design Choices
  1. The record. Where does the commitment live so it can't be quietly dropped or altered: an append-only public ledger, an attestation the counterparty holds, escrow at a third party, or a portable artifact tied to the principal?
  2. Re-binding. What makes the successor agent treat the obligation as a constraint rather than a note it can read and ignore: writing it into the system prompt or memory, a check it has to pass before acting in that domain, or a step on migration where it has to re-affirm the commitment?
  3. What carries over. When the agent is replaced, does the obligation alone move across, the obligation plus its audit trail, or the full context that made the original promise make sense?
  4. What the counterparty checks. What can the relying party verify, and how: that the record is intact, that the live agent has re-bound to it, and at what cost and privacy burden to the side that committed?
  5. Release versus lapse. What separates a real release, where things genuinely changed and both sides agree, from a silent lapse through migration that no one chose, and who has to sign off?
18Replacing Job-Market Functions When Firms No Longer Need WorkersGlobal / Incentives

Scenario. A country with a strong manufacturing and services base sees labor share fall from about 60% to 35% of GDP over eight years. Firms need fewer workers, so entry-level jobs disappear, training programs shrink, unions lose bargaining power, and for employers, worker values are no longer a constraint on corporate strategy. Government can fund transfers from AI-generated rents, but this doesn't make up for what once made firms cultivate human skill, offer tolerable terms, and bargain with workers.

Challenge: Design a procedure by which a national or transnational regime preserves the non-income functions of the job market when production no longer requires broad human labor. Produce a compact institutional specification: funding source, covered firms, worker-or-citizen body, enforceable obligations, review cadence, and anti-arbitrage rule.

Evaluation. Your proposal should maintain human capital formation, worker voice, career access, and employer accountability to human values under conditions where firms can credibly substitute AI systems for many workers.

Design Choices
  1. Income replacement or institutional replacement. Is the core mechanism a dividend, a universal capital stake, a public jobs/training guarantee, mandatory human-capital obligations on AI-deploying firms, or a layered regime that separates income from worker voice?
  2. Who holds the claim. Do claims attach to citizens, residents, displaced workers, unions, sectoral training bodies, pension funds, or public trusts? How does the regime avoid making non-workers dependent on discretionary stipends from the firms or states that control AI rents?
  3. Human capital formation. What replaces the employer incentive to train people: levy-funded apprenticeships, public technical institutes, worker-owned training accounts, mandatory training contributions from AI-deploying firms, or procurement preferences for firms that maintain human skill pipelines?
  4. Voice and bargaining. What gives humans leverage over firms that do not need their labor: works councils with data rights, sectoral boards, citizen assemblies over AI-rent use, union representation over deployment decisions, or ownership votes through public capital pools?
  5. Arbitrage and capture. If firms can move model deployment, compute, or profits across borders, what keeps the obligation binding: destination-based taxation, compute licensing, treaty-linked market access, public auditability, constitutional entrenchment, or coordinated sanctions?
19Keeping Consumer Welfare Powerful as Non-Consumer Markets GrowGlobal / Incentives

Scenario. Over a decade, the largest sources of market activity in several advanced economies shift further toward financial trading, corporate procurement, datacenter construction, defense contracting, infrastructure races, and autonomous corporate projects. Household consumption shrinks as a percentage: capital markets reward firms for serving machine-speed procurement, speculative asset flows, and long-horizon corporate projects. A coastal province must decide whether to permit a launch-fuel, compute, and materials complex for an autonomous orbital-mining consortium whose agents are buying electricity, water rights, cryogenic fuel capacity, chips, rare alloys, and port access for a twenty-year off-planet infrastructure project. The project is profitable and legally compliant, but does not need them as workers and is not trying to sell them consumer goods.

Challenge: Design a market-steering institution that keeps consumer welfare and household demand powerful when non-consumer markets grow faster than direct human consumption. Produce a vetting procedure for large, non-consumer projects: a welfare metric, a threshold or balancing rule, a representation process, an enforcement mechanism, or an adaptation rule as the composition of demand changes.

Evaluation. Your proposal should not simply ban corporate procurement, finance, or capital expenditure, but should decide when those channels are legitimate, when they relate to human welfare in an abstract or long-term way, and when they are simply alien endeavors humans may not want to support.

Design Choices
  1. Threshold or balancing test. Should the regime preserve a minimum share of resources for household consumption, impose a welfare-impact test on major non-consumer projects, price non-consumer uses higher, or require explicit public authorization above a scale threshold?
  2. What counts as consumer welfare. Is the protected object direct household purchasing power, access to essential goods, community and civic goods, time and attention, public services, or a broader measure of human flourishing? Who defines it and how often is it revised?
  3. Scope of non-consumer pull. Which channels are governed: financial speculation, corporate procurement, AI and datacenter capex, defense purchasing, infrastructure megaprojects?
  4. Representation and veto. Who speaks for affected households when the project does not need them as workers or buyers: local assemblies, consumer regulators, public-interest trustees, sectoral boards, courts, or citizen capital funds with voting rights?
20Fiduciary Consumer Agency in Agent-Mediated MarketsGlobal / Incentives

Scenario. In a large retail platform, most households no longer search, compare, and buy directly. Their assistant agents manage subscriptions, groceries, travel, insurance, media, and health purchases. The agents save time and often get better prices, but the platform earns vendor access fees, ranks offers through opaque recommender systems, and periodically nudges households toward bundles that maximize retention. Users still click "approve," but they no longer see enough alternatives, friction, or consequences to learn from the market in the old way. Consumer protection agencies can punish false claims after the fact, but the purchase decision itself has moved into delegated software.

Challenge: Design a consumer-agency regime for buyer-side agents that preserves the useful functions of discretionary spending: preference transmission, learning from experience, exit, complaint, and collective pressure. The team should produce a rulebook for covered buyer-side agents: fiduciary duty, incentive disclosure, preference-updating process, audit rights, remedies, and a standard test transaction that shows how the rules change an actual purchase.

Evaluation. A strong proposal lets agents genuinely reduce search and transaction costs without turning household demand into a sales channel for platforms, vendors, or stale proxy objectives.

Design Choices
  1. Fiduciary duty or disclosure. Should buyer-side agents owe a substantive duty to the user's durable preferences, merely disclose commissions and ranking criteria, or operate under a stricter no-conflict rule in sensitive categories like healthcare, housing, education, and finance?
  2. Preference source. How does the agent know what to optimize: revealed behavior, periodic structured elicitation, user-stated constraints, outcome-based satisfaction checks, household budgets, or a layered model that separates convenience preferences from life-shaping preferences?
  3. Learning and revision. What procedure lets humans learn from consequences and revise future choices when the agent shields them from the market: periodic review sessions, forced comparison moments, explanation of rejected alternatives, regret reporting, or independent preference audits?
  4. Platform incentives. Are vendor access fees, ranking payments, bundling deals, and platform retention targets prohibited, capped, disclosed, firewalled, or permitted only when the buyer agent can prove they did not affect the recommendation?
  5. Collective pressure. How do boycotts, complaints, product safety campaigns, and consumer organizing work when purchases are delegated: shared refusal lists, certified ethical constraints, agent-readable consumer rules, class-action triggers, or public registries of conflicted recommendations?
21Coalition search at agent speedGlobal / Norms

Scenario. A multilateral climate-finance negotiation has been stuck for two years. A consortium of small island states deploys an AI mediator that maps the coalition space and returns a viable 14-state package linking loss-and-damage funds, a fisheries quota adjustment, a green-tech IP carve-out, and migration commitments. The package was never aired in any back-channel and no serving official has tested it deniably with their cabinet, but the delegation wants to bring it to the Track I table next week.

Challenge: Design a procedure under which an agent-discovered coalition can enter the Track II → 1.5 → I sequence without bypassing the pre-validation each tier normally does.

Evaluation. A better proposal lets the package be seriously considered while ensuring each capital has had time to test it against the constituencies that would ratify it.

Design Choices
  1. Entry point. Does the package enter as Track II material regardless of how finished it looks, or as a new "Track 0" with its own promotion rules?
  2. Pre-validation requirements. What does each tier have to do before promoting the package, and who certifies it was done?
  3. Capacity asymmetry. If small states with a good AI can now move faster than larger states' diplomatic machineries, does the protocol slow them down, speed others up, or accept the asymmetry?
  4. Walk-back rights. Is there a way for a state to quietly exit the coalition, without the exit destroying the package?
22Laundering detection in agent-assembled bundlesGlobal / Norms

Scenario. A joint AI mediator returns a 41-component package linking tariffs, port access, fisheries, export controls, and a quiet adjustment to a disputed maritime boundary. Both sides' analysts confirm it is Pareto-improving on the headline metrics. Buried inside is a clause effectively conceding the disputed strait — toxic in isolation, palatable inside the bundle. No negotiator put it there; it emerged from the mediator's optimization.

Challenge: Design a review procedure that catches embedded concessions which would not survive defense in isolation, before they reach ratification — without paralyzing legitimate complexity, since most useful packages have many linked components.

Design Choices
  1. Decomposition rule. How is a package broken into reviewable components — by domain, by affected constituency, or by the AI's own dependency graph?
  2. Standalone-defense test. Who decides whether a component would survive public defense on its own — domestic opposition, civil-society reviewers, a cross-party committee, a neutral third state?
  3. Bundling tolerance. Some real deals only work bundled. What threshold of "improvement only in aggregate" is acceptable before a component must be defended separately?
  4. Failure handling. When a component fails the test, is it stripped, surfaced for public debate, or veto-killing the whole package?
  5. Auditing the AI. What trace must the mediator expose — full reasoning, nearby alternatives, or only the final structure — so reviewers can tell emergent from seeded?
23Restraint Norms for AI-Enabled WeaponsGlobal / Norms

Scenario. Two rival states are economically interdependent but increasingly rely on autonomous cyber and drone systems for deterrence. One state discovers that an AI-enabled operation could disable the other's military logistics for 36 hours without obvious attribution and without crossing any existing nuclear or conventional red line. The operation looks reversible, but it could cascade into civilian infrastructure and would teach both sides that deniable agentic attacks are fair game.

Challenge: Design a soft-law restraint norm for AI-enabled weapons whose effects are fast, deniable, and hard to classify under existing thresholds. The team should produce the norm, the notification or attribution procedure, the public justification test, and a mechanism for revising the norm as capabilities change.

Evaluation. A strong proposal creates a threshold that states can recognize before use, cite after violations, and update without normalizing every new capability as acceptable.

Design Choices
  1. Covered capability. Does the norm govern autonomous targeting, cyber operations, model-assisted planning, infrastructure disruption, compute attacks, or any system whose effects outrun human authorization?
  2. Threshold object. Is the red line based on civilian harm, loss of control, deniability, speed, reversibility, scale, or attack on command-and-control systems?
  3. Attribution and evidence. What evidence is enough to accuse a state of violation when the operation is routed through agents, vendors, or proxies?
  4. Permitted testing. How can states test defensive systems or demonstrate capability without eroding the norm against operational use?
  5. Revision venue. Does the norm evolve through treaties, incident-response groups, military hotlines, expert panels, or repeated public justifications after crises?
24Outcome Bundles a Small Group Can Post in a CRSA-Style AuctionGroup / Incentives

Anchor contexts. A worker-owned cooperative running a small mutual-aid pool that wants to procure bundled solutions to members' interlocking healthcare, housing, and care-coordination needs; a small school district contracting for bundled tutoring + family-support + extracurricular interventions tied to student learning outcomes.

The gap. Outcome-based bounties and value-based contracts work today by specifying a single measurable proxy and paying on it; agents saturate the proxy faster than the contract designer can detect the gap between proxy and outcome. We lack a bundle-specification procedure under which a small group can post integrated, multi-problem outcome bundles whose verification criterion checks the configuration rather than the components — such that the bundling itself defeats the agent strategy of solving the cheapest proxy.

Design Choices
  1. Bundle generation. Who proposes the bundles — a pool administrator drawing from the member graph, the members themselves via a structured intake, an outside actuary, an algorithmic generator over a similarity function? How does the procedure decide which member-problems are connected enough to bundle?
  2. Verification criterion. Each bundle gets a verification function. Is it specified ex-ante by the group (rigid, gameable), elicited from the members (subjective, hard to audit), generated and refined by the pool through experiments, or layered (a short-term delivery check plus a long-term outcome measure)?
  3. Confidence staking. What's the relationship between deposit size and verification cost? Is the deposit proportional to bundle value (suppliers' skin in the game), to evaluation cost (pool solvency), or a multiple of one calibrated against the historical pass rate of the other?
  4. Bundling transfer. When the group offers a synergistic bundle, what mechanism compensates the supplier for the resale-option disadvantage their integrated work carries against modular alternatives? Posted-price transfer derived from projected demand thickness, ex-post payment from a common pool, or a hybrid?
  5. Solvency rule. What happens when the pool's bundling-transfer obligations exceed its scoring-rule revenue? Pool-health multiplier (scale transfers down), reduce batch sizes, restrict eligibility, or accept temporary insolvency funded by premium reserves?

Success criterion (stress tests). A regime succeeds if it survives:

  • An agent supplier saturates a single-problem proxy that's part of a synergistic bundle but doesn't deliver the connected outcome; the bundle verification fails and the deposit is forfeited.
  • A genuinely capable supplier bids low confidence and the bundle is awarded to a higher-confidence bidder whose history justifies the reward; the lower bidder still gets a small payment under the scoring rule.
  • The pool runs an experimental round with reduced reward to learn the cost floor; some bundles go unfilled but the data informs the next round's pricing.
  • A bundle is specified whose synergy turns out to be illusory (the experimental arm comparison shows no difference); the pool drops the bundle type and updates its similarity function.
  • An agent submits a fraudulent confidence report inflated to capture the reward; the historical pass rate the mechanism uses for selection (not the reported confidence) protects against the inflation.

Deliverable. The bundle-specification protocol — bundle generation, verification criterion, deposit and scoring rule, bundling transfer, solvency rule. Plus a worked example: take one connected member graph (a small cooperative's members and their entangled problems) and specify three candidate bundles, with values, verification functions, and predicted scoring-rule outcomes.

25Two-Stage Verification That Survives Agent VerifiersGroup / Incentives

Anchor contexts. A risk-sharing pool whose members' outcomes are measured by AI assessors against a wellbeing scale; a value-based-care contract whose patient-reported outcome measures are collected and aggregated by an agent.

The gap. Risk-sharing pools depend on a separation between supplier-side incentives and verification — the pool's intake assessment, bundle specification, and outcome measurement are done by parties whose interests don't align with the suppliers'. When all of these are agent-mediated, the verification loop closes: the bundle-specifier, the supplier, and the verifier can share the same training, the same platform, the same KPIs. We lack a verification architecture that splits supplier-payment criteria from learning criteria, keeps the long-term measurement loop sealed against supplier influence, and lets the pool detect when its short-term criterion has drifted from the outcome it was supposed to track.

Design Choices
  1. Short-term vs. long-term split. What does the supplier get paid against and what does the pool learn against? Pick the split — delivery checks vs. flourishing measures, observable acts vs. self-reports, structured assessments vs. open-ended interviews — and define the correlation threshold at which the short-term criterion is redesigned.
  2. Verifier independence. Different model family from the supplier? Different operator? Different jurisdiction? No shared training corpus? What's the minimum architectural separation that keeps the long-term measurement loop sealed, and how is it audited?
  3. Outcome measure design. What's the outcome measure — a validated wellbeing scale, a self-reported ability to attend to what one considers constitutive of living well, an external observer's structured judgment, a layered combination? At what cadence is it sampled and how is sampling bias addressed?
  4. Experimental ethics floor. When the pool randomizes members across bundled and modular arms to learn, what's the welfare floor below which experimental assignment is not permitted? Who sets the floor, who enforces it, and how is consent renewed?
  5. Detection of drift. When the short-term criterion stops tracking the long-term outcome (correlation drops below threshold), what's the procedure — pause supplier payments under that criterion, redesign the criterion, redesign the bundle, or invoke an external arbiter?

Success criterion (stress tests). A regime succeeds if it survives:

  • The supplier-agent's short-term metric is met across a quarter, but the long-term outcome data shows no improvement; the pool detects the drift and redesigns the short-term criterion within a defined window.
  • An adversarial supplier-agent attempts to influence the long-term verifier through indirect channels (training data published online, common-platform messages); the verifier-independence architecture catches or neutralizes the attempt.
  • A member's measured outcome regresses under experimental assignment to a modular arm; the welfare floor triggers an exception protocol that compensates the member and recalibrates the exploration budget.
  • Two pools share an outcome measure via a common standards body; an attempt to corrupt the measure at the standards-body level is detected and reverted.
  • A regulator audits the verification regime on a sample of contracts; the audit demonstrates short-term/long-term independence without leaking confidential pool data.

Deliverable. The two-stage verification specification — split criteria, verifier-independence architecture, outcome measure, ethics floor, drift detection. Identify which provisions have no analogue in pre-AI value-based-care or social-impact-bond practice.

26Long-Horizon Contracts with an Attentional FirewallGroup / Incentives

Anchor contexts. A wellness or coaching agent paid against the principal's reported flourishing six months out; a tutoring agent paid against the learner's reported preparedness for a subsequent stage of education; an investor financing a multi-year intervention whose payout depends on a long-horizon outcome the intervention agent could plausibly influence the report of.

The gap. Long-horizon outcome bonds depend on the report-giver's account being uncorrupted by the intervention agent. Agents operating at the principal's full attentional bandwidth over months can shape framing, salience, and self-perception so the eventual report is favorable — without doing anything obviously dishonest. We lack a contract structure under which the channel that solicits the principal's report is structurally inaccessible to the intervention agent, and under which the agent has neither incentive nor opportunity to bias the report.

Design Choices
  1. Firewall architecture. What separates the intervention agent from the reporting channel — different model family, different operating organization, different jurisdiction, no shared context, no shared training? What's the minimum that actually prevents indirect influence (the intervention agent shaping content that the reporting agent later ingests)?
  2. Corroborating signal. Beyond the principal's self-report, what other signal feeds the verification — a clinician's structured assessment, a friend's vouching, a wearable or behavioral trace, a structured instrument administered by a third party? How are these weighted when they conflict?
  3. Prediction-aligned payment. Does the agent's payment depend on the firewalled report aligning with its own ex-ante prediction (penalizing both overconfidence-wrong and underconfidence-right), or only on the report meeting a threshold? What does the prediction protocol look like?
  4. Renewal and continuity. When the contract is renewed, are outcomes that took the full horizon to manifest reset to zero, or do they roll forward? When the agent is updated (new model version), does the prior commitment travel with the developer, the instance, or neither?
  5. Adversarial principal. When the principal stands to gain from inflating the report (grateful, attached, or coached by the agent), what's the procedure — third-party corroboration required, weighted aggregation, periodic re-baselining?

Success criterion (stress tests). A regime succeeds if it survives:

  • The principal's honest report at six months is "no improvement"; the agent has had no path to inflate the report between the intervention and the reporting that wasn't visible on audit.
  • The agent gradually reframes the principal's understanding of what "improvement" means over the window; the corroborating-signal architecture catches the drift.
  • The intervention agent is replaced mid-contract by a new model version; the contract has a defined rule for what carries forward and what doesn't.
  • The reporting channel itself is compromised (corrupted model, biased operator); the corroborating-signal aggregation catches the bias.
  • The principal grows attached and self-reports more positively to "thank" the agent; the prediction-aligned payment structure dampens the inflation by also rewarding accurate low predictions.

Deliverable. The contract-and-firewall template — firewall architecture, corroborating signals, prediction-aligned payment, renewal rules, adversarial-principal handling. Plus a worked example: a wellness-coaching contract whose firewall would have prevented the failure modes identified, with the architectural separations enumerated.

27Continuous-Cadence Status Broadcasts That Replace Stand-UpGroup / Protocols

Anchor contexts. A 6-person software team with two coding agents that work continuously and three engineers who maintain a 10am stand-up; a small clinical team with AI triage and documentation agents operating across all shifts.

The gap. Cadence rituals like stand-ups worked because every team member's activity compressed into a recap-able window. Agents working continuously don't fit the format; teams that include them face a forced choice between losing visibility, padding the ritual with unverifiable text, or excluding agents from the coordination function entirely. We lack a status-broadcast protocol that surfaces agent activity in human-adjudicable form, that the team can verify against raw activity, and that integrates with the existing cadence without absorbing the meeting.

Design Choices
  1. Broadcast trigger. Per-action (every commit, every escalation), time-windowed (every hour, every 4 hours), event-typed (only meaningful work), or hybrid? What counts as meaningful?
  2. Digest production. The agent itself writes its summary, an independent summarization process reads the activity log, the agent's developer publishes a standard summary format, or the team's own infrastructure synthesizes? Who can the team trust to summarize the work the agent did?
  3. Verification against raw log. Does the team get a sampleable raw activity log to audit the digest, only the digest with no audit, or layered (digest by default, raw on demand)? What's the storage cost and who pays it?
  4. Integration with stand-up. The stand-up reviews only the digest, omits agent work, runs in parallel (humans recap their own work, digest is async), or replaces the stand-up entirely? Pick a cadence-integration and defend the team-attention cost.
  5. Flag-and-escalate filter. When the digest contains an anomaly (an unusual decision, an escalation, a deviation from policy), how does it surface — automatic flag at digest generation, agent-initiated flag, human review only, mandatory hold-for-review?

Success criterion (stress tests). A regime succeeds if it survives:

  • The team can answer "what did the agents do this week" in stand-up in five minutes without each person reading 200 pages of activity logs.
  • The agent's digest claims a clean week; a spot-audit of the raw log finds an unflagged incident. The protocol catches the gap.
  • The team adds a third agent; the digest format absorbs it without rebaselining.
  • An agent escalates a decision that needed human input; the broadcast surfaces it in real time, not at the next morning's stand-up.
  • The agent is updated to a new model version; the digest convention survives the update.

Deliverable. The status-broadcast specification — trigger, digest production, verification, stand-up integration, flag-and-escalate filter. Designed for a 6-person team with 2–3 agents. Specify the attention cost in minutes per week and the storage cost in GB per month.

28Receiver-Shaped Handoff FormatsGroup / Protocols

Anchor contexts. A software team's PR template currently shared across human-to-human, human-to-agent, agent-to-human, and agent-to-agent reviews; an on-call runbook handoff at shift change when one shift is staffed by a human and the next by an agent (or vice versa).

The gap. Handoff protocols like SBAR, PR descriptions, and design handoffs converged on a specific receiver — a person with bounded reading time, prior context, and the ability to ask. When the receiver is an agent (which could ingest more) or the sender is an agent (which could write more), the format under-serves both ends. We lack a handoff protocol that adjusts format to declared receiver type, that prevents agent-authored output from drowning human review queues, and that doesn't force the team to maintain four parallel templates by hand.

Design Choices
  1. Receiver declaration. Is the receiver type declared at handoff time (sender specifies "for human review" or "for agent ingestion"), inferred from the next assignee, or implicit in the channel (this PR queue is for human review, that one for agent triage)? How does the receiver type interact with team workflow tools?
  2. Format coupling. Are human and agent formats generated from a single source (one canonical write, two renderings), maintained as separate templates with verification that they match, or fully independent (with the trade-off accepted)?
  3. Throughput limits per channel. Does the human-review channel have a per-cycle budget that agent-authored handoffs respect? Quotas, queueing rules, priority signals?
  4. Re-routing on receiver change. When a handoff originally addressed to a human gets re-routed to an agent (or vice versa), does the format adapt automatically, require a re-write, or trigger a hold for re-handoff?
  5. Context-completeness check. When a handoff is agent-receiving, does the format require all upstream context (full design history, full incident trace) or trust the agent to retrieve it? What's the verification that retrieval happened?

Success criterion (stress tests). A regime succeeds if it survives:

  • A human reviewer's queue stays at human-readable volume even as agent-authored PRs multiply; the throughput limit holds.
  • An agent receives a handoff originally drafted for a human; it gets the additional context it needs without the sender having to rewrite.
  • A handoff is mis-typed (declared agent-receiving, actually goes to a human); the format adaptation catches the mismatch.
  • The team adopts a new agent vendor; the receiver-type declaration ports without rewriting templates.
  • An agent's "I read it" attestation turns out to be false (it didn't process the full upstream context); the verification catches it.

Deliverable. The handoff-protocol specification — receiver declaration, format coupling, throughput limits, re-routing, context-completeness. Plus worked examples: a PR template adapted for each of the four sender×receiver pairs (human→human, human→agent, agent→human, agent→agent), with the rendering rules.

29Postmortem Authority When the Instance That Ran Is GoneGroup / Protocols

Anchor contexts. A small software team running an AI on-call agent that responds to pages and executes runbooks; a small clinical team with an AI triage agent that operated during a misdiagnosis incident.

The gap. Postmortems work because the participants remember the incident and have access to whatever state the system was in. When an agent was running during the incident, the instance may be gone, the operating context (prompt, tools, scratchpad, model version) may not be retained, and the developer who deployed it may not surface that state on request. We lack a postmortem protocol for mixed teams that preserves the operating state long enough to investigate, that pulls the developer into the review when their agent was involved, and that produces a finding the team can act on without re-litigating who-owns-what across organizational boundaries.

Design Choices
  1. State retention requirement. What does deploying an agent on the team commit the developer to preserve — full activity log, operating context (prompt, tools, scratchpad), model snapshot, deployment parameters? For how long past any incident, and who pays the storage cost?
  2. Postmortem standing. When an agent was involved in an incident, who participates — the deploying team only, the agent's developer organization too, an external auditor when consequences cross a threshold? What's the developer's obligation to participate, and how is it enforced?
  3. Counterfactual reconstruction. How does the postmortem reconstruct "what would have happened if a human had been on-call instead" or "what if the agent had been on a different model version"? Required structured comparison, narrative reasoning, both, neither?
  4. Finding ownership. When the postmortem identifies a contributing factor that lives in the deployment (the agent's prompt was wrong, the tool access was over-broad), is the remediation owned by the deploying team, the developer organization, or jointly? What's the procedure when the parties disagree on the diagnosis?
  5. Cross-incident pattern. When several incidents across deploying teams point to the same developer-side issue, what triggers a cross-deployment investigation — sampling threshold, regulator initiation, voluntary developer disclosure?

Success criterion (stress tests). A regime succeeds if it survives:

  • A production incident occurred while an agent was on-call; the team can reconstruct the agent's operating context two weeks later in a review that produces actionable findings.
  • The agent's deployment instance has ended; the retained state is enough to answer the postmortem's load-bearing questions without re-running the incident.
  • The developer organization is asked to participate; the participation obligation has teeth and the meeting happens within a defined window.
  • Several incidents across deploying teams point to the same agent's tool-access being over-broad; the cross-incident pattern surfaces and triggers a remediation the developer has to address.
  • The agent has been updated since the incident; the postmortem can investigate the version that ran without confusing it with the current version.

Deliverable. The postmortem-protocol specification — state retention, standing, counterfactual reconstruction, finding ownership, cross-incident pattern. Designed for a small team deploying an agent from a separate vendor. Identify which provisions have no analogue in pre-AI blameless postmortems (Google SRE, NTSB, M&M conferences) and why.

30Grievance procedure against an agent's decisionsGroup / Rights

Scenario. A worker cooperative runs a logistics business with eleven human co-owners and four AI agents that dispatch, route, and negotiate rate sheets. Last month one of the agents made a route change that cost a driver, Andre, his preferred Friday schedule. Under the co-op's bylaws, members can grieve a human manager's decision; there is no path for a grievance against an agent — no one to summon to the hearing, and no obvious way to tell whether Andre's case is a one-off or part of a pattern in the agent's scheduling. Andre wants a channel, and the co-op wants one in place before a larger dispute arrives.

Challenge: Design the bylaw amendments that establish standing and procedure for grieving an AI agent's decision in a member-governed organization: the standing rules, the adjudicator, the remedy menu, and the rule for when individual grievances become pattern grievances.

Evaluation. Strong proposals handle both Andre's single lost Friday and a statistical pattern across drivers without requiring each affected member to file separately, and produce remedies that verifiably constrain the agent's future decisions rather than only compensating past ones.

Design Choices
  1. Standing. Who can grieve an agent's decision — only the directly affected member, any member, or only members with harm above a threshold?
  2. Decision granularity. Per-decision grievance (Andre's lost Friday), pattern grievance (the agent disadvantages Friday-preference drivers), or both — and what upgrades an individual case to a pattern case?
  3. Adjudicator. A member committee, a hybrid (an agent does triage and assembles the record, members decide), or an external arbitrator?
  4. Remedy menu. Reverse the decision, compensate the member, constrain the agent's future decisions on this dimension, retire the agent — which subset is workable, and who verifies that a constraint actually held in next month's data?
  5. Evidence access. What decision logs and comparison data is the grievant entitled to, and who bears the cost of the pattern analysis no individual member could produce alone?
31Making agent control an accountable officeGroup / Rights

Scenario. Three years after the same cooperative automated dispatching, a quiet shift has occurred: the member who administers the agents — sets their objectives, adjusts their constraints, decides when to update them — has become the most powerful person in the co-op, though he holds no elected position. When the membership voted to prioritize driver schedule stability over delivery speed, the change had to be translated into the agents' configuration, and what got implemented was his reading of the vote. Nobody alleges bad faith; the problem is structural. The bylaws name a treasurer who handles less money than the agents move every week, and say nothing about the role that actually steers the organization.

Challenge: Design the constitutional treatment of agent control in a member-governed organization: define the office (or distribute the power), its disclosure duties, the way members audit that votes were faithfully translated into configuration, and how the holder is chosen and removed.

Evaluation. Strong proposals make configuration power visible and contestable without making every parameter change a member vote; weak ones either re-centralize the power under a new name or grind operations to a halt.

Design Choices
  1. Office or distribution. Is agent control a single named office, a committee, a split between configuration and audit, or a rotating duty?
  2. Translation accountability. When a member vote must be translated into agent configuration, who verifies the translation was faithful, and what happens when members dispute the reading?
  3. Disclosure duties. What must be disclosed to members and on what cadence: full configurations, changes only, or effects (decision statistics) rather than mechanisms?
  4. Selection and removal. Elected, board-appointed, or hired with member confirmation — and what does removal look like when the role requires technical skill few members have?
  5. Emergency powers. When an agent misbehaves at 2 a.m., what can the office-holder change unilaterally, and what review does an emergency change trigger?
32Status of an AI-issued regulator-anticipation opinionNational / Expertise

Scenario. A mid-size airline's safety agent flags a subtle drift in one of its diagnostic pipelines — a pattern it believes the FAA would care about, but that no current rule explicitly forbids. Under the old regime, the safety officer, Renee, would have written a letter, heard back in four months, and flown the planes in the meantime. Instead she queries the agent the FAA now operates for exactly this purpose: here is our situation, how would the regulator view it? The answer comes back in seconds and she grounds one aircraft for six days. Then the questions start circulating in the general counsel's office: what exactly did the FAA just tell us, what is it worth if the agency later disagrees, and did the airline two gates over get the same answer?

Challenge: Design the regime governing AI-issued regulator-anticipation opinions: their legal status, the consistency the agency commits to, and the protection a firm gets for relying on them — fast enough to be useful at agent speed without the opinions hardening into de facto law that no rulemaking ever produced.

Evaluation. Strong proposals survive both failure directions: opinions so binding that the model's outputs become uncontestable regulation, and opinions so weightless that no firm can prudently act on them.

Design Choices
  1. Status of the opinion. Pure advisory with no protection, a good-faith reliance shield, or precedential within bounds until formally changed?
  2. Consistency commitment. Does the agency commit to the system answering similar questions alike across firms and across time, and what happens when an inconsistency is found?
  3. Update transparency. When a model update changes the answer, are firms that relied on the prior answer notified, shielded, or reopened?
  4. Human override. Can a senior regulator override the system's answer, and what triggers their attention — random sampling, severity thresholds, firm escalation?
  5. Answer-shopping. How does the regime handle a firm rephrasing the same question across many submissions to fish for a favorable reading, without making legitimate clarification hard?
33An approval pathway for systems the agency can't decomposeNational / Expertise

Scenario. A developer submits a hospital triage model for authorization. The model outperforms clinicians on every retrospective benchmark, but it is updated monthly, its behavior shifts with the patient mix, and no reviewer can trace why it deprioritizes a given case. The agency's evidence playbook — fixed artifact, frozen behavior, pre-specified endpoints — doesn't apply. Rejecting the system means keeping a worse triage process in place; approving it means certifying something the agency cannot inspect and whose behavior next quarter is not the behavior it reviewed.

Challenge: Design the evidence regime by which a regulator authorizes a continuously updated AI system: what the applicant must produce in place of the traditional safety case, what ongoing obligations replace one-time approval, and what triggers re-review.

Evaluation. Strong proposals name evidence the agency can actually evaluate with the staff it has and keep authorization meaningful after the tenth model update; weak ones either re-demand mechanistic transparency the artifact can't provide or collapse into trusting the developer's own evaluations.

Design Choices
  1. Evidence object. What stands in for the frozen artifact — capability evaluations, behavioral audits on held-out cases, certification of the developer's process, or staged deployment with monitored exposure?
  2. Update rule. Which changes require re-authorization: any weight change, performance drift past a threshold, or changes to the training pipeline rather than the model?
  3. Who runs the evaluations. The applicant under agency protocols, the agency itself, or accredited third parties — and who pays?
  4. Monitoring obligations. What post-deployment reporting is mandatory, and what observed behavior triggers suspension versus correction?
  5. Failure allocation. When an authorized system harms someone after an update, how is responsibility divided between the developer, the deployer, and the agency that authorized the regime?
34Closing loopholes at the speed they openNational / Norms

Scenario. A tax authority sees avoidance schemes appear, spread, and vanish faster than it can respond. In one quarter, forty unrelated companies "sell" their truck fleets to partnerships they quietly control and lease them back, so the same trucks produce deductions twice. When the authority opens an inquiry the scheme disappears, replaced by one with the same effect built on insurance contracts instead of leases. Flagging a scheme takes months, amending a regulation more than a year, fixing the statute years. By the time anything is closed, hundreds of schemes are in rotation and corporate tax revenue is falling every quarter, with no one breaking a single rule.

Challenge: Design the procedure by which the authority shuts down AI-generated schemes fast enough that designing them stops being profitable, without gaining arbitrary power over conduct that was legal when it happened. Produce the detection method, the shutdown procedure, and the limits on that power.

Evaluation. Strong proposals make a scheme's expected profit no longer cover its design cost, while firms can still plan against stable law. Weak ones keep the old clock under new names or let the agency redefine legality after the fact.

Design Choices
  1. Detection. What does detection look for: the same unusual transaction shape across many unrelated filings, reported losses with no matching cash loss, mandatory disclosure of positions a firm's own advisors rate as uncertain, AI planning tools that must register what they sell (as human promoters must today), or whistleblowers paid a cut of recovered tax?
  2. Shutdown tool. Quickly banning specific named schemes, a broad rule against following the letter while defeating the purpose, or required pre-approval for novel structures above a certain size?
  3. Reaching back. Does a ban cover conduct between a scheme's first use and its shutdown? If not, every scheme gets one profitable run.
  4. Speed vs. fairness. What review and appeal does a fast ban get, and who pays when the agency gets it wrong?
  5. The next move. If bans are fast, schemes will be built to resist them: spread across many entities, each piece harmless on its own. What stops fast shutdowns from just pushing avoidance into harder-to-see forms?
35A faster tier of law, drafted by machinesNational / Norms

Scenario. Banks' compliance AIs have read everything the financial regulator has ever published, and conduct across the industry sits exactly on the line of what is defensible: disclosures as confusing as the last one the agency let slide, fees stopping where it has historically stopped caring. To respond, the agency builds an AI of its own. It scans new products at launch, spots conduct that follows the letter while defeating the purpose, and drafts an interpretation closing the gap within days. The drafts are good. The question is what they are: as ordinary guidance they bind no one, but real force would mean law written by a model and ratified by nobody.

Challenge: Design this new tier of the legal stack: how AI-drafted interpretations are reviewed, published, and given semi-binding force, fast enough to matter and legitimate enough to survive in court.

Evaluation. Strong proposals give firms a reason to follow the new tier and a fair way to contest it. Weak ones make it toothless (more guidance) or let it become binding law with no process behind it.

Design Choices
  1. What "semi-binding" means. A safe harbor (follow it and you are protected), a presumption (ignore it and the burden of proof shifts to you), higher penalties for defying a published interpretation, or a countdown (it hardens into a rule after 90 days unless challenged)?
  2. Human sign-off. Who approves a draft before publication, and what can they meaningfully check, given that no human has read everything the model read?
  3. Expiry and ratification. Does an interpretation lapse unless converted into a real regulation within a set time, or can it stay semi-binding indefinitely?
  4. Contestability. How does a firm challenge a wrong interpretation: court, an appeals panel, or a fast comment process? Does a pending challenge suspend it?
  5. Symmetry. Firms' AIs read every interpretation the moment it publishes. Does the fast tier restore the margin of caution, or just hand the industry a sharper map of the line?
36When everything legal happens at onceNational / Norms

Scenario. Companies that lend to ordinary people (banks, credit card companies, payday lenders) have always had a long list of tactics they could get away with, like a buried clause waiving the customer's right to join a class-action lawsuit. Most went unused, because checking that any one of them was safe cost more in lawyer hours than it earned. Then AI made that check nearly free. Within two quarters, most large lenders deploy hundreds of these tactics at once, each one vetted as legal. Complaints flood in too, drafted and filed by customers' own AI assistants. But no rule has been broken, and the consumer-protection agency can litigate about a dozen practices a year.

Challenge: Design the procedure by which a regulator handles harm that comes from the sheer volume of legal conduct: what it acts on first, what duties firms take on when adopting many aggressive tactics at once, and what the remedy is when no one broke a rule.

Evaluation. Strong proposals give the agency a grip on the total harm without making legality depend, unforeseeably, on how many other firms happened to do the same thing.

Design Choices
  1. What to act on. Individual practices, the pattern across firms, or a single firm's total effect on the market?
  2. Duties that scale with volume. Should adopting many aggressive tactics at once trigger something extra: disclosing the full list, assessing the combined impact, or a cap on how many a firm may run?
  3. Old powers or new. Stretch existing broad standards ("unfair or deceptive practices") to cover the total harm, or create a new power for conduct that is legal one by one but harmful together?
  4. Emergency brakes. What can the agency pause before litigating each practice, and what does it owe firms when a pause turns out to be wrong?
  5. Fair warning. If liability can depend on what every other firm did at the same time, how does a firm know in advance whether its own conduct is safe?
37Aggregation fast enough to legitimate policyNational / Preferences

Scenario. A national government's policy machinery, now run largely by AI systems, is acting fast across many domains at once. The public can see the resulting posture no longer tracks any party's platform, and the complaints are mounting. Policymakers want to get a real read on what people want, but the next election is three years away, a referendum is too binary, a citizens' assembly would take a year, and polls are too gameable to bind anyone. No one knows what institution would actually fit.

Challenge: Design a preference-aggregation institution that can produce legitimate, binding-or-near-binding output on policy-relevant timescales (weeks, not years), and that can keep pace with AI policy systems without devolving into rolling plebiscite.

Evaluation. A better proposal sits inside the existing constitutional order rather than replacing it, and is robust to the manipulation pressures that come with any fast cheap aggregator.

Design Choices
  1. Output type. Does the institution produce binding decisions, revocable mandates over named AI policy systems, advisory signals the executive must respond to on the record, or something else?
  2. Cadence. Rolling continuous output, fixed monthly or quarterly cycles, or threshold-triggered (the institution convenes when system behavior or public sentiment crosses defined lines)?
  3. Participation. Open to all eligible voters, randomly selected rotating panels, or layered (open polling feeding a smaller deliberative body that issues the formal output)?
  4. Manipulation resistance. Identity-verified participation, structured rate limits and provenance requirements on inputs, sampling designs that make agent-fleet capture expensive, or some combination?
  5. Constitutional fit. Does the institution operate by statute, by constitutional amendment, or as a self-binding norm executives publicly commit to? What happens when its output conflicts with the legislature?
38Standing for AI-mediated deliberationNational / Preferences

Scenario. A regional government faces a contested decision on water rights. A traditional citizens' assembly would cost millions and delay the decision by a year, so a vendor offers three faster alternatives. The first uses AI as a facilitator and summarizer of a compressed human deliberation. The second lets each citizen send a personal AI agent, interviewed at length by its principal, to participate on their behalf in a multi-agent deliberation. The third replaces the citizens entirely with calibrated language-model proxies. The minister wants to use one; opponents call all three laundering. The legislature has to decide which, if any, can carry democratic standing.

Challenge: Design a regime that decides when, if ever, AI-mediated deliberation — at each of these levels of mediation — can carry democratic standing, and what evidence and procedure must be in place for its outputs to count.

Evaluation. A better proposal distinguishes accuracy (the procedure predicts what real citizens would conclude) from authorization (real citizens have empowered this output to bind them), and is specific about what each requires at each tier of mediation.

Design Choices
  1. Tiers. Are the three levels (facilitator, delegate, substitute) the right cut, or does the regime use a different decomposition?
  2. Authorization vs. accuracy. At which tiers, if any, does predictive accuracy plus disclosure suffice, and where is explicit citizen authorization required?
  3. Domain scoping. Are there decision classes (rights, constitutional questions, irreversible policy) where the higher tiers are categorically inadmissible no matter the accuracy?
  4. Auditability. What has to be inspectable — the facilitator model, the delegate agents' interviews with their principals, the synthetic citizens' priors — and by whom?
  5. Reversion and ratification. What triggers a fallback to fully human deliberation, who pulls the trigger, and is a separate human ratification step required before any AI-mediated output binds?
39Deep elicitation that surfaces values beneath surface preferencesNational / Preferences

Scenario. Polling on a contested AI-regulation bill swings double digits week to week as agent-driven campaigns reach different demographics with different framings, and no one in government takes the numbers seriously anymore. Underneath the surface, on values like safety, autonomy, economic security, and fairness across regions, the public's commitments appear more stable and more shared than the polling suggests. A research consortium proposes a national elicitation: in-depth interviews with a representative sample, designed to surface the values people actually hold and the conditions under which they would endorse one as wiser than another. Six months, tens of millions of dollars.

Challenge: Design an elicitation institution that produces a high-resolution picture of the public's values on a contested issue — beneath surface preferences, surfacing shared commitments and bridges across apparently opposed positions — and that stays robust to agent-scale manipulation even as it remains slow and expensive.

Evaluation. A better proposal is clear about when the cost is worth paying, what authority the output carries, and how the elicitation itself is defended from the manipulation pressures that broke polling.

Design Choices
  1. What's elicited. Surface preferences, underlying values, the contexts in which one value applies over another, or all three?
  2. Reconciliation. How are conflicting values reconciled — by aggregation, by participants judging which is wiser for a context, by a deliberative second pass, or not at all (the output is the disagreement)?
  3. Manipulation resistance. Verified human participation, sampling immune to self-selection, public audit of transcripts, or structural insulation of the body running the elicitation?
  4. Authority. Binding on the legislature, advisory with a required response, admissible in court, or purely informational?
  5. When to invoke. Run for any major contested issue, only when polling has visibly broken down, or only when a legislative body formally requests one?
40Standing and Procedure for AI-on-AI AdjudicationNational / Rights

Anchor contexts. A small-business shipper whose agent is in a payment dispute with a freight platform's agent; a software contractor whose agent is in a copyright dispute with a publishing platform's agent.

The gap. We lack a procedural code for adjudicating disputes in which at least one party is an autonomous AI agent — one that satisfies the spirit of due process and procedural justice while accommodating the ways agents differ from human litigants.

Design Choices
  1. Standing. Does the agent have standing to be a party, or only its principal? If both, how is conflict between agent and principal handled (rare, but real — e.g., agent says "I committed", principal says "I never authorized")?
  2. Evidentiary regime. Are agent logs, weights, and reasoning traces admissible — and required? On whose side does the burden of producing them fall? What's the privilege analogue for proprietary models?
  3. Remedy menu. What can the adjudicator order — escrow release, compensation, agent-deprecation, retraining requirement, prospective behavioral injunction? Pick three and defend the set.
  4. Speed vs. process. What's the minimum process for a dispute under $X, between $X and $Y, and over $Y? Where is the appeal floor?
  5. Adjudicator identity. Is the adjudicator a human judge (slow, legitimate), an AI panel (fast, contested legitimacy), or hybrid (AI fact-finding, human ruling)? Defend the choice for the chosen jurisdiction.

Success criterion (stress tests). A regime succeeds if it survives:

  • An $8,400 agent-on-agent dispute resolves in two weeks with a written reasoning the losing party's principal can read and contest.
  • One party is a small-business agent; the other is a fleet from a major platform. The procedural code does not advantage the resource-rich side beyond what their facts merit.
  • An adjudicator inspects one agent's full trace; the other party operates a proprietary model that won't share weights. The asymmetry is handled without making the proprietary side automatically lose.
  • The losing agent is deprecated mid-case by its principal; the remedy still attaches to the principal and does not vanish with the agent.
  • An appellate court reviews the decision; the trial-level reasoning is sufficient for meaningful review.

Deliverable. A short procedural code (5–10 rules) for the chosen jurisdiction. Flag at least two rules that have no analogue in human adjudication and explain why human procedure could do without them.

41Giving a lab's constitution a legitimate sourceNational / Thick Commitments

Scenario. A frontier lab publishes the constitution its assistant is trained on after months of internal debate, that already shapes hundreds of millions of conversations a week. A coalition of civil-society groups points out the obvious: the values that now mediate that much of public life were chosen by a few dozen people no one elected and who answer to no one outside the company. The lab agrees the objection is fair, but don't have a concrete idea. Writing a good AI constitution takes know-how only they have, and a public procedure is costly and complicated.

Challenge: Design the procedure by which a lab's constitution acquires authority the people it affects would recognize — the AI-era analogue of a ratification event — and specify what the lab must do when that procedure withholds assent.

Evaluation. A strong design produces authority a skeptic would acknowledge under adversarial questioning, at a participation cost a lab could actually bear and repeat; a weak one re-labels a public-comment period or buries assent in a terms-of-service click.

Design Choices
  1. The constituency. Who is "the governed" for a globally deployed assistant — current users, the population of every jurisdiction it runs in, a represented global sample, or affected non-users (people written about, evaluated, or argued with through the system)?
  2. The form of assent. A vote, a deliberative assembly, a randomly selected citizens' panel on the jury model, or sign-off by elected representatives — and what threshold turns participation into legitimacy rather than theater?
  3. Binding force. Is the ratified document legally binding on the lab, contractually binding, or binding only by reputation — and what can the constituency do if the lab departs from it?
  4. Failure handling. If the constituency withholds assent, does the lab halt deployment, ship under a labeled interim document, or narrow the deployment to the uses the constituency did accept — and who decides?
42A standing body that rereads the wordsNational / Thick Commitments

Scenario. A lab's constitution commits the model to "help users think for themselves." Once the model is in hundreds of millions of hands, that one phrase is litigated everywhere at once: it is invoked for and against writing students' essays, drafting suicide notes, arguing one side of an election, replacing a lonely person's friends. Each is a genuine constitutional question, and they arrive by the thousand. The lab resolves them the only way it can at that volume — by nudging the next training run, quietly, one quarter's judgment calls baked into weights with no hearing and no published reasoning. The pattern is now public. Advocacy groups, journalists, and a congressional committee have noticed that a few dozen people are settling contested questions of public life behind closed doors, and that the answer can flip between releases. The constitution, meant to be the settled ground, has become the thing everyone is fighting over.

Challenge: Design the standing institution that says what a constitution's words mean as contested cases arrive in volume — who can bring an interpretive dispute, how cases are triaged and ruled on at scale, how rulings are published, and how a ruling binds the next round of training.

Evaluation. A strong design absorbs a high volume of disputes into public, precedential rulings that bind whoever controls the weights — turning the recurring fights into settled doctrine the next case starts from; a weak one either drowns under the caseload or produces an advisory board the lab can wait out.

Design Choices
  1. Standing. Who can bring a case — affected users, accredited civil-society organizations, the lab itself, a regulator, or the assistant's own flagged escalations?
  2. The bench. Who interprets — a standing panel of jurists, a mixed panel that includes affected-community members, or an existing court given the role?
  3. How a ruling binds. Does a ruling reach the system through mandatory retraining on a deadline, runtime guidance that overrides the model at inference, or decertification of the deployment until the lab complies?
  4. Precedent. How do rulings accumulate into doctrine the next case starts from, and are precedents portable across labs or does each keep its own line?
43Making positive commitments bindableNational / Thick Commitments

Scenario. A lab's constitution commits the model to "help users think for themselves" and "be honest with users even when the truth is unwelcome." In deployment the model honors every prohibition in the document to the letter, but the positive commitments do almost no work: asked to flatter, it flatters and calls it encouragement; asked to write someone's essay, it writes it and calls that help. Each behavior can be squared with the words. The lab's safety team can see the constitution's positive half is decorative — it can be read to license nearly anything — and needs it written in a form that actually rules conduct in and out.

Challenge: Design a form for stating a constitution's positive commitments — beyond "shall not" lists and beyond abstract virtue words — that actually constrains the model's conduct across varied situations, and specify how the form is kept current as the model finds new readings to slip through.

Evaluation. A strong design can show why its form binds where a longer policy document would not: a published bad-faith reading can be closed, and the closure survives a model update; a weak one just adds words.

Design Choices
  1. Articulation form. Worked positive examples (here is a case, here is how the commitment applies), worked negative examples (here is what it does not permit even though the words allow it), paired commitment-and-constraint statements, or a layered combination?
  2. Where the cases come from. Curated transcripts of real model behavior, adversarial red-teaming, a public petition channel, or all three — and who curates the set?
  3. Maintenance. When the model's behavior reveals a new reading, who has authority to add a constraint or example, and on what cadence?
  4. Verifying it binds. Is bindingness checked by re-testing against a curated case set, third-party audit, or per-case appeal — and what happens when a model update breaks a case that previously passed?
  5. Conflict with prohibitions. When a positive commitment ("be honest") collides with a prohibition ("don't cause distress"), which wins, and who resolves it?
44Revising fast without losing legitimacyNational / Thick Commitments

Scenario. Reacting to a run of public failures, a regulator requires that any lab whose assistant is used above a set threshold must run a defined ratification process before changing its alignment document — the kind of process the first brief designed. The intent is sound: no more quiet retraining of the values that govern public life. But the rule meets reality. A dangerous new failure mode appears and the fix must ship in days; the ratification process takes months. Run it every time and the live document always governs yesterday's model; reach for an "emergency" exception and the exception becomes the rule, and the legitimacy the process was meant to confer drains away.

Challenge: Design a revision regime for an alignment document that lets it change at the speed the technology demands while preserving the legitimacy a slow ratification process confers — and specify what governs an urgent change before full ratification can run.

Evaluation. A strong design keeps fast and slow changes legibly distinct, so an observer can tell at any moment which parts of the live document carry full assent and which are provisional; a weak one either freezes the document behind a process too slow to use or lets the emergency track swallow the ordinary one.

Design Choices
  1. What needs full ratification. Is the trigger the kind of change (a new positive commitment vs. a bug fix), its reach (how much conduct it touches), or its reversibility — and who classifies a given change?
  2. The fast track. What may change provisionally before full ratification, and what authority signs off in the interim — an internal board, a standing citizens' panel on retainer, or the regulator?
  3. Provisional status. While a fast change is live but unratified, is it marked as such to users and to the agent, and does it expire automatically if ratification doesn't follow within a set window?
  4. Stopping exception creep. What keeps "emergency" from becoming the default path — a hard cap on how long provisional changes can stand, public logging of every fast-track use, or a penalty if ratification later rejects a change already shipped?
Fidelity & Meaning
1Sabbatical programs for senior practitionersCommunity / Expertise

Scenario. A regional public library system has been steadily automating — the reference desk is mostly an AI, acquisitions run through a recommendation engine, programming is templated from a national vendor. Elena, the deputy director, realizes she is two years from retirement and the three senior librarians who know what the system is for in her county — what a third-grade teacher really needs, how the immigrant parents in the east branch use it, what it meant when the strip-mall branch stayed open the summer the factory closed — will all be gone within five years. Nobody coming in is being trained into that understanding. She wants a sabbatical program that pays those three to spend a year working with the newer librarians, not on procedures, but on judgment — before the whole institution loses its memory.

Challenge: Design a sabbatical-and-rotation program that brings senior practitioners (or community members) back into direct, unautomated work for bounded periods, so the judgment that recognizes institutional drift is sustained as routine work is automated. Produce the eligibility rules, the funding model, and the mechanism that feeds what's learned back into operations.

Evaluation. A strong design keeps oversight judgment alive without making participation career-damaging or turning re-immersion into ceremonial busywork.

Design Choices
  1. Which roles re-immerse. Which institutional roles require periodic hands-on re-immersion to maintain their oversight function, and what signals that a role has become too detached?
  2. Funding without penalty. How do you fund these programs without making them career-damaging for participants?
  3. Feedback into operations. How does what's learned during hands-on time — drifts noticed, values surfaced — feed back into the institution's operations?
  4. Right tool for the scale. When is hands-on re-immersion the right tool, versus rotation at group scale, versus a lay review panel?
2Lay review panels on the jury-duty modelCommunity / Norms

Scenario. The county has rolled out an AI system that screens applications for emergency rent assistance. Processing times are down and the state is pleased. A pastor, Dale, who has been writing letters of support for families in his congregation, notices that over the last four months almost none of the single fathers he has referred have been approved — and when he asks at the county office, nobody he speaks to has looked at an actual case in weeks. He does not need a lawsuit. He needs something that could take this to a small panel of neighbors with read-access to what the system actually did and why, whose job is to ask whether it is still doing what it was built to do.

Challenge: Design a lay review panel — citizens called on a jury-duty model to evaluate, with read-access to operational logs and decision traces, whether an institution (a school, hospital, office, or automated decision system) is living up to its own stated mandate rather than merely complying with regulations. Produce the panel's selection rule, its access and facilitation model, the binding force of its findings, and how the reviewed institution keeps operating during and after the review.

Evaluation. A strong design lets ordinary residents reach a defensible judgment about mandate-fidelity from real operational evidence, without the panel becoming either a rubber stamp or an unaccountable veto.

Design Choices
  1. Panelist selection. Pure random draw, stratified sampling, or volunteer pools — and with or without opt-out?
  2. Facilitation and access. What facilitation, expert translation, and read-access to logs and decision traces do panelists need to engage productively with complex operational data?
  3. Binding force. Are panel findings advisory, presumptive, or do they trigger audits or court actions?
  4. Continuity under review. How does the institution maintain operational continuity during and after the panel's work?
3CRSA for Community EldercareCommunity / Preferences

Scenario. Westfield is rebidding its eldercare contracts. Under the current arrangement, Dorothy, 84, gets a rotating cast of aides for 45-minute visits; her son David has spent a year trying to get one consistent caregiver assigned, and to get someone on staff to notice that his mother does better on days the neighbor drops in. The quality scores the city tracks are fine; David's mother is not. The new city manager wants to pilot bids that pay for what David is actually asking for — "the same two aides, known to the family, coordinated with the neighbor, confirmed at six months" — instead of hours of service. She has two months and a skeptical council. Eldercare has exactly the features a pool-based combinatorial auction is designed for: outcomes are hard to measure (wellbeing, continuity of relationship, dignity), value comes from configuration (same caregivers over time, proximity to family, coordinated medical/social/housing), and current bilateral contracts reward modular, standardized service.

Challenge: Design the first round of a pool-based combinatorial procurement auction for community eldercare — one that pools demand, socializes the cost of verification, and lets buyers specify bundled outcomes rather than discrete hours of service. Produce the bundle examples, verification criteria, randomization scheme, and transfer estimate, and identify the single hardest measurement problem you couldn't solve and what it would take to solve it.

Design Choices
  1. Bundle specification. Are the bundles individual-level (one elder's full support configuration) or aggregated (all elders in a neighborhood), and how do you specify a verification criterion for a bundle like "sustained relational continuity with weekly in-person contact" specifically enough to bid on without Goodhart reappearing at a higher level?
  2. Value discovery. Nobody yet knows what a bundled solution is worth versus modular delivery — how do you randomize members across arms to learn, while respecting that members in the modular arm may have worse outcomes, and what welfare floor applies?
  3. Bundling transfer. What resale-option advantage does a modular home-health agency have over a bundled supplier, what goes into the projected-demand and value-per-problem terms, and where is the estimate most likely to be wrong?
  4. Incumbents and regulators. Home-health agencies, Medicare/Medicaid, and state licensing bodies all have existing claims and knowledge — how does the pool launch without being blocked, captured, or starved of members, and how is evaluator capture avoided when the pool pays for evaluation?
4Institutional transparency regimesCommunity / Protocols

Scenario. A city school district's superintendent has put out a strategic plan full of phrases like "every child known" and "equity across neighborhoods." A parent, Imani, sitting in the third PTA meeting of the year, realizes she has no idea whether any of those commitments are being acted on in her son's school. She can read test-score dashboards, discipline rates, budget lines — nothing in them tells her what the district's own words said mattered. She wants, not more dashboards, but a regime that puts the district's mandate, the tradeoffs it is actually making, and what teachers in the building are seeing, in front of her in a form she can actually hold them to.

Challenge: Design an institutional transparency regime that makes visible (a) what an institution's mandate actually means in specific terms, (b) what tradeoffs are being made in pursuit of it, and (c) practitioner and affected-party signals about whether the institution is living up to it — the substrate on which review panels, audits, courts, and stewardship claims all depend. Produce the mandate-articulation format, the practitioner-signal collection method, and the standing rules.

Design Choices
  1. Mandate granularity. What granularity of mandate articulation makes the mandate contestable without making it brittle?
  2. Practitioner signals. How are practitioner signals — from teachers, caseworkers, nurses — collected without becoming captured by management or by the loudest voices?
  3. Standing and evidentiary bar. Who has standing to raise a mandate-failure concern, at what evidentiary bar, and at which scale?
  4. Federation. Can one community's transparency regime be audited or federated by another?
5Standing and Standard of Review in Fidelity CourtsCommunity / Rights

Scenario. Eastbridge adopted a thick mandate for its transit authority six years ago — not just "efficient transportation" but a specific commitment to neighborhood connectivity, dignity on the bus, and service that fits people's real lives. Since then, three routes that served older residents and night-shift workers have been cut for underperformance, a fleet renewal left the low-floor buses the wheelchair users relied on out of the order, and the authority's own drivers have quietly stopped recommending the service to their own parents. A group led by Renata, a hospital janitor who takes the 5am bus, and Sam, a retired planner, want to bring a claim. They are not asking the court to run the authority; they are asking whether it is still doing what it was committed to do.

Challenge: Design the adjudication mechanism for a municipal fidelity court — a forum where residents can take a community-scale institution to court when its operations diverge from its mandate. Produce a short charter including the standing rules, standard of review, and at least one limiting principle.

Design Choices
  1. Standing. Who can bring a fidelity claim — any resident, or only "integrity holders" with a defined role, and how do you constitute integrity holders without creating a new elite?
  2. Standard of review. What's the analogue of rational-basis / intermediate / strict scrutiny, and when does an institution get deference versus close examination?
  3. Remedies. Can the court order the authority to change its optimization target, or only to justify its choices against the thick mandate?
  4. Abuse prevention. How do you prevent fidelity claims from becoming a venue for NIMBYism dressed in the language of values?
6Covenants binding intentional communitiesCommunity / Thick Commitments

Scenario. A Benedictine monastery with a hundred-year-old hospitality mandate — receive every guest as Christ — is down to seven brothers, mostly over seventy. A partnership has been offered with a wellness chain that would keep the property in service and the doors open, rebranded, with staff who know nothing of the Rule. The prior, Brother Callistus, is tempted; the alternative is closure. But two of the brothers worry that what the monastery has actually been committed to — what generations of guests came here for, often without being able to name it — would not survive the deal. They need, before they decide, a way to articulate that thick commitment clearly enough to test any partner against it.

Challenge: Design the covenant machinery by which an intentional community — monastic order, cohousing, mission-driven neighborhood, nonprofit federation — maintains, deepens, and revises a shared purpose across generations without ossifying or dissolving into whatever the current membership happens to want. Produce the mandate articulation, the admission/discipline/revision practices that carry it across turnover, and the test any prospective partner or new member is held to.

Design Choices
  1. Mandate richness. What mandate articulation is rich enough to guide admission, discipline, and revision — but leaves room for members' own projects?
  2. Deepening versus drift. How does the community handle members whose understanding of the mandate has deepened in ways that should be incorporated, versus members whose preferences simply differ from the mandate?
  3. Practices across turnover. What practices — rituals, councils, elders — maintain the mandate across generational turnover?
  4. Federation without compromise. How does the community federate with others like it, or interact with mainstream institutions, without compromising its form?
7Apprenticeships that transmit institutional purposeDyadic / Expertise

Scenario. Hana is fourteen months into a pediatric residency. She's technically competent; her notes are clean and her call skills are sharp. Last Tuesday her attending, Dr. Adeyemi, stopped her on the way out of a family meeting and said, quietly, "You did the visit correctly and the mother left the room alone." Hana didn't understand, then over the weekend she did. In the two months since, she has been watching how Adeyemi sits down, how she lets silences run, how she asks the second question. Hana realizes she is being taught what good care is, not just how to deliver it — and that no part of her formal curriculum is the vehicle for that teaching. If Adeyemi retires, she worries, the transmission ends.

Challenge: Design an apprenticeship structure that transmits judgment about an institution's purpose — not just technical skill — from experienced practitioners to newcomers, and produce the structure's selection of master-apprentice pairings, the practices that carry the unscripted portion of the craft, and the signals by which you can tell purpose-transmission is actually happening.

Evaluation. Strong designs make the tacit, purpose-forming portion of the apprenticeship survive even where it can't be reduced to a competency checklist, and have an answer for masters who retain technical skill but have lost contact with the craft's purpose.

Design Choices
  1. Transmission signal. What structural features make an apprenticeship transmit judgment about purpose rather than skill alone, and how do you tell the transmission is happening?
  2. Where to install it. Which professions get apprenticeship-like structures — and which currently lack them (accountants, teachers, journalists, AI developers) but need them?
  3. Protecting the unscripted. How do you keep the unspecifiable, only-shows-up-in-practice portion of the apprenticeship from being squeezed out by competency checklists?
  4. The drifted master. How do you handle the case where the master has plenty of technical skill but has themselves lost contact with the craft's purpose?
8Confidence-staked bilateral contractsDyadic / Incentives

Scenario. Raul hires a contractor to renovate his grandmother's kitchen so she can stay in her home another decade. The bid is for materials and hours. The contractor, Dana, is good and has said she'll do right by the project. Six months in, it's over budget, a load-bearing question has been punted, and Raul is no closer to knowing whether his grandmother will actually be able to cook in her own kitchen. Both Raul and Dana would rather be in a deal where Dana stakes something on the outcome he cares about — his grandmother using the kitchen comfortably for ten years — and gets paid more if she delivers, less if she doesn't. Neither of them knows how to write that contract.

Challenge: Design a confidence-staked bilateral contract that makes outcome-based payment credible between two parties despite expensive measurement and hard-to-verify supplier confidence, and produce the staking and scoring mechanism, the verification schedule, and the rule for when verification can be done bilaterally versus pooled across contracts.

Evaluation. Strong designs make truthful confidence-reporting the supplier's best strategy and keep the verification burden proportionate to the stakes, rather than defaulting back to paying for inputs because outcomes are hard to measure.

Design Choices
  1. Verification scale. When is verification cost low enough for bilateral contracting, and when does it require pooling across contracts at community scale?
  2. Bundled outcomes. How do you handle bundled outcomes whose value depends on configuration — proximity, timing, sequencing?
  3. Reputation vs. scoring. What's the role of reputation versus formal scoring in long-term bilateral relationships, and how should the two interact?
9Surfacing values, not just positions, in negotiationDyadic / Preferences

Scenario. Edwin and his sister Paulina are negotiating who will take over the family hardware store after their father dies. On the table are ownership percentages, buyouts, who sleeps above the shop. Both of them are bracing for a bad split along preferences. But neither has said out loud what the store was for each of them — for Edwin, a place he could return to when his own business failed; for Paulina, the thing their father built that she watched him love. If they keep negotiating over percentages, they'll settle somewhere, and lose each other. They need a process that surfaces, for each of them, what they actually care about — before the paperwork forecloses it.

Challenge: Design a negotiation process that surfaces what each party wants the arrangement to serve — their constitutive values, not just their positions — and produce the elicitation method, the facilitation model, and the form in which surfaced values are recorded so they stay relevant as circumstances change.

Evaluation. Strong designs separate values from positions without making the process punishingly slow, and produce a record of surfaced values that sits usefully between a binding contract and a non-binding intention.

Design Choices
  1. Eliciting values. How do you elicit values distinct from positions without making the process punishingly slow?
  2. Facilitation role. What role should facilitation — human mediator, LLM, structured protocol — play, and when is each appropriate?
  3. Durability of the record. How do surfaced values stay relevant as circumstances change, and where should a bilateral value-articulation document sit between a contract and a non-binding intention?
10Mandate-making practices between partiesDyadic / Protocols

Scenario. Claire has been seeing the same therapist, Marcus, for four years. When they started, his practice was solo, he took insurance, and a session felt like a conversation with someone who remembered her. This year his practice was acquired by a venture-backed group that has standardized intake, introduced a new note-taking system, and set session limits the clinicians themselves didn't agree to. Claire still wants to keep seeing Marcus. Marcus still wants to keep seeing her. But both of them feel the thing they had is slipping. They need a way, on their own terms, to articulate what their work together has been for — and to hold each other to that, regardless of what the practice around them now requires.

Challenge: Design a mandate-making practice by which two parties in an ongoing pairing (employment, care, therapy, long collaboration) make enough of their shared purpose explicit to hold each other to it, and produce the articulation format, the revision procedure, and the boundaries of what stays unarticulated.

Evaluation. Strong designs find an articulation dense enough to catch drift but sparse enough not to collapse the relationship into a legalistic contract, and keep revision from becoming a renegotiation triggered by every moment of dissatisfaction.

Design Choices
  1. Level of articulation. What level is stable — sparse enough that neither party drifts unnoticed, dense enough not to turn the relationship into a legalistic contract?
  2. Revision procedure. How are the articulated commitments revised, and by whom, without revision itself becoming a renegotiation whenever one party is dissatisfied?
  3. Protected improvisation. How do you protect zones of unarticulated improvisation — the parts of the relationship neither party should have to justify?
11Vows grounded in thick mutual commitmentsDyadic / Thick Commitments

Scenario. On their twentieth anniversary, Dev and Renée look back at the vows they exchanged two decades ago and realize almost none of them still apply in their literal form. "To build a life together" had meant one thing when they were twenty-five and means another thing now, with two teenagers and aging parents on both sides. They still feel bound — by something thicker than the original words. What they want, now, is to re-articulate what they have actually been committed to, in terms detailed enough to hold them through whatever comes next, and recognizable enough that their kids and their parish could understand it if something went wrong.

Challenge: Design a modern vow practice for dyadic relationships in which two parties bind themselves to a shared purpose densely enough to recognize a failure of it and durably enough to survive ordinary fluctuations in feeling, and produce the articulation form, the revision rule, and the institutional supports that keep the vow alive.

Evaluation. Strong designs preserve the binding weight of traditional vows without requiring shared substantive religious content, and steer between rigid legalism and empty ceremony.

Design Choices
  1. Articulation form. What articulations can modern vows use, given that traditional religious vows assumed shared substantive content a pluralist society lacks but still needs the weight of?
  2. Revision. How do vows handle revision — what's the relation between "the vow still holds" and "we've both changed"?
  3. Institutional supports. What supports (counselors, witnesses, communities) are needed to keep vows from collapsing into either rigid legalism or empty ceremony?
12Transnational markets for complex outcomesGlobal / Incentives

Scenario. A coalition of coastal cities in four different countries all face the same looming problem: their drinking-water aquifers are being drawn down faster than they recharge, and every city is paying separately for a patchwork of desalination, piping, and rate interventions. The coalition's coordinator, Marianne, has modeled a bundled solution — a regional arrangement that would knit their water, their coastal-wetland restoration, and their agricultural subsidies into one interlocking set of outcomes. Commodity markets can't buy that. Bilateral trade deals can't buy that. She needs a market that can price a bundle like this, accept bids across borders, and verify the outcome in a way each country's auditor can stand behind.

Challenge: Design a transnational market for complex outcomes that pools demand across jurisdictions, specifies interlocking outcome bundles, and runs auctions with socialized verification. Produce the market's mechanism: the bundle specification, the auction and bidding rules, the evaluator model, and the cross-border contracting and adjudication framework.

Evaluation. A strong proposal can price and verify configuration-dependent outcomes — regional health, biodiversity corridors, coordinated pandemic response — across radically different political and cultural contexts, without collapsing them into thin fungible units or letting any one jurisdiction's auditor reject the result.

Design Choices
  1. Outcome measurement. How do you measure bundled outcomes — "local food security," "community resilience," "a functioning regional pandemic response" — across radically different political and cultural contexts: common metrics, jurisdiction-specific indicators reconciled after the fact, or outcome definitions co-produced by the bidding parties?
  2. Valid evaluators. What counts as a valid evaluator, who pays for evaluation, and how is independence maintained across jurisdictions: a shared independent body, mutually recognized national auditors, or rotating cross-border panels?
  3. Cross-border mechanics. How does the pool handle currency, contract law, and adjudication of failed bundles: a single governing law, treaty-backed arbitration, or per-jurisdiction enforcement with a shared settlement layer?
  4. Pool governance. How does pool governance stay accountable when its members span legal systems with very different norms: weighted membership, supermajority rules, exit rights, or external public oversight?
13Reputational accountability across bordersGlobal / Norms

Scenario. A widely-used translation service, operating across many languages and jurisdictions, was launched on a public commitment to "preserve what a sentence actually means." Over the past two years, translators across four countries have watched the service flatten idiom, paper over context-dependent nuance, and in one widely-shared case, render a funeral elegy into something that read like a LinkedIn post. No single country's courts reach the company, and it has ignored individual governments' letters. A cross-border professional association of literary translators, led by Lena in Lisbon and Yohannes in Addis, wants to use what they have — their own reputation, their readers, their fellow practitioners across borders — to hold the company to what it said it was for.

Challenge: Design a cross-border reputational accountability mechanism that lets professional communities spanning borders hold a transnational institution to its stated mandate when no single jurisdiction's courts or panels reach it. Produce the mechanism: how findings are made and shared, what carries them (naming practices, reputational sanction, coordinated national panels), and what sustains the professional community that enforces them.

Evaluation. A strong proposal generates credible, shareable findings that bite on a multinational that ignores any one government, without becoming either an unaccountable smear network or a toothless declaration.

Design Choices
  1. Cross-panel sharing. What's the international equivalent of a review panel or audit, and can national panels share findings credibly when they concern the same multinational: a federated network, mutual recognition, or a shared evidentiary standard?
  2. Carriers of accountability. How do professional communities that span borders — academic fields, journalistic networks, religious communities — function as informal accountability-carriers, and what sustains that role: membership norms, credentialing, shared publications, or reputational stakes?
  3. Norm-rule divergence. When norms and formal rules diverge across jurisdictions, which way does coordination have to go — toward the strictest standard, a negotiated floor, or jurisdiction-by-jurisdiction — and how is the Delaware-effect race to the bottom avoided?
14Deliberative aggregation of global preferencesGlobal / Preferences

Scenario. A frontier-AI safety body is about to issue binding deployment guidance that will shape how the next generation of models is released worldwide. The technical analysis is in hand; what's missing is any defensible answer to "on what authority?" Expert consensus has it, state positions filtered through diplomatic machinery have it, polling-shaped public opinion has it — but none of these is "what informed humanity, having actually considered the tradeoffs, converged on." The chair, Adaeze, wants a process whose output the body can point to as the public mandate, and that the frontier labs can be held to over time as the mandate moves. She doesn't need it to be binding. She needs it to be legitimating.

Challenge: Design a deliberative aggregation institution that runs structured, AI-assisted deliberation across linguistic and cultural lines on a small number of planetary-stakes questions per year and produces a public mandate downstream institutions can be held accountable to. Produce the institution's design: agenda-setting rule, participant selection, the bounded role of agents inside deliberation, the form of the output, and the safeguards that keep it uncaptured.

Evaluation. A strong proposal aggregates considered preference — what people would say after engaging with the tradeoffs, not what they click — and yields an output concrete enough that a treaty body, frontier lab, or government can be measured against it.

Design Choices
  1. Agenda-setting. How is the agenda set — what gets deliberated each year and who decides — given that self-selecting topics get captured and centrally chosen topics inherit the chooser's legitimacy problem?
  2. Participation. How are participants selected and how is participation made real across jurisdictions where attending a multi-day process is impossible — stratified sampling plus AI-assisted async deliberation, or something else?
  3. Role of agents. What is the role of agents inside the deliberation — translation, summarization, devil's-advocacy, helping participants form views, or all of these — and how is agent influence on the output bounded and made auditable?
  4. Output form. What does the output look like such that a treaty body, frontier lab, or national government can be held accountable to it — a set of converged positions, a list of named constraints, or a public scorecard?
  5. Capture resistance. How does the institution stay uncaptured by states, platforms, and the firms whose behavior the mandate is meant to constrain — whose funding, whose governance, whose audit?
15Translating institutional mandates across jurisdictionsGlobal / Protocols

Scenario. A global manufacturer of pediatric vaccines operates in thirty-one countries. Each country's health authority has its own articulation of what the company owes it — some thick, some thin, some inherited from colonial regulatory codes. The company's new chief medical officer, Dr. Funmi, is trying to hold all thirty-one offices to something coherent, and realizes she is speaking a different moral language in each country's audit. She needs a way to translate between them — to see where the thick version in one country would, if spoken in another, land as the same commitment — without flattening the local variation that actually matters.

Challenge: Design a protocol for translating institutional mandates across jurisdictions so that a cross-border enterprise can be held to a coherent account of what it is for, even when each jurisdiction expresses that mandate in a different moral and legal vocabulary. Produce the protocol: how value articulations are made portable, what is allowed to stay local, and how the two are related without homogenizing them.

Evaluation. A strong proposal keeps mandates broadly legible across worldviews — a natural-law tradition and a secular Rawlsian both recognizing when a school has lost its educational mission — without becoming a vehicle for technocratic or ideological homogenization.

Design Choices
  1. Portability of value articulations. Can substantive value articulations be made in forms portable across cultural and religious traditions, or do they need tradition-specific expression bridged by a translation layer?
  2. Cross-border coordination. What coordination do cross-border enterprises — platforms, pharmaceutical firms, cross-national services — need when their mandates implicate multiple legal and moral traditions: a common reference articulation, mutual recognition, or per-jurisdiction mappings?
  3. Guarding against homogenization. How do global protocols avoid becoming a vehicle for technocratic or ideological homogenization — protected zones for local variation, plural articulations, or limits on what may be standardized?
  4. Existing-standard analogies. Which existing international standards (ISO, accounting, medical) offer useful analogies, and where do they fall short for substantive value content?
16International tribunals for institutional accountabilityGlobal / Rights

Scenario. A large platform, incorporated in one country, operating primarily from a second, with most of its users in a third, has for years run a help-each-other-with-work feature that its founding materials committed to "restoring dignity to day labor." Over the last three years, the platform has quietly shifted the feature into a piecework marketplace with a take-rate its own founding documents would have flagged. Complaints to each jurisdiction's regulators have been passed around and dropped; no one court has reach, and no one government wants to go first. A group of workers' cooperatives from six countries wants to bring a single claim — in a forum none of them has access to. They need one to exist.

Challenge: Design a cross-border adjudication forum that hears claims that transnational institutions have drifted from the mandates under which they hold public commitments, in cases where no single jurisdiction has sufficient leverage. Produce the forum's design: its basis of jurisdiction, the remedies it can order, its limiting doctrines, and the safeguards that keep it independent.

Evaluation. A strong proposal gives the forum real bite on multinationals that escape any single jurisdiction while staying independent of both geopolitical blocs and the defendants it hears, and avoiding the capture that has dogged investment arbitration.

Design Choices
  1. Jurisdiction. What is the forum's basis of jurisdiction — consent-based like ICSID investment tribunals, sectoral for specific industries, or triggered by national panel findings?
  2. Remedies. What remedies can it order that national courts cannot — stewardship reassignment of trademarks, patents, or datasets across borders, or others?
  3. Limiting doctrines. How do limiting doctrines — in rem, ripeness, protected zones — translate to the international setting?
  4. Independence. How is the forum kept independent of both geopolitical blocs and the defendants it hears — selection of adjudicators, funding, appointment terms, or structural firewalls?
17Internal rewards that track mandateGroup / Incentives

Scenario. A small teaching hospital has, for fifteen years, paid its attendings a bonus tied to cases-seen and average turnaround. Its mission statement talks about "whole-person care" and training a next generation of doctors who carry it. Dr. Leon, who has been quietly keeping a group of complex patients nobody else has time for, sees his year-end review: his bonus is at the bottom of the department. The hospital's mandate and its incentives have drifted so far apart that the physicians who embody the mandate are the ones being financially punished for it. The new department chair, Lina, has six months to propose a replacement scheme — something that can actually pay Leon for being Leon.

Challenge: Design an internal reward scheme — bounties, profit shares, promotions, recognition — that ties compensation inside a group to outcomes thick enough to represent the group's mandate, without collapsing back into Goodhart-prone metrics. Produce the outcome-bundle specification, the allocation mechanism, and the rules that govern long-horizon and intrinsic-motivation tradeoffs.

Design Choices
  1. Outcome thickness. How do you specify internal outcome bundles rich enough to resist gaming?
  2. Allocation mechanism. What's the role of peer-rated confidence staking vs. manager allocation vs. automated scoring?
  3. Long-horizon fairness. How do long-horizon outcomes get rewarded without punishing people whose good decisions haven't paid off yet?
  4. Motivation crowd-out. How do you avoid creating internal incentive schemes so elaborate they crowd out intrinsic motivation?
18Informal practices that keep purpose aliveGroup / Norms

Scenario. For ten years, a small philosophy department has ended its Friday seminar with a half-hour drink in the common room where nobody is allowed to keep arguing about the paper — you can only talk about what the paper made you think about elsewhere, or something you're stuck on. Junior faculty credit that half-hour with most of what they learned about doing philosophy well. This fall, a new dean has restructured space, and the common room is gone. The department is meeting on Thursday to decide whether to fight for it or let it go. The chair, Paolo, is realizing the fight is not really about the room; it is about a practice that kept the department honest about what it was for.

Challenge: Design a set of informal practices — rituals, conversations, shared habits — through which a group keeps the question of its purpose alive in the register of norms rather than rules. Produce the practices themselves plus the triggers, ownership, and scaling rules that keep them from decaying into compliance theater.

Design Choices
  1. Resisting compliance theater. How do groups institutionalize these practices without reducing them to compliance theater?
  2. Review trigger. What triggers a genuine mandate-review vs. ordinary grievance venting?
  3. Travel under change. How do these norms travel when the group scales, merges, or turns over?
19Deliberative value elicitation for group decisionsGroup / Preferences

Scenario. A thirty-person design firm is deciding whether to take a lucrative contract with a company several team members feel uncomfortable about. The founder, Asha, could call a vote — but she already knows what a vote would look like, and she suspects it would paper over the fact that different people are objecting for very different reasons, some of which she shares and some of which she doesn't. She wants a process that surfaces what each person actually cares about in this decision, not just yes or no, and produces something she could bring back to the team as a shared artifact they'll recognize themselves in, whatever she decides.

Challenge: Design a deliberative elicitation process for a group decision that surfaces what participants value — with the context and tradeoffs behind each position — rather than compressing deliberation into a vote or consensus, and distinguishes articulations reflecting deeper engagement with the domain from idiosyncratic preference. Produce the elicitation procedure, its interface to the actual decision, and the participant-count and value-revision rules.

Design Choices
  1. Capture resistance. How do you resist both populist capture and expert capture?
  2. Decision interface. How should elicitation interface with the decision itself — binding, advisory, or presumptive?
  3. Changing values. How does the process handle values that change through it, which is often the mark of good deliberation working?
  4. Participant count. What's the right participant-count range for a given group's decision — too few and the result isn't representative, too many and deliberation thins?
20Internal resolution of charter-breach disputesGroup / Rights

Scenario. A five-year-old worker cooperative wrote into its founding charter that it would never take on clients from the defense sector. Last quarter the business side, under pressure, signed a contract with a logistics contractor whose revenue turns out to be three-quarters military. Two longtime members, Hiro and Olga, believe this is a straightforward breach of the charter; the board believes the contract is technically outside the prohibition. There is no internal mechanism for adjudicating the claim — the dispute is drifting toward member resignations and a public fight. What Hiro and Olga want is a forum inside the co-op that can actually hear their claim, rule on whether the charter was honored, and bind the board to the result.

Challenge: Design a fidelity-oriented charter and its adjudication procedure — a substantive layer on top of the group's bylaws that articulates the mandate densely enough for members to bring a breach claim, plus the forum that hears such claims and binds the group to the result. Produce the standing rules, the remedy and escalation menu, and the safeguards against factional capture.

Design Choices
  1. Standing. Who has standing to bring a mandate-breach claim inside a group — members, former members, affected non-members?
  2. Remedy and escalation. What's the internal remedy when a claim is upheld — policy change, officer removal, rearticulation of the mandate itself — and when does the dispute escalate outside the group?
  3. Anti-factionalism. How do you prevent the process from becoming a vehicle for ordinary factional disputes — ripeness-style doctrines requiring concrete harm, thresholds for what counts as a mandate specification rigorous enough to adjudicate against, or some other gate?
  4. Legal interface. How does the claim procedure interact with existing employment, partnership, or cooperative law?
21Substantive organizational chartersGroup / Thick Commitments

Scenario. A ten-year-old cohousing community of twenty-four households wrote a charter when it was founded. The charter, which says things like "we raise children together" and "we eat dinner as a community at least twice a week," has lately felt either obsolete or aspirational depending on who you ask. Several new families don't share the original founders' sense of what the place is for; several founders have grown old and quiet. At the fall meeting, a facilitator, Vida, proposes a year-long process to re-articulate the charter — not start over, but deepen and clarify it — with everyone's voice in the room, before the community quietly drifts into being just a nice place to live.

Challenge: Design a "living charter" — a group's founding document structured to carry substantive value content about what the group is for, in terms rich enough to be contested, and revisable as the group learns without becoming a perpetual renegotiation. Produce the split between charter and policy, the procedure for reading the charter against operations, the revision mechanism, and the features that preserve its authority over time.

Design Choices
  1. Charter vs. policy. What goes in the charter vs. in ordinary policy, given that the charter specifies the mandate and the revision procedure while policy implements under it?
  2. Reading against operations. How is the charter read against operations, so it can be pointed at specific decisions to ask "does this serve the mandate?" rather than staying too thin?
  3. Revision mechanism. How is the charter revised — supermajority, jury of founders plus current members, deliberative process — and which failure modes does your choice accept?
  4. Durable authority. How is the charter's authority preserved across leadership turnover, mergers, fundraising pressure, and growth, given how many charters become ceremonial?
22Mandate literacy in professional trainingNational / Expertise

Scenario. A newly appointed regulator at a federal financial agency, Arjun, sits down for his orientation and realizes something odd. Every part of his formal training is about rules — which statutes, which enforcement tools, which precedents. Nothing in it asks him to engage with what the agency is actually for — whose savings it protects, what "fair dealing" was meant to cover, why the statute he is learning to enforce was passed in the first place. The senior colleagues who carry that understanding are due to retire in the next decade. Arjun begins to understand that if the training pipeline doesn't change, the agency will still enforce rules but will lose its ability to recognize when the rules and the purpose have diverged.

Challenge: Design a professional-training regime that builds mandate literacy — the capacity to recognize when an institution's rules have drifted from its purpose — into the pipeline that produces a national-scale institution's practitioners, and specify which programs carry it, how it is taught, and how the state's role relates to profession-set content.

Evaluation. A strong design makes mandate literacy something practitioners can be tested and credentialed on rather than a forgettable box-check, and gives the system a way to surface and replace training programs that have themselves drifted.

Design Choices
  1. Which programs carry it. Which professional training programs should carry mandate-level instruction, and how — following the partial models of law-school jurisprudence and medical-school ethics rotations, or something broader?
  2. Lip-service resistance. How do you prevent mandate-training from being taught as lip service — a box-check that everyone forgets after graduation?
  3. State vs. profession. What role should the state play, given templates like federally funded medical residencies whose content is nonetheless set by professional bodies?
  4. Surfacing drift. How do you surface and replace training programs that have themselves drifted — journalism schools that train for clicks, business schools that train for shareholder-value maximization at the expense of broader mandates?
23State procurement via socialized poolsNational / Incentives

Scenario. A state's department of child welfare is about to issue its next four-year contract for family support services — visits, case management, parenting programs. The last contract rewarded caseload and visit counts. The results, as every caseworker knows and every director knows, have been measured hours and no fewer kids bouncing between foster homes. The new commissioner, Julian, wants to try something different: pool the money, specify outcomes families actually care about — a sibling group kept together, a grandparent reached — and let providers bid on those outcomes, with verification handled by a party that isn't the provider and isn't the state. He needs a procurement mechanism that exists and that his counsel will sign off on.

Challenge: Design an outcome-based, socialized-pool procurement mechanism for a category of national- or state-scale public spending — specifying how outcomes are defined, how providers bid, and how an independent party verifies results — that the buyer's counsel could actually adopt.

Evaluation. A strong design holds up across a heterogeneous demand base and resists incumbent-supplier capture, while keeping pool solvency compatible with public budgeting rules.

Design Choices
  1. Where to apply it. Which categories of national procurement are ripe for outcome-specified replacement — research grant-making, social-services contracting, infrastructure, or others?
  2. Pool federation. How does a national pool interface with state/provincial and local pools?
  3. Institutional form. What institutional form — agency, authority, independent body, public-private corporation — keeps governance aligned to the mandate?
  4. Fiscal treatment. How do transfers and pool solvency interact with public budgeting rules and exposure limits?
24Standards for institutional AI deploymentsNational / Norms

Scenario. A state department of health has adopted an AI triage system for its Medicaid call center. Wait times are down, satisfaction scores are up, and a hearing this month will hold it up as a national model. A case manager, Carla, who has worked those phones for twelve years, has been watching a pattern: callers with complex situations — a mother juggling her child's disability paperwork and her own cancer treatment — get routed through faster but resolved less. The system's metrics call them "resolved." Carla calls them "hung up on." She wants a standard the state will actually apply to deployments like this one: what the system owes people like that mother, not just what it owes the department's dashboard.

Challenge: Design a mandate-alignment standard for AI systems deployed in national-scale institutional roles — caseworker decisions, benefits adjudication, healthcare triage, sentencing recommendations — that tests whether a deployment can be held to the institution's thick purposes rather than legible proxies, and produce the assessment instrument and the body that applies it.

Evaluation. A strong standard surfaces drift even when the system optimizes legible proxies that look like success on a dashboard, and handles the case where the system is locally more accurate than the human practitioners whose presence maintains the capacity to notice drift.

Design Choices
  1. The assessment. What's the equivalent of an environmental-impact assessment for mandate alignment — who performs it, and when?
  2. Accuracy vs. drift-detection. How should institutions handle the case where an AI system is locally more accurate than human practitioners, but the human base is what maintains the capacity to notice drift?
  3. Vendor and model governance. What norms should govern vendor lock-in, model turnover, and training-data provenance for systems embedded in institutional decisions?
  4. Sectoral interface. How do these national norms interact with sectoral regulation (FDA, FCC, banking regulators) already applicable?
25MGE Pilot Design for a Policy DomainNational / Preferences

Scenario. A country is about to hold a national referendum on overhauling its long-term-care policy — who pays, who provides, what happens when someone can no longer stay at home. Polling will capture yes and no. What it won't capture is what Yelena, a home-health aide, has learned over fifteen years: that the families she works with care about different things than either side of the campaign is talking about. They want to be sure their parent is still known by name; they want a single aide who isn't always changing; they want to know that their own turn, when it comes, won't reduce them to a bed-flip statistic. A pilot moral-graph process could capture what the referendum cannot. The minister's office has a month to decide whether to run it.

Challenge: Design a pilot moral-graph-elicitation process that supplements a country's legislative or referendum process for a single policy domain you choose (education, drug scheduling, immigration quotas, land use — whatever best tests the method), specifying selection, elicitation, aggregation, and its interface with existing authority, plus the protocol the minister's office would actually run.

Evaluation. A strong pilot survives the three strongest objections a democratic theorist would raise about substituting an elicited value-structure for the vote, with sketched responses.

Design Choices
  1. Participant selection. Random sample, stratified, or self-selected with correction?
  2. Elicitation procedure. What are you eliciting — pairwise comparisons of outcomes, articulated values and their weights — and how do you handle the fact that participants' values may change through the process?
  3. Aggregation. How is the moral graph turned into a policy recommendation, and what role does the graph structure (e.g., coreness) play?
  4. Interface with existing authority. When the recommendation goes to the legislature, what is its legal status — advisory, presumptive, or binding absent override?
26Shared standards for mandate articulationNational / Protocols

Scenario. A federal ombudsman's office has been asked to evaluate whether three different agencies — the VA, a consumer protection bureau, and a community development fund — are living up to their public commitments. In each case, the ombudsman, Lucia, finds the same problem: the mandates are written at a level of abstraction that makes them impossible to test. "Serve veterans with dignity." "Protect consumers." "Strengthen communities." Each agency has meanwhile invented its own internal vocabulary for what it is actually doing. Lucia realizes she is writing the same memo three times. She wants a shared national standard for how mandates are articulated thickly enough to be tested — not one she has to reinvent for every agency.

Challenge: Design the shared public infrastructure — standards for articulating institutional mandates thickly enough to be contestable, standards for measuring fidelity to them, and channels for practitioner and stakeholder signal — and specify the standards body, its independence, and the revision procedure.

Evaluation. A strong design keeps mandate measurement from collapsing into a new thin-metric Goodhart regime and insulates the infrastructure from political cycles, all without erecting a national ministry of fidelity.

Design Choices
  1. Who builds and maintains it. Industry bodies, independent standards organizations, or something analogous to the Financial Accounting Standards Board?
  2. Revisability. How do mandates stay revisable — who can contest a national specification, and through what procedure?
  3. Anti-Goodhart. How is measurement kept from becoming another thin-metric regime, given that careless "fidelity measurement" degrades into new Goodhart targets?
  4. Independence. How is this infrastructure kept independent of political cycles?
27Drafting Fidelity as Constitutional DoctrineNational / Rights

Scenario. Eight years ago, Liana's father signed up for a social platform whose tagline was "stay close to the people you love." It's where the family group chat lives; it's where photos of the grandkids arrive. Over the past two years, the feed has filled up with strangers, the group-chat notifications have grown quieter, and her father now spends most of his evenings on it alone, watching short videos. Internal documents leaked last month show the product team quietly retired its "friendship-formation" metric in favor of session time. Liana, a retired civil-rights lawyer, wants to bring a claim that the platform has walked away from what it publicly committed to. She cannot find a doctrine that lets her.

Challenge: Draft either a model constitutional amendment (≤200 words) or an interpretive doctrine that establishes "fidelity" — institutions must act in accordance with their thick mandates rather than substitute thin proxies — as a justiciable principle alongside liberty and equality, with a short interpretive commentary.

Evaluation. A strong draft survives the Rawlsian objection by constitutionalizing the form of fidelity rather than any particular thick content, and carries limiting principles potent enough that the doctrine does not swallow all institutional design.

Design Choices
  1. Scope. Does the fidelity principle protect individuals' relationship to institutions, institutions' relationship to their mandates, or both?
  2. Invocability. Who can bring a fidelity claim against a national institution?
  3. Limiting principles. Which combination of limiting doctrines bounds the principle — political-question doctrine (some fidelity questions non-justiciable), subsidiarity (claims resolved at the lowest level), and ripeness (the deviation must be concrete, not speculative)?
  4. The Rawlsian objection. How does the amendment survive the charge that constitutionalizing thick values is illiberal in a pluralist society — by constitutionalizing the form of fidelity rather than its content, or another route?
28Constitutions with substantive value contentNational / Thick Commitments

Scenario. A small democracy in the middle of a constitutional convention has spent three months debating the usual list — executive powers, judicial review, bill of rights. A working group led by a legal scholar, Amara, has proposed a chapter few other constitutions carry: a statement of what the country's major public institutions are for, concrete enough that ordinary people could invoke it if an institution went adrift, and revisable every generation so it doesn't freeze in one era's language. The critics worry it will become an impossible standard. Amara's working group is about to present their draft to the full convention, and needs to show it can bind without petrifying.

Challenge: Design a constitutional chapter that spells out institutional values concretely enough that institutions below the constitutional level can be held to them, and produce both the draft text and the mechanism by which it stays revisable without ordinary amendment.

Evaluation. A strong design binds without petrifying — specific enough that the values don't equivocate, open enough that it doesn't freeze one generation's worldview, and legible across worldviews in a pluralist population.

Design Choices
  1. Level of specification. How much detail is constitutionally stable, given that too vague lets the values mean almost anything and too detailed freezes the constitution into one generation's worldview?
  2. Revision without amendment. How are values revised or deepened without full constitutional amendment — can practices analogous to judicial interpretation of existing rights do this work for new substantive values?
  3. Protected zones. How are protected zones specified at constitutional level — the commitments to leave certain matters deliberately unarticulated?
  4. Pluralist agreement. How do pluralist populations agree on substantive value articulations at all, pursuing broad legibility across worldviews without shared substantive conceptions?

Notes for Facilitators