The Rakhine Reconstruction Plan as a Payoff-Transformation Model

An interactive game-theory instrument. It does not assume trust — it shows the conditions under which cooperation becomes the rational choice for actors locked in a decades-long bad equilibrium.

Conflict & Resilience Research Institute Canada (CRRIC) · Strategic Incentive Framework · 2026

The conflict persists not because peace is unknown to be better, but because mutual defection is a Nash equilibrium: each actor defects in fear that unilateral cooperation will be exploited. The RRP is a mechanism-design proposal. It introduces a conditional peace dividend, third-party monitoring, and credible penalties that re-shape the payoffs until cooperation dominates — or, more realistically, until cooperation becomes a stable equilibrium that rational actors can coordinate on.

The Instrument

Move the levers. Watch the equilibrium change.

Each actor chooses to cooperate (comply with a monitored, phased RRP) or defect (continue the armed contest). The three RRP levers below transform the payoffs. The matrix, the equilibrium verdict, and the threshold conditions all update live.

Each cell shows (row payoff · col payoff). Gold border = Nash equilibrium. ▸ / ▾ mark each player's best response.

Repeated-game test · Grim-trigger sustainability

Minimum patience δ* =

RRP Levers

b · Reconstruction dividend0.0

Funds, infrastructure, livelihoods, recognition added to cooperation.

p · Defection penalty0.0

Snapback sanctions, fund suspension, exclusion, public attribution.

m · Monitoring & escrow0.00

Third-party verification + escrowed funds cushion the cost of being the lone cooperator.

b + p ≥ T−R

Cooperation beats exploitation when the other cooperates → makes peace an equilibrium

m·c + p ≥ P−S

Cooperation isn't ruinous even if betrayed → makes peace dominant

All Ten Actors — Composite Ledger

A single "RRP intensity" lever raises the cooperation payoff and the cost of holding out across every stakeholder. Watch how many actors flip.

RRP intensity (conditional dividend + penalty + monitoring, combined) 0%

Cooperation Index: 0 / 10 actors for whom cooperation now outranks defection

Actor	Payoff from defection (status quo)	Payoff from cooperation under RRP	Verdict

Reading the regimes

Three destinations, one dial

Prisoner's Dilemma · the war we have

With no dividend, no penalty, no monitoring, defection is a dominant strategy for every actor. Mutual defection is the only equilibrium even though everyone prefers peace. Set all levers to zero to see it.

Stag Hunt · peace becomes possible

Cross the first threshold and cooperation becomes an equilibrium — but so is war. Peace is now achievable but fragile: it requires mutual assurance. This is the honest, realistic target for most actors. Monitoring and repetition do the rest.

Cooperation-dominant · peace is the rational default

Cross both thresholds and cooperation dominates — the lone equilibrium is mutual cooperation. This requires strong guarantees and is realistic only for a subset of actors, but it is the design ceiling the RRP aims at.

Why the Marshall Plan logic holds

Reconstruction for a former adversary is not charity. It is rational when the cost of non-reconstruction — displacement, instability, sanctions, militarization — exceeds the cost of rebuilding. The levers above are simply the formal version of that argument.

Transparency layer · please read

The payoff numbers in this instrument are illustrative ordinal rankings, not empirical measurements. They encode the structure of incentives described in the concept note — the relative ordering of outcomes and the direction in which each RRP lever pushes them — so that the logic of the equilibrium shift can be inspected and contested. They are deliberately symmetric within each dyad to keep the threshold mathematics legible; real actors face asymmetric payoffs, internal principal–agent splits, and incomplete information, all discussed in the accompanying paper.

The instrument's claim is therefore conditional, not predictive: without a reconstruction-linked payoff transformation, Myanmar's actors retain decisive incentives to keep defecting. The RRP gives them a structured reason to test cooperation without requiring blind trust.