The Rakhine Reconstruction Plan as a Payoff-Transformation Model
An interactive game-theory instrument. It does not assume trust — it shows the conditions under which cooperation becomes the rational choice for actors locked in a decades-long bad equilibrium.
The conflict persists not because peace is unknown to be better, but because mutual defection is a Nash equilibrium: each actor defects in fear that unilateral cooperation will be exploited. The RRP is a mechanism-design proposal. It introduces a conditional peace dividend, third-party monitoring, and credible penalties that re-shape the payoffs until cooperation dominates — or, more realistically, until cooperation becomes a stable equilibrium that rational actors can coordinate on.
Move the levers. Watch the equilibrium change.
Each actor chooses to cooperate (comply with a monitored, phased RRP) or defect (continue the armed contest). The three RRP levers below transform the payoffs. The matrix, the equilibrium verdict, and the threshold conditions all update live.
RRP Levers
Funds, infrastructure, livelihoods, recognition added to cooperation.
Snapback sanctions, fund suspension, exclusion, public attribution.
Third-party verification + escrowed funds cushion the cost of being the lone cooperator.
| Actor | Payoff from defection (status quo) | Payoff from cooperation under RRP | Verdict |
|---|
Three destinations, one dial
Prisoner's Dilemma · the war we have
With no dividend, no penalty, no monitoring, defection is a dominant strategy for every actor. Mutual defection is the only equilibrium even though everyone prefers peace. Set all levers to zero to see it.
Stag Hunt · peace becomes possible
Cross the first threshold and cooperation becomes an equilibrium — but so is war. Peace is now achievable but fragile: it requires mutual assurance. This is the honest, realistic target for most actors. Monitoring and repetition do the rest.
Cooperation-dominant · peace is the rational default
Cross both thresholds and cooperation dominates — the lone equilibrium is mutual cooperation. This requires strong guarantees and is realistic only for a subset of actors, but it is the design ceiling the RRP aims at.
Why the Marshall Plan logic holds
Reconstruction for a former adversary is not charity. It is rational when the cost of non-reconstruction — displacement, instability, sanctions, militarization — exceeds the cost of rebuilding. The levers above are simply the formal version of that argument.
The payoff numbers in this instrument are illustrative ordinal rankings, not empirical measurements. They encode the structure of incentives described in the concept note — the relative ordering of outcomes and the direction in which each RRP lever pushes them — so that the logic of the equilibrium shift can be inspected and contested. They are deliberately symmetric within each dyad to keep the threshold mathematics legible; real actors face asymmetric payoffs, internal principal–agent splits, and incomplete information, all discussed in the accompanying paper.
The instrument's claim is therefore conditional, not predictive: without a reconstruction-linked payoff transformation, Myanmar's actors retain decisive incentives to keep defecting. The RRP gives them a structured reason to test cooperation without requiring blind trust.
