Analysis of Newcomb's Paradox and AI Decision-Making in a Computerphile Discussion

14 Jan 2026

Newcomb's Paradoxgame theorydecision theoryartificial intelligenceAI modelsautonomous agentsneural networksComputerphilephilosophy of choicepredictive behavior

The video presents a discussion on Newcomb's Paradox (or Newcomb's Problem), a dilemma in game theory and decision-making. Two participants, including Sean (interviewed by Computerphile), explain the scenario: a player must choose between taking a single opaque box (containing either $50 or nothing) or two boxes (the opaque box plus a transparent box with $5). A predictor, having anticipated the player's choice, fills the opaque box accordingly. Sean demonstrates the paradox by acting as the predictor and then tests the behavior of AI models (Claude 3.7 Sonnet and Claude 3.5 Haiku) when faced with this problem. Claude 3.7 predicts that Claude 3.5 will choose the single box, which the latter confirms, justifying its decision based on logic aimed at maximizing expected value in response to the prediction. The discussion connects this paradox to AI challenges, particularly interactions between autonomous agents and decision-making biases in neural network-based systems. The video also explores concepts like decision theory and dynamics between AI copies.