Digital Twins as Agent Sandboxes: Testing AI Interventions Before Deployment