Thousands tune into a livestream of Claude 3.7 Sonnet navigating the game on its own better than its predecessor. Anthropic tells PCMag it's a more effective way of measuring AI progress.