Polymath

Written by Cervin Ventures | Jun 1, 2026 5:17:01 PM

Polymath is an applied research lab building simulated environments to train and evaluate long-horizon AI agents. The lab develops benchmarks and training infrastructure aimed at making agents more reliable and autonomous across extended task sequences with minimal human supervision. Polymath launched Horizon-SWE, a benchmark that places frontier AI models in a simulated software development environment equipped with real tools and end-to-end tasks that span the software development lifecycle: planning, coding, testing, deployment, and monitoring.