Sector
Infrastructure
Investment Stage
Seed Funding
Year Founded
2025

Polymath is an applied research lab building simulated environments to train and evaluate long-horizon AI agents. The lab develops benchmarks and training infrastructure aimed at increasing agent reliability and autonomy across extended task sequences with minimal human supervision. Polymath launched Horizon-SWE, a benchmark that places frontier AI models into a simulated software development environment with real tools and end-to-end software development lifecycle tasks spanning planning, coding, testing, deployment, and monitoring.