Polymath

Back

Visit Polymath

Sector

Infrastructure

Investment Stage

Seed Funding

Year Founded

2025

Polymath is an applied research lab building simulated environments to train and evaluate long-horizon AI agents. The lab develops benchmarks and training infrastructure aimed at increasing agent reliability and autonomy across extended task sequences with minimal human supervision. Polymath launched Horizon-SWE, a benchmark that places frontier AI models into a simulated software development environment with real tools and end-to-end software development lifecycle tasks spanning planning, coding, testing, deployment, and monitoring.