Training and deploying visual agents at scale