SearchSwarm

SearchSwarm-30B-A3B

Deploy Dedicated

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: mit

Overview

SearchSwarm focuses on several key aspects of agentic intelligence:

Subagents as Context Management: Subagents work in independent contexts and return compact, evidence-grounded reports, preventing the main agent's context window from being overwhelmed.
Delegation Intelligence: The model is trained to determine when and what to delegate, how to brief subagents effectively, and how to verify their findings.
Harness-guided Synthesis: The authors used a specialized harness to synthesize high-quality SFT data to internalize these delegation capabilities into the model weights.

Performance

SearchSwarm-30B-A3B achieves state-of-the-art results among comparable 30B-A3B open-source lightweight research agents on deep research benchmarks:

Table
Benchmark	Score
BrowseComp	68.1
BrowseComp-ZH	73.3

Citation

bibtex
@article{ning2026searchswarm,
  title={SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research},
  author={Ning, Pu and Chen, Quan and Tao, Kun and Tang, Xinyu and Wang, Tianshu and Cao, Qianggang and Kong, Xinyu and Wen, Zujie and Zhang, Zhiqiang and Zhou, Jun},
  journal={arXiv preprint arXiv:2606.09730},
  year={2026}
}