SearchSwarm
SearchSwarm-30B-A3B
Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: mitOverview
SearchSwarm focuses on several key aspects of agentic intelligence:
- Subagents as Context Management: Subagents work in independent contexts and return compact, evidence-grounded reports, preventing the main agent's context window from being overwhelmed.
- Delegation Intelligence: The model is trained to determine when and what to delegate, how to brief subagents effectively, and how to verify their findings.
- Harness-guided Synthesis: The authors used a specialized harness to synthesize high-quality SFT data to internalize these delegation capabilities into the model weights.
Performance
SearchSwarm-30B-A3B achieves state-of-the-art results among comparable 30B-A3B open-source lightweight research agents on deep research benchmarks:
| Benchmark | Score |
|---|---|
| BrowseComp | 68.1 |
| BrowseComp-ZH | 73.3 |
Citation
bibtex
@article{ning2026searchswarm,title={SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research},author={Ning, Pu and Chen, Quan and Tao, Kun and Tang, Xinyu and Wang, Tianshu and Cao, Qianggang and Kong, Xinyu and Wen, Zujie and Zhang, Zhiqiang and Zhou, Jun},journal={arXiv preprint arXiv:2606.09730},year={2026}}
Acknowledgements
This project builds on open-source infrastructure including vLLM, ms-swift, Megatron-LM, Qwen-Agent, Serper, and Jina.
Model provider
SearchSwarm
Model tree
Base
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
Fine-tuned
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information