Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Run this model inference with full control and performance in your environment.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: mitModel Overview
| Field | Value |
|---|---|
| Model name | Macaron-V1-Preview-744B-Merged |
| Organization | MindLab Research |
| Base model | GLM-5.1 |
| Architecture | Merged full checkpoint |
| Parameter footprint | 744B-class |
| Post-training system | MinT |
| Primary domain | Personal agents, tool-use agents, Generative UI |
| Release type | Preview |
| Checkpoint format | Full checkpoint at repository root |
| Context length | 202,752 tokens, from config.json / tokenizer_config.json |
| Precision | bfloat16, from config.json |
| License | MIT |
Loading
Install minimal loading dependencies:
bash
pip install -U transformers accelerate safetensors
Example:
python
import torchfrom transformers import AutoModelForCausalLM, AutoTokenizerrepo_id = "mindlab-research/Macaron-V1-Preview-744B-Merged"tokenizer = AutoTokenizer.from_pretrained(repo_id,trust_remote_code=True,)model = AutoModelForCausalLM.from_pretrained(repo_id,torch_dtype=torch.bfloat16,device_map="auto",trust_remote_code=True,)model.eval()
Evaluation
For benchmark context and full Macaron-V1-Preview release notes, see the release blog and the routed 749B model card. This merged checkpoint is not equivalent to the full routed Mixture-of-LoRA serving stack, so behavior may differ from the 749B release.
License
Macaron-V1-Preview-744B-Merged is released under the MIT License. Users should also respect any requirements inherited from GLM-5.1 and from dependencies used in deployment.
Citation
bibtex
@misc{mindlab2026macaronv1preview,author = {{Mind Lab}},title = {Macaron-V1-Preview: 749B MoL Agent Model post-trained from GLM5.1},year = {2026},howpublished = {Mind Lab: A Lab for Experiential Intelligence},note = {https://macaron.im/mindlab/research/macaron-v1-preview}}
Contact
- Organization: MindLab Research
- Project: Macaron
- Release blog: macaron.im/mindlab/research/macaron-v1-preview
Model provider
mindlab-research
Model tree
Base
zai-org/GLM-5.1
Fine-tuned
this model
Modalities
Input
Text
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information