Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: mit

Model Overview

FieldValue
Model nameMacaron-V1-Preview-744B-Merged
OrganizationMindLab Research
Base modelGLM-5.1
ArchitectureMerged full checkpoint
Parameter footprint744B-class
Post-training systemMinT
Primary domainPersonal agents, tool-use agents, Generative UI
Release typePreview
Checkpoint formatFull checkpoint at repository root
Context length202,752 tokens, from config.json / tokenizer_config.json
Precisionbfloat16, from config.json
LicenseMIT

Loading

Install minimal loading dependencies:

bash

pip install -U transformers accelerate safetensors

Example:

python

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
repo_id = "mindlab-research/Macaron-V1-Preview-744B-Merged"
tokenizer = AutoTokenizer.from_pretrained(
repo_id,
trust_remote_code=True,
)
model = AutoModelForCausalLM.from_pretrained(
repo_id,
torch_dtype=torch.bfloat16,
device_map="auto",
trust_remote_code=True,
)
model.eval()

Evaluation

For benchmark context and full Macaron-V1-Preview release notes, see the release blog and the routed 749B model card. This merged checkpoint is not equivalent to the full routed Mixture-of-LoRA serving stack, so behavior may differ from the 749B release.

License

Macaron-V1-Preview-744B-Merged is released under the MIT License. Users should also respect any requirements inherited from GLM-5.1 and from dependencies used in deployment.

Citation

bibtex

@misc{mindlab2026macaronv1preview,
author = {{Mind Lab}},
title = {Macaron-V1-Preview: 749B MoL Agent Model post-trained from GLM5.1},
year = {2026},
howpublished = {Mind Lab: A Lab for Experiential Intelligence},
note = {https://macaron.im/mindlab/research/macaron-v1-preview}
}

Contact

Model provider

mindlab-research

Model tree

Base

zai-org/GLM-5.1

Fine-tuned

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today