0xSero

GLM-4.7-Flash

Deploy Dedicated

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

At a glance

Table

Base model	zai-org/GLM-4.7-Flash
Format	BF16
Total params	30B
Active / token	—
Experts / layer	64
Layers	47
Hidden size	2048
Context	202,752
On-disk size	120 GB

Which variant should I pick?

Table with columns: Variant, Format, Link
Variant	Format	Link
`GLM-4.7-Flash` (this)	BF16	link
`GLM-4.7-Flash-DPO`	DPO	link
`GLM-4.7-Flash-SFT`	SFT	link

License & citation

License inherited from the base model.

bibtex
@misc{lasby2025reap,
  title  = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
  author = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
  year   = {2025}, eprint = {2510.13999}, archivePrefix = {arXiv}
}

Explore FriendliAI today

Get started Talk to an engineer

At a glance

Table

Base model	zai-org/GLM-4.7-Flash
Format	BF16
Total params	30B
Active / token	—
Experts / layer	64
Layers	47
Hidden size	2048
Context	202,752
On-disk size	120 GB

Which variant should I pick?

Table with columns: Variant, Format, Link
Variant	Format	Link
`GLM-4.7-Flash` (this)	BF16	link
`GLM-4.7-Flash-DPO`	DPO	link
`GLM-4.7-Flash-SFT`	SFT	link

License & citation

License inherited from the base model.

bibtex
@misc{lasby2025reap,
  title  = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
  author = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
  year   = {2025}, eprint = {2510.13999}, archivePrefix = {arXiv}
}

GLM-4.7-Flash

Get help setting up a custom Dedicated Endpoints.

README

At a glance

Which variant should I pick?

License & citation

Sponsors

Explore FriendliAI today

README

At a glance

Which variant should I pick?

License & citation

Sponsors