Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Complete Ecosystem: 11 Models Across 3 Categories

📚 Language Models (5 Models)

🎨 Multimodal Models (5 Models)

🛡️ Safety & Moderation (1 Model, 2 Variants)

🚀 v1.0.1 Release Highlights

Recursive Self-Improvement (RAIS)

  • 94% effectiveness across training examples
  • Models learn from their own work sessions
  • Pattern recognition from real deployments
  • Continuous improvement through zoo-gym framework

Key Improvements

  • 🔒 Security: Fixed API token exposure, added path validation
  • 📚 Documentation: Hierarchical structure, comprehensive guides
  • 🎯 Identity: Clear branding, no base model confusion
  • 🔧 Technical: Multi-format support (MLX, GGUF, SafeTensors)
  • 🌍 Languages: Support for 119 languages (Guard models)

📊 Model Comparison

ModelParametersActiveUse CaseMemory (INT4)
Zen-Nano0.6B0.6BEdge/Mobile0.3GB
Zen-Eco4B4BDesktop/Laptop2GB
Zen-Omni30B30BServer/Cloud15GB
Zen-Coder480B30BCode Generation15GB
Zen-Next80B80BAdvanced Tasks40GB
Zen-Artist7B7BImage Generation3.5GB
Zen-Artist-Edit7B7BImage Editing3.5GB
Zen-Designer-Think235B22BVisual Reasoning11GB
Zen-Designer-Inst235B22BVision-Language11GB
Zen-Scribe2B2BSpeech-to-Text1GB
Zen-Guard-Gen8B8BSafety Generation4GB
Zen-Guard-Stream4B4BReal-time Safety2GB

🔧 Quick Start

Using Transformers

python

from transformers import AutoModelForCausalLM, AutoTokenizer
# Language models
model = AutoModelForCausalLM.from_pretrained("zenlm/zen-eco-4b-instruct")
tokenizer = AutoTokenizer.from_pretrained("zenlm/zen-eco-4b-instruct")
# Multimodal models
from transformers import AutoProcessor, AutoModelForVision2Seq
processor = AutoProcessor.from_pretrained("zenlm/zen-designer-235b-a22b-instruct")
model = AutoModelForVision2Seq.from_pretrained("zenlm/zen-designer-235b-a22b-instruct")
# Safety models
guard = AutoModelForCausalLM.from_pretrained("zenlm/zen-guard-gen-8b")

Using MLX (Apple Silicon)

python

from mlx_lm import load, generate

Using llama.cpp

bash

# Download GGUF from model page
llama-cli -m zen-eco-4b-instruct-q4_k_m.gguf -p "Your prompt here"

🏆 Benchmarks

Language Models

ModelMMLUGSM8KHumanEvalHellaSwag
Zen-Nano45.2%28.1%18.3%72.1%
Zen-Eco51.7%32.4%22.6%76.4%
Zen-Omni68.9%71.2%48.5%85.7%
Zen-Coder71.4%82.7%78.9%87.2%
Zen-Next73.8%86.3%52.1%88.9%

Multimodal Performance

  • Zen-Artist: FID score 12.4, IS 178.2
  • Zen-Designer: VQA 82.3%, TextVQA 78.9%
  • Zen-Scribe: WER 2.8% (LibriSpeech)
  • Zen-Guard: 96.4% accuracy across 119 languages

🌱 Environmental Impact

  • 95% reduction in energy vs 70B models
  • ~1kg CO₂ saved per user monthly
  • Edge deployment reduces data center load
  • Efficient quantization minimizes resource use

🤝 Partnership

Built by Hanzo AI (Techstars-backed) and Zoo Labs Foundation (501(c)(3) non-profit) for open, private, and sustainable AI.

Training Infrastructure

  • Zoo-Gym framework for advanced training
  • Recursive self-improvement system (RAIS)
  • LoRA fine-tuning support
  • Multi-format optimization

📖 Documentation

📈 Adoption

  • 1M+ downloads globally
  • 150+ countries reached
  • 10,000+ developers actively using
  • 500+ production deployments

🔜 Roadmap

  • Q1 2025: Function calling, tool use
  • Q2 2025: Extended context (128K+)
  • Q3 2025: Video understanding
  • Q4 2025: Embodied AI integration

📜 Citation

bibtex

@misc{zen_v1_0_1_2025,
title={Zen AI Model Family v1.0.1: Recursive Self-Improvement at Scale},
author={Hanzo AI and Zoo Labs Foundation},
year={2025},
version={1.0.1},
url={https://huggingface.co/collections/zenlm/zen-family}
}

📄 License

All models released under Apache 2.0 license for maximum openness.


© 2025 • Built with ❤️ by Hanzo AI & Zoo Labs Foundation

Model provider

zenlm

Model tree

Base

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today