Featured models
All models
534,369 results found
Trending
Model Name
Input
Output
Type
usermma
EvoQuality-mlx-fp16
Fine-tuned
fpadovani
dan-latn-10mb-ppt-Dp-10mb_seed3407
art87able
unstuck-qwen2.5-0.5b-steps
Base
dan-latn-10mb-ppt-Dp-100mb_seed3407
gradients-io-tournaments
tournament-tourn_d1afc9c2c6aec932_20260615-7d54f633-439c-444d-9284-d2c868200d58-5FpdSckw
Adapter
tournament-tourn_d1afc9c2c6aec932_20260615-0b5da922-4435-4ddc-9e64-42dbe9869554-5FUXojny
dan-latn-10mb-ppt-shuff-dyck-100mb_seed3407
isl-latn-100mb-10mb_seed3407
vintage-LLM-340m-v1-base-mlx-4Bit
Quantized
vintage-LLM-340m-v1-base-mlx-8Bit
helixdouble
glm-5.1-fp8-abliterated-research-checkpoint-v3
vintage-LLM-340m-v1-base-mlx-5Bit
vintage-LLM-340m-v1-base-mlx-3Bit
dan-latn-10mb-ppt-shuff-dyck-10mb_seed3407
vintage-LLM-340m-v1-base-mlx-fp16
vintage-LLM-340m-v1-base-mlx-6Bit
vintage-LLM-340m-v1-base-mlx-2Bit
tournament-tourn_d1afc9c2c6aec932_20260615-00555001-025f-4882-9137-c4fda38a3108-5Eh6F11Z
ishikauniphore
3bT-7bS_nemotron_stem_mcot
FastContext-1.0-4B-RL-mlx-fp16
Godwinlyamba
bkm1804
FastContext-1.0-4B-SFT-mlx-6Bit
FastContext-1.0-4B-RL-mlx-3Bit
OctoLong
Qwen3-4B-Instruct
Qwen3-8B-Instruct
Alanpool
GLM-5.1
FastContext-1.0-4B-SFT-mlx-8Bit
FastContext-1.0-4B-SFT-mlx-3Bit
FastContext-1.0-4B-RL-mlx-5Bit
FastContext-1.0-4B-RL-mlx-8Bit
FastContext-1.0-4B-SFT-mlx-2Bit
yw223
Gemma-2-9B-it-Wanda-unstructured_50
tournament-tourn_d1afc9c2c6aec932_20260615-6de6300a-976a-4097-8a69-b4b68283dd02-5HKEAZxF
FastContext-1.0-4B-RL-mlx-6Bit
FastContext-1.0-4B-SFT-mlx-5Bit
FastContext-1.0-4B-RL-mlx-2Bit
cdli
whisper-small_finetuned_ugandan_english_nonstandard_speech_v1.0
Upcycle-AI
Codeus-7B-Pre-Alpha
Merged
Qwen3-1.7B-Instruct
Qwen3-0.6B-Instruct
MiroThinker-1.7-mlx-fp16
FastContext-1.0-4B-RL-mlx-4Bit