⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
534,361 results found
Trending
Model Name
Input
Output
Type
dementor-research
sft_oasst1_qwen3-4b_as_gpt-oss-20b_seed1
Adapter
Deploy
usermma
ShellWhisperer-1.5B-mlx-fp16
Fine-tuned
sft_oasst1_qwen3-4b_as_nemotron-nano-30b-a3b_seed1
ShellWhisperer-1.5B-mlx-2Bit
Quantized
ShellWhisperer-1.5B-mlx-4Bit
pro-bunny
DeepSeek-R1-Distill-Llama-8B-openvino
nakue
SmolLM2-1.7B-W8A8-instruct
ShellWhisperer-1.5B-mlx-8Bit
Nemotron-Terminal-8B-openvino
ShellWhisperer-1.5B-mlx-6Bit
ShellWhisperer-1.5B-mlx-5Bit
ShellWhisperer-1.5B-mlx-3Bit
DeepSeek-R1-Distill-Llama-8B-openvino-4bit
jkim96
Llama-3.3-70B-Instruct-DASHQ-INT2-g32
Llama-3.1-70B-Instruct-DASHQ-INT2-g32
attashe
Bernini-MLLM-Qwen2.5-VL-7B
fpadovani
dan-latn-100mb-100mb_seed3407
Baguettotron-mlx-fp16
Baguettotron-mlx-8Bit
Baguettotron-mlx-5Bit
Baguettotron-mlx-3Bit
Baguettotron-mlx-6Bit
Baguettotron-mlx-4Bit
cjiao
goldengoose-divsweep_goose_n128_grouporc_tau0.10-25grp
Baguettotron-mlx-2Bit
paumkim
zomi-qlora-v1
Base
elfein
gemma-3-1b-pt-MED_CPT-Instruct
Aziz2010
qwen2-5-1-5b-alpaca-indonesian
gradients-io-tournaments
tournament-tourn_d1afc9c2c6aec932_20260615-0b5da922-4435-4ddc-9e64-42dbe9869554-5DS6XMVr
EvoQuality-mlx-3Bit
build-small-hackathon
robe-iniesta-lora
kabesaml
OctoLong
OctoLong-0.6B-Instruct
OctoLong-8B-Instruct
OctoLong-1.7B-Instruct
OctoLong-4B-Instruct
OctoLong-4B-Base-Merged
vintage-LLM-340m-v1-base-mlx-bf16
OctoLong-0.6B-Base-Merged
OctoLong-1.7B-Base-Merged
vintage-LLM-340m-v1-base-mlx-fp32
OctoLong-14B-Base-Merged