⚡ Hit your SLA, cut costs. Download the Friendli Guide to Inference Performance Optimization ➜
Product
Solutions
Developers
Company
Featured models
All models
571,254 results found
Trending
Model Name
Input
Output
Type
FrenzyMath
Herald_translator
Base
Deploy
GuilhermeNaturaUmana
Nature-Reason-1
Fine-tuned
neuralmagic
DeepSeek-R1-Distill-Qwen-14B-quantized.w4a16
Quantized
DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8
DeepSeek-R1-Distill-Llama-8B-FP8-dynamic
Spestly
Atlas-Pro-1.5B-Preview
UtkarshRishi
ArcMind
Alepach
notHumpback-M1
Na0s
Llama-3.2-3B-Instruct-Medical-Chatbot-LoRA-FT
Llama-3.2-3B-Medical-Chatbot-LoRA-FT
sapienzanlp
Minerva-7B-instruct-v1.0
ixxan
whisper-small-uyghur-common-voice
whisper-small-common-voice-ug
infly
OpenCoder-8B-Base
aisingapore
gemma2-9b-cpt-sea-lionv3-base
lovis93
testllm
ProdeusUnity
Celestial-Harmony-14b-v1.0-Experimental-1015
Merged
avankumar
llama_NER_battery_KG
huihui-ai
Qwen2.5-Coder-7B-Instruct-abliterated
rishabbahal
whisper-small-nigerian-accent
wassname
llama-3-2-1b-sft
mlx-community
Llama-3.2-3B-Instruct-4bit
taoki
Qwen2.5-Coder-7B-Instruct_lora_jmultiwoz-dolly-amenokaku-alpaca_jp_python
mamba-370m-hf-f16
mamba-130m-hf-f32
meta-llama
Llama-Guard-3-11B-Vision
SHASWATSINGH3101
Qwen2-0.5B-Instruct_lora_code
Henrychur
MMedS-Llama-3-8B
CameronRedmore
mistral-nemo-gutenberg-12B-v4-exl2
Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-DPO
Llama-3.1-8B-Pruned-4-Layers_LoRA-PEFT-3.0
SeaLLMs
SeaLLMs-v3-7B
Meta-Llama-3.1-70B-Instruct-quantized.w8a16
Meta-Llama-3.1-8B-Instruct-quantized.w8a16
h2oai
h2o-danube3-4b-base
wanghaikuan
Qwen1.5-0.5B_merge_v2.2
01-ai
Yi-1.5-6B-Chat
Gryphe
Pantheon-RP-1.0-8b-Llama-3
aeonium
Aeonium-v0-Base-1B
nvidia
Llama3-ChatQA-1.5-8B
unsloth
llama-3-8b
Samsung
BigTranslateSlotTranslator