• Pricing
  • Blog
Log inTalk to an expertGet started

Running real inference at scale? Apply for our limited $10K credit program — Find out more

HIGHLIGHTS
  • Jan 22, 2025
  • 3 min read

Deploy Models from Hugging Face to Friendli Endpoints

Read full article
Deploy Models from Hugging Face to Friendli Endpoints thumbnail

Introducing N-gram Speculative Decoding: Faster Inference for Structured Tasks thumbnail
  • August 8, 2025
  • 2 min read

Introducing N-gram Speculative Decoding: Faster Inference for Structured Tasks

Speculative Decoding
Inference
Dedicated Endpoints
WBA: The Community-Driven Platform for Blind Testing the World’s Best AI Models thumbnail
  • August 6, 2025
  • 2 min read

WBA: The Community-Driven Platform for Blind Testing the World’s Best AI Models

WBA
AI Comparison
Model Evaluation
Announcing Online Quantization: Faster, Cheaper Inference with Same Accuracy thumbnail
  • July 25, 2025
  • 2 min read

Announcing Online Quantization: Faster, Cheaper Inference with Same Accuracy

Quantization
Inference
Dedicated Endpoints
LG AI Research Partners with FriendliAI to Launch EXAONE 4.0 for Fast, Scalable API thumbnail
  • July 15, 2025
  • 2 min read

LG AI Research Partners with FriendliAI to Launch EXAONE 4.0 for Fast, Scalable API

LG AI Research
EXAONE 4.0
Partnership
The Essential Checklist: Fix 6 Common Errors When Sharing Models on Hugging Face thumbnail
  • July 1, 2025
  • 5 min read

The Essential Checklist: Fix 6 Common Errors When Sharing Models on Hugging Face

Hugging Face
Models
One Click from W&B to FriendliAI: Deploy Models as Live Endpoints thumbnail
  • June 5, 2025
  • 3 min read

One Click from W&B to FriendliAI: Deploy Models as Live Endpoints

Weights & Biases
W&B
AI DevOps
Cut Latency for Image & Video AI Models : A guide to Multimodal Caching thumbnail
  • May 15, 2025
  • 4 min read

Cut Latency for Image & Video AI Models : A guide to Multimodal Caching

Multimodal
Inference
Optimization
Explore 370K+ AI Models on FriendliAI's Models Page thumbnail
  • May 14, 2025
  • 3 min read

Explore 370K+ AI Models on FriendliAI's Models Page

Multimodal Models
Model Deployment
Hugging Face Integration
How to Use Hugging Face Multi-LoRA Adapters thumbnail
  • May 2, 2025
  • 2 min read

How to Use Hugging Face Multi-LoRA Adapters

Multi-LoRa
LoRA Adapter
Hugging Face LoRA

Products

Friendli Dedicated EndpointsFriendli Serverless EndpointsFriendli Container

Solutions

Inference
Pricing
Blog

Resources

ModelsDocsUse Cases

Company

AboutNewsCareersResearchPatentsBrand ResourcesContact

Contact us:

contact@friendli.ai

FriendliAI Corp:

3 E 3rd Ave #302,
San Mateo, CA 94401

Hub:

5F AMC Tower, 222 Bongeunsa-ro,
Gangnam-gu, Seoul, 06135, Korea

Privacy Policy

Service Level Agreement

Terms of Service

CA Notice

Copyright © 2025 FriendliAI Inc. All rights reserved