Automate generative AI serving
with just a few clicks
How it works
on Friendli Dedicated Endpoints.
Create a deployment
From the Friendli web, you can create new deployments. Each deployment will handle the inference of an AI model.
Select your model
You can either upload your checkpoint or choose any of the models provided by FriendliAI.
Configure cloud resources
Friendli Dedicated Endpoints provides multiple virtual machine types across multiple regions. Select a virtual machine type to continue.
Interact with your AI model
Go to the interactive playground to test your AI model live.
Monitor your deployments
Friendli Dedicated Endpoints monitors your deployments automatically. Look at how your AI model is performing with our supercharged engine.