Inference with gRPC
Run gRPC inference server with Friendli Container and interact with it through friendli SDK.
This guide will walk you through how to run gRPC inference server with Friendli Container and interact with it through friendli
SDK.
Prerequisites
Install friendli
to use gRPC client SDK:
Ensure you have the friendli
SDK version 1.4.1
or higher installed.
Starting the Friendli Container with gRPC
Running the Friendli Container with a gRPC server for completions is available by adding the --grpc true
option to the command argument.
This supports response-streaming gRPC, and you can send requests using our friendli
SDK.
To start the Friendli Container with gRPC support, use the following command:
You can change the port of the server with --web-server-port
argument.
Sending Requests with the Client SDK
Here is how to use the friendli
SDK to interact with the gRPC server.
This example assumes that the gRPC server is running on 0.0.0.0:8000
.
Properly Closing the Client
By default, the library closes underlying HTTP and gRPC connections when the client
is garbage-collected.
You can manually close the Friendli
or AsyncFriendli
client using the .close()
method or utilize a context manager to ensure proper closure when exiting a with
block.