Running real inference at scale? Apply for our limited $10K credit program — Find out more
cURL
curl --request POST \ --url http://localhost:8000/v1/tokenize \ --header 'Content-Type: application/json' \ --data '{ "prompt": "What is generative AI?" }'
{ "tokens": [ 128000, 3923, 374, 1803, 1413, 15592, 30 ] }
By giving a text input, generate a tokenized output of token IDs.
Input text prompt to tokenize.
"What is generative AI?"
Routes the request to a specific adapter.
"(adapter-route)"
Successfully tokenized the text.
A list of token IDs.
Was this page helpful?