September, 2025

Sep 4
Serveless Endpoints

Model Deprecations

  • K-intelligence/Midm-2.0-Mini-Instruct
Sep 1
Dedicated Endpoints

B200 Hardware Support

August, 2025

Aug 22
Serveless Endpoints

New built-in integrations w/ Linkup

Aug 19
Dedicated Endpoints

New auto-scaling type Request count added

  • Enterprise plan only
Aug 8
Serveless EndpointsDedicated Endpoints

Increased output token limits for reasoning models on Serverless endpoints


New endpoint feature N-GRAM speculative decoding

For predictable tasks, this can deliver substantial performance gains.
Aug 1
Serveless Endpoints

Model releases

  • Qwen/Qwen3-235B-A22B-Thinking-2507
  • Qwen/Qwen3-235B-A22B-Instruct-2507
  • skt/A.X-4.0
  • skt/A.X-3.1
  • naver-hyperclovax/HyperCLOVAX-SEED-Think-14B

July, 2025

Jul 25
Dedicated Endpoints

New endpoint feature Online quantization

Quantize your model endpoint without any preparations and accelerate inference
Jul 14
Serveless Endpoints

Model releases

Jul 11
Serveless Endpoints

Model releases

  • deepseek-ai/DeepSeek-R1-0528