New models are frequently evaluated and released as part of the Serverless Inference catalog. Older models are regularly reviewed and may be retired to prioritize the most in-demand and highest-quality offerings.

Model lifecycle stages

Serverless Inference uses the following terms for the stages of a model's lifecycle:
  • Experimental: The model is available for initial experimentation, but is not bound by our standard SLA or deprecation process.
  • Generally available: The model is fully supported and is recommended for use.
  • Deprecated: The model is scheduled for removal. Customers should update to a recommended replacement as soon as possible.
  • Retired: The model is no longer available. All requests to a retired model fail with an HTTP 404 status code.
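As a rough illustration of how client code might reason about these stages, the sketch below models them as a Python enum. The names `Stage`, `serves_requests`, and `should_migrate` are hypothetical helpers introduced here for illustration; they are not part of any W&B SDK or API.

```python
from enum import Enum


class Stage(Enum):
    """Lifecycle stages as described in the list above (hypothetical type)."""
    EXPERIMENTAL = "experimental"
    GENERALLY_AVAILABLE = "generally available"
    DEPRECATED = "deprecated"
    RETIRED = "retired"


def serves_requests(stage: Stage) -> bool:
    """Only retired models reject requests (with HTTP 404, per this page)."""
    return stage is not Stage.RETIRED


def should_migrate(stage: Stage) -> bool:
    """Deprecated models are scheduled for removal; retired ones are already gone."""
    return stage in (Stage.DEPRECATED, Stage.RETIRED)


# A deprecated model still answers requests, but callers should plan a move.
print(serves_requests(Stage.DEPRECATED), should_migrate(Stage.DEPRECATED))
```

A check like `should_migrate` could run at application startup so that a deprecation surfaces as a warning well before requests begin to fail.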

Deprecation process

When deprecating a generally available model, W&B provides at least two weeks of notice on this page before the model is retired. W&B also attempts to notify recent users of the deprecated model by email and to recommend replacement models. Deprecated models continue to serve requests until they are retired.
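The two-week minimum can be checked with simple date arithmetic. The sketch below assumes that the deprecation date listed on this page marks the start of the notice period, which the page does not state explicitly; the helper names are hypothetical. The example dates are those listed for zai-org/GLM-5-FP8 on this page.

```python
from datetime import date, timedelta

# Minimum notice stated in the deprecation process.
MIN_NOTICE = timedelta(weeks=2)


def notice_period(deprecation_date: date, retirement_date: date) -> timedelta:
    """Time between the deprecation notice and retirement.

    Assumption: the listed deprecation date is when notice was posted.
    """
    return retirement_date - deprecation_date


def meets_minimum_notice(deprecation_date: date, retirement_date: date) -> bool:
    """True when the notice period is at least two weeks."""
    return notice_period(deprecation_date, retirement_date) >= MIN_NOTICE


# zai-org/GLM-5-FP8: deprecated 2026-04-24, retired 2026-05-11.
glm5_notice = notice_period(date(2026, 4, 24), date(2026, 5, 11))
print(glm5_notice.days)  # 17
```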

Deprecated models

The following Serverless Inference models are deprecated:
| Deprecation date | Retirement date | Model ID | Recommended replacement |
|---|---|---|---|
| 2026-04-24 | 2026-05-11 | zai-org/GLM-5-FP8 | zai-org/GLM-5.1 |

Retired models

The following Serverless Inference models are retired:
| Retirement date | Model ID | Recommended replacement |
|---|---|---|
| 2026-04-21 | meta-llama/Llama-4-Scout-17B-16E-Instruct | Qwen/Qwen3.5-35B-A3B |
| 2026-03-04 | deepseek-ai/DeepSeek-R1-0528 | |
| 2026-03-04 | deepseek-ai/DeepSeek-V3-0324 | |
| 2026-03-04 | moonshotai/Kimi-K2-Instruct | moonshotai/Kimi-K2.5 |
| 2026-03-04 | moonshotai/Kimi-K2-Instruct-0905 | moonshotai/Kimi-K2.5 |
| 2026-03-04 | Qwen/Qwen2.5-14B-Instruct | OpenPipe/Qwen3-14B-Instruct |
| 2026-03-04 | zai-org/GLM-4.5 | zai-org/GLM-5-FP8 |
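Some recommended replacements have themselves been deprecated since they were listed: zai-org/GLM-4.5 points to zai-org/GLM-5-FP8, which in turn points to zai-org/GLM-5.1. The sketch below shows one way client code might follow such a chain. The mapping is copied from the two tables on this page; the `REPLACEMENTS` name and the `resolve_replacement` helper are hypothetical and not part of any W&B SDK.

```python
from typing import Optional

# Copied from the deprecated and retired tables on this page.
# None means the page lists no recommended replacement.
REPLACEMENTS: dict[str, Optional[str]] = {
    "zai-org/GLM-5-FP8": "zai-org/GLM-5.1",
    "meta-llama/Llama-4-Scout-17B-16E-Instruct": "Qwen/Qwen3.5-35B-A3B",
    "deepseek-ai/DeepSeek-R1-0528": None,
    "deepseek-ai/DeepSeek-V3-0324": None,
    "moonshotai/Kimi-K2-Instruct": "moonshotai/Kimi-K2.5",
    "moonshotai/Kimi-K2-Instruct-0905": "moonshotai/Kimi-K2.5",
    "Qwen/Qwen2.5-14B-Instruct": "OpenPipe/Qwen3-14B-Instruct",
    "zai-org/GLM-4.5": "zai-org/GLM-5-FP8",
}


def resolve_replacement(model_id: str) -> Optional[str]:
    """Follow replacements until reaching a model that is not listed.

    Returns model_id unchanged if it is neither deprecated nor retired,
    and None if the chain ends at a model with no listed replacement.
    """
    seen: set[str] = set()
    current = model_id
    while current in REPLACEMENTS:
        if current in seen:
            raise ValueError(f"replacement cycle detected at {current}")
        seen.add(current)
        successor = REPLACEMENTS[current]
        if successor is None:
            return None
        current = successor
    return current


print(resolve_replacement("zai-org/GLM-4.5"))  # zai-org/GLM-5.1
```

Because this page changes as models move through the lifecycle, a hard-coded mapping like this one goes stale; treat it as a snapshot rather than a source of truth.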