Overseas access: www.kdjingpai.com
Bookmark Us
Current Position:fig. beginning " AI Answers

How to use the Inference Budget Control feature in Seed-OSS?

2025-08-23 426
Link directMobile View
qrcode

Seed-OSS provides a flexible inference budget control function, which allows users to dynamically adjust the inference length through the thinking_budget parameter to balance the inference speed and depth. The specific method of using this function is as follows:

  • simple task(as translated): set thinking_budget=128.
  • Medium complexity tasks(e.g. regular Q&A): suggests thinking_budget=512.
  • complex mission(e.g., mathematical reasoning or code generation): set thinking_budget=1024.

This parameter can be set directly in the generating script, e.g. in Python code:

tokenized_chat = tokenizer.apply_chat_template(
messages,
tokenize=True,
add_generation_prompt=True,
return_tensors="pt",
thinking_budget=1024
)

By adjusting this parameter, the user can optimize the model's reasoning efficiency and effectiveness according to the actual task requirements.

Recommended

Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

Top