Skip to content

Configuration

The provided presets should work well most of the time. However, every model is different, and the settings that work well for one model may not produce good results with another.

“Reasoning Effort” is used to indicate how much “effort” the model allocates to thinking. The higher the value, the more tokens the model will consume as it thinks about how to respond.

For now, reasoning is only supported in Chat.

Reasoning can improve dramatically improve response quality, especially for complex requests. Higher reasoning effort improves the model’s ability to devise interesting plot twists or incorporate information from the Lorebook.

We recommend leaving reasoning enabled in chat. We find the default setting of “Medium” works well with both DeepSeek and Claude Sonnet.

Reasoning tokens count towards the response limit. With a short response length, the model may consume most or all of its token budget on reasoning. For this reason, we recommend leaving the token limit disabled in chat.