Skip to content

Hugging Face (inference)

Hugging Face Inference Providers 提供透過單一路由器 API 相容 OpenAI 的聊天完成功能。您可以使用一個 Token 存取多種模型(DeepSeek、Llama 等)。OpenClaw 使用 OpenAI 相容端點(僅限聊天完成);若要進行文字轉圖片、嵌入或語音功能,請直接使用 HF inference clients

  • 提供者:huggingface
  • 驗證:HUGGINGFACE_HUB_TOKENHF_TOKEN(具備 Make calls to Inference Providers 權限的細粒度 Token)
  • API:OpenAI 相容 (https://router.huggingface.co/v1)
  • 計費:單一 HF Token;價格 遵循提供者費率,並提供免費層級。
  1. 建立細粒度 Token

    前往 Hugging Face Settings Tokens 並建立一個新的細粒度 Token。

  2. 執行引導設定

    在提供者下拉選單中選擇 Hugging Face,然後在提示時輸入您的 API 金鑰:

    Terminal window
    openclaw onboard --auth-choice huggingface-api-key
  3. 選擇預設模型

    Default Hugging Face model 下拉選單中,選取您想要的模型。當您擁有有效的 Token 時,清單會從 Inference API 載入;否則會顯示內建清單。您的選擇會被儲存為預設模型。

    您也可以稍後在設定中設定或變更預設模型:

    {
    agents: {
    defaults: {
    model: { primary: "huggingface/deepseek-ai/DeepSeek-R1" },
    },
    },
    }
  4. 驗證模型是否可用

    Terminal window
    openclaw models list --provider huggingface
Terminal window
openclaw onboard --non-interactive \
--mode local \
--auth-choice huggingface-api-key \
--huggingface-api-key "$HF_TOKEN"

這會將 huggingface/deepseek-ai/DeepSeek-R1 設定為預設模型。

模型參照使用 huggingface/<org>/<model> 格式(Hub 風格 ID)。以下清單來自 GET https://router.huggingface.co/v1/models;您的目錄可能包含更多內容。

模型Ref(前綴為 huggingface/
DeepSeek R1deepseek-ai/DeepSeek-R1
DeepSeek V3.2deepseek-ai/DeepSeek-V3.2
Qwen3 8BQwen/Qwen3-8B
Qwen2.5 7B InstructQwen/Qwen2.5-7B-Instruct
Qwen3 32BQwen/Qwen3-32B
Llama 3.3 70B Instructmeta-llama/Llama-3.3-70B-Instruct
Llama 3.1 8B Instructmeta-llama/Llama-3.1-8B-Instruct
GPT-OSS 120Bopenai/gpt-oss-120b
GLM 4.7zai-org/GLM-4.7
Kimi K2.5moonshotai/Kimi-K2.5

Model discovery and onboarding dropdown

OpenClaw 透過直接呼叫 推論端點 來探索模型:

Terminal window
GET https://router.huggingface.co/v1/models

(選用:發送 Authorization: Bearer $HUGGINGFACE_HUB_TOKEN$HF_TOKEN 以取得完整清單;某些端點在未經授權的情況下會傳回子集。)回應為 OpenAI 風格的 { "object": "list", "data": [ { "id": "Qwen/Qwen3-8B", "owned_by": "Qwen", ... }, ... ] }

當您設定 Hugging Face API 金鑰(透過 onboarding、HUGGINGFACE_HUB_TOKENHF_TOKEN)時,OpenClaw 會使用此 GET 來探索可用的聊天完成模型。在 互動式設定 期間,輸入 Token 後,您會看到一個從該清單填入的 預設 Hugging Face 模型 下拉式選單(如果請求失敗,則使用內建目錄)。在執行時期(例如 Gateway 啟動),當金鑰存在時,OpenClaw 會再次呼叫 GET https://router.huggingface.co/v1/models 來重新整理目錄。此清單會與內建目錄合併(用於上下文視窗和成本等元資料)。如果請求失敗或未設定金鑰,則僅使用內建目錄。

Model names, aliases, and policy suffixes
  • Name from API: The model display name is hydrated from GET /v1/models when the API returns name, title, or display_name; otherwise it is derived from the model id (e.g. deepseek-ai/DeepSeek-R1 becomes “DeepSeek R1”).
  • Override display name: You can set a custom label per model in config so it appears the way you want in the CLI and UI:
{
agents: {
defaults: {
models: {
"huggingface/deepseek-ai/DeepSeek-R1": { alias: "DeepSeek R1 (fast)" },
"huggingface/deepseek-ai/DeepSeek-R1:cheapest": { alias: "DeepSeek R1 (cheap)" },
},
},
},
}
  • Policy suffixes: OpenClaw’s bundled Hugging Face docs and helpers currently treat these two suffixes as the built-in policy variants:

    • :fastest — highest throughput.
    • :cheapest — lowest cost per output token.

    You can add these as separate entries in models.providers.huggingface.models or set model.primary with the suffix. You can also set your default provider order in Inference Provider settings (no suffix = use that order).

  • Config merge: Existing entries in models.providers.huggingface.models (e.g. in models.json) are kept when config is merged. So any custom name, alias, or model options you set there are preserved.

Environment and daemon setup

If the Gateway runs as a daemon (launchd/systemd), make sure HUGGINGFACE_HUB_TOKEN or HF_TOKEN is available to that process (for example, in ~/.openclaw/.env or via env.shellEnv).

Config: DeepSeek R1 with Qwen fallback
{
agents: {
defaults: {
model: {
primary: "huggingface/deepseek-ai/DeepSeek-R1",
fallbacks: ["huggingface/Qwen/Qwen3-8B"],
},
models: {
"huggingface/deepseek-ai/DeepSeek-R1": { alias: "DeepSeek R1" },
"huggingface/Qwen/Qwen3-8B": { alias: "Qwen3 8B" },
},
},
},
}
Config: 使用最便宜且最快的 Qwen 變體
{
agents: {
defaults: {
model: { primary: "huggingface/Qwen/Qwen3-8B" },
models: {
"huggingface/Qwen/Qwen3-8B": { alias: "Qwen3 8B" },
"huggingface/Qwen/Qwen3-8B:cheapest": { alias: "Qwen3 8B (cheapest)" },
"huggingface/Qwen/Qwen3-8B:fastest": { alias: "Qwen3 8B (fastest)" },
},
},
},
}
Config: 使用別名設定的 DeepSeek + Llama + GPT-OSS
{
agents: {
defaults: {
model: {
primary: "huggingface/deepseek-ai/DeepSeek-V3.2",
fallbacks: [
"huggingface/meta-llama/Llama-3.3-70B-Instruct",
"huggingface/openai/gpt-oss-120b",
],
},
models: {
"huggingface/deepseek-ai/DeepSeek-V3.2": { alias: "DeepSeek V3.2" },
"huggingface/meta-llama/Llama-3.3-70B-Instruct": { alias: "Llama 3.3 70B" },
"huggingface/openai/gpt-oss-120b": { alias: "GPT-OSS 120B" },
},
},
},
}
Config: 使用原則後綴的多個 Qwen 和 DeepSeek
{
agents: {
defaults: {
model: { primary: "huggingface/Qwen/Qwen2.5-7B-Instruct:cheapest" },
models: {
"huggingface/Qwen/Qwen2.5-7B-Instruct": { alias: "Qwen2.5 7B" },
"huggingface/Qwen/Qwen2.5-7B-Instruct:cheapest": { alias: "Qwen2.5 7B (cheap)" },
"huggingface/deepseek-ai/DeepSeek-R1:fastest": { alias: "DeepSeek R1 (fast)" },
"huggingface/meta-llama/Llama-3.1-8B-Instruct": { alias: "Llama 3.1 8B" },
},
},
},
}
模型選擇

所有提供者、模型參照和故障轉移行為的概覽。

模型選擇

如何選擇和設定模型。

推論提供者文件

Hugging Face 推論提供者官方文件。

設定

完整設定參考。