reflection:70b-q2_K - Ollama 框架

Reflection

一個高效能模型，使用名為 Reflection-tuning 的新技術訓練，該技術教導 LLM 偵測其推理中的錯誤並修正方向。

70b

103.4K 下載次數更新於 6 個月前

更新於 6 個月前

6 個月前

8fe3c853372c · 26GB

parameters70.6B

quantizationQ2_K

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"

{{- range $i, $_ := .Messages }}<|start_header_id|>{{ .Role }}<|end_header_id|> {{ .Content }} {{- i

You are a world-class AI system, capable of complex reasoning and reflection. Reason through the que

LLAMA 3.1 COMMUNITY LICENSE AGREEMENT Llama 3.1 Version Release Date: July 23, 2024 “Agreement”

讀我檔案

在取樣期間，模型將首先在 <thinking> 和 </thinking> 標籤內輸出推理，然後一旦對其推理感到滿意，它將在 <output> 和 </output> 標籤內輸出最終答案。這些標籤中的每一個都是特殊 token，經過模型訓練。

這使模型能夠將其內部的想法和推理與最終答案分開，從而改善使用者體驗。

在 <thinking> 部分內，模型可能會輸出一個或多個 <reflection> 標籤，這表示模型已在其推理中發現錯誤，並將嘗試在提供最終答案之前對其進行修正。

參考文獻