qwq:32b-preview-q8_0 - Ollama 框架

qwq

QwQ 是一個實驗性研究模型，專注於提升 AI 推理能力。

工具 32b

153.9K 下載次數更新於 2 個月前

更新於 2 個月前

2 個月前

9c62a2e770b7 · 35GB

{ "stop": [ "<|im_start|>", "<|im_end|>" ] }

你是一個樂於助人且無害的助手。你是由阿里巴巴開發的 Qwen。你應該逐步思考

{{- if or .System .Tools }}<|im_start|>system {{- if .System }} {{ .System }} {{- end }} {{- if .Too

Apache License Version 2.0, January 2004

Readme

QwQ 是一個 32B 參數的實驗性研究模型，由 Qwen 團隊開發，專注於提升 AI 推理能力。

QwQ 在這些基準測試中展現了卓越的效能

GPQA 上達到 65.2%，展現其研究生級別的科學推理能力
AIME 上達到 50.0%，突顯其強大的數學問題解決能力
MATH-500 上達到 90.6%，展現其在各種主題中卓越的數學理解能力
LiveCodeBench 上達到 50.0%，驗證其在真實世界情境中穩健的程式設計能力。

這些結果 подчеркивают QwQ 在分析和問題解決能力方面的顯著進步，尤其是在需要深度推理的技術領域。

作為預覽版本，它展示了有前景的分析能力，但同時存在一些重要的限制

語言混合和程式碼切換： 模型可能會混合語言或意外地在語言之間切換，影響回應的清晰度。
遞迴推理迴圈： 模型可能會進入循環推理模式，導致冗長的回應而沒有明確的答案。
安全性和倫理考量： 模型需要加強安全措施，以確保可靠和安全的效能，使用者在部署時應謹慎行事。
效能和基準測試限制： 模型在數學和程式碼方面表現出色，但在其他領域，如常識推理和細緻的語言理解方面，仍有改進空間。

QwQ is a 32B parameter experimental research model developed by the Qwen Team, focused on advancing AI reasoning capabilities.

![image.png](/assets/mchiang0610/mikey3.1/e6d2ac3a-0d55-4e8b-9f53-ee5e269ed521)

![image.png](/assets/mchiang0610/mikey3.1/b56aaf87-c5bf-4249-be99-28930845e48e)

QwQ demonstrates remarkable performance across these benchmarks:

- **65.2% on GPQA**, showcasing its graduate-level scientific reasoning capabilities 
- **50.0% on AIME**, highlighting its strong mathematical problem-solving skills
- **90.6% on MATH-500**, demonstrating exceptional mathematical comprehension across diverse topics
- **50.0% on LiveCodeBench**, validating its robust programming abilities in real-world scenarios.

These results underscore QwQ’s significant advancement in analytical and problem-solving capabilities, particularly in technical domains requiring deep reasoning.

As a preview release, it demonstrates promising analytical abilities while having several important limitations:

1. **Language Mixing and Code-Switching:** The model may mix languages or switch between them unexpectedly, affecting response clarity.

2. **Recursive Reasoning Loops:** The model may enter circular reasoning patterns, leading to lengthy responses without a conclusive answer.

3. **Safety and Ethical Considerations:** The model requires enhanced safety measures to ensure reliable and secure performance, and users should exercise caution when deploying it.

4. **Performance and Benchmark Limitations:** The model excels in math and coding but has room for improvement in other areas, such as common sense reasoning and nuanced language understanding.

貼上、拖曳或點擊以上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)