qwq:32b-preview-fp16 - Ollama 框架

qwq

QwQ 是一個專注於提升人工智慧推理能力的實驗性研究模型。

工具 32b

153.9K 下載次數更新於 2 個月前

更新於 2 個月前

2 個月前

44d5ed096b85 · 66GB

{ "stop": [ "<|im_start|>", "<|im_end|>" ] }

You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-b

{{- if or .System .Tools }}<|im_start|>system {{- if .System }} {{ .System }} {{- end }} {{- if .Too

Apache License Version 2.0, January 2004

說明文件

QwQ 是由 Qwen 團隊開發的 32B 參數實驗性研究模型，專注於提升人工智慧推理能力。

QwQ 在這些基準測試中展現了卓越的性能

在 GPQA 上達到 65.2%，展現其研究生級別的科學推理能力
在 AIME 上達到 50.0%，突顯其強大的數學問題解決能力
在 MATH-500 上達到 90.6%，展現其在不同主題中卓越的數學理解能力
在 LiveCodeBench 上達到 50.0%，驗證了其在真實場景中穩健的程式設計能力。

這些結果突顯了 QwQ 在分析和問題解決能力方面的顯著進展，尤其是在需要深度推理的技術領域。

作為預覽版本，它展現了有前景的分析能力，但同時也存在一些重要的限制

語言混合和程式碼切換： 模型可能會混合語言或意外地在語言之間切換，影響回應的清晰度。
遞迴推理迴圈： 模型可能會進入循環推理模式，導致冗長的回應而沒有結論性的答案。
安全性和倫理考量： 模型需要加強安全措施以確保可靠和安全的性能，使用者在部署時應謹慎。
性能和基準測試限制： 模型在數學和編碼方面表現出色，但在其他領域，例如常識推理和細緻的語言理解方面，仍有改進空間。

QwQ is a 32B parameter experimental research model developed by the Qwen Team, focused on advancing AI reasoning capabilities.

![image.png](/assets/mchiang0610/mikey3.1/e6d2ac3a-0d55-4e8b-9f53-ee5e269ed521)

![image.png](/assets/mchiang0610/mikey3.1/b56aaf87-c5bf-4249-be99-28930845e48e)

QwQ demonstrates remarkable performance across these benchmarks:

- **65.2% on GPQA**, showcasing its graduate-level scientific reasoning capabilities 
- **50.0% on AIME**, highlighting its strong mathematical problem-solving skills
- **90.6% on MATH-500**, demonstrating exceptional mathematical comprehension across diverse topics
- **50.0% on LiveCodeBench**, validating its robust programming abilities in real-world scenarios.

These results underscore QwQ’s significant advancement in analytical and problem-solving capabilities, particularly in technical domains requiring deep reasoning.

As a preview release, it demonstrates promising analytical abilities while having several important limitations:

1. **Language Mixing and Code-Switching:** The model may mix languages or switch between them unexpectedly, affecting response clarity.

2. **Recursive Reasoning Loops:** The model may enter circular reasoning patterns, leading to lengthy responses without a conclusive answer.

3. **Safety and Ethical Considerations:** The model requires enhanced safety measures to ensure reliable and secure performance, and users should exercise caution when deploying it.

4. **Performance and Benchmark Limitations:** The model excels in math and coding but has room for improvement in other areas, such as common sense reasoning and nuanced language understanding.

貼上、拖曳或點擊上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)