qwen2.5:3b-instruct-q3_K_L

由於這些領域的專業模型，它擁有顯著更多的知識，並且在編碼和數學方面的能力大大增強。
它在指令遵循、長文本生成（超過 8K tokens）、理解結構化數據（例如，表格）和生成結構化輸出方面取得了顯著進展，尤其是在 JSON 格式方面。它也更能適應多樣化的系統提示，從而改善聊天機器人的角色扮演和條件設定。
它支持高達 128K tokens 的長上下文，並且可以生成高達 8K tokens。
它為超過 29 種語言提供多語言支持，包括中文、英文、法語、西班牙語、葡萄牙語、德語、意大利語、俄語、日語、韓語、越南語、泰語、阿拉伯語等等。

請注意：除了 3B 和 72B 模型之外的所有模型均以 Apache 2.0 許可證發布，而 3B 和 72B 模型則以 Qwen 許可證發布。

參考文獻

GitHub

部落格文章

HuggingFace

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, a range of base language models and instruction-tuned models are released, with sizes ranging from 0.5 to 72 billion parameters. Qwen2.5 introduces the following improvements over Qwen2:

- It possesses **significantly more knowledge** and has greatly enhanced capabilities in **coding** and **mathematics**, due to specialized expert models in these domains.
- It demonstrates significant advancements in **instruction following**, **long-text generation** (over 8K tokens), **understanding structured data** (e.g., tables), and **generating structured outputs**, especially in JSON format. It is also **more resilient to diverse system prompts**, improving role-play and condition-setting for chatbots.
- It supports **long contexts** of up to 128K tokens and can generate up to 8K tokens.
- It offers **multilingual support** for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Please note: all models except the 3B and 72B are released under the Apache 2.0 license, while the 3B and 72B models are under the Qwen license.

## References

[GitHub](https://github.com/QwenLM/Qwen2.5)

[Blog post](https://qwenlm.github.io/blog/qwen2.5/)

[HuggingFace](https://huggingface.co/collections/Qwen/qwen25-66e81a666513e518adb90d9e)

貼上、拖曳或點擊上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)