sailor2:8b-chat-q4_K_M - Ollama 框架

Sailor2 是一個社群驅動的倡議，旨在為東南亞 (SEA) 帶來尖端的多語言模型。我們的研究強調，市場對於生產環境中使用的 8B 和 20B 參數範圍模型以及用於特定應用（例如推測性解碼和研究目的）的 1B 模型有強烈需求。這些模型以 Apache 2.0 許可證發布，旨在提高整個區域對先進語言技術的可及性。

Sailor2 以出色的多語言模型 Qwen 2.5 為基礎構建，並在 500B tokens 上持續預訓練，以更好地支援包含英語、中文、緬甸語、宿霧語、伊洛卡諾語、印尼語、爪哇語、高棉語、寮語、馬來語、巽他語、他加祿語、泰語、越南語和瓦瑞語等 15 種語言的統一模型。通過應對對多樣化、穩健且可訪問的語言模型不斷增長的需求，Sailor2 旨在通過開放、包容且可訪問的多語言 LLM 為 SEA 地區服務不足的群體提供服務。Sailor2 模型提供 1B、8B 和 20B 三種尺寸，它們分別從 Qwen2.5 的 0.5B、7B 和 14B 基礎模型擴展而來。

![logo](/assets/mchiang0610/sailor2/a76a9182-cc11-47e1-bb50-478ad4ccb157)

Sailor2 is a community-driven initiative that brings cutting-edge multilingual language models to South-East Asia (SEA). Our research highlights a strong demand for models in the **8B and 20B** parameter range for production use, alongside **1B models** for specialized applications, such as speculative decoding and research purposes. These models, released under the **Apache 2.0 license**, provide enhanced accessibility to advanced language technologies across the region.

Sailor2 builds upon the foundation of the awesome multilingual model Qwen 2.5 and is continuously pre-trained on 500B tokens to support 15 languages better with a unified model. These languages include English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray. By addressing the growing demand for diverse, robust, and accessible language models, Sailor2 seeks to serve the underserved in SEA areas with open, inclusive, and accessible multilingual LLMs. The Sailor2 model comes in three sizes, 1B, 8B, and 20B, which are expanded from the Qwen2.5 base models of 0.5B, 7B, and 14B, respectively.

貼上、拖放或點擊以<0xE7><0xB7><0xA5>傳圖片 (.png, .jpeg, .jpg, .svg, .gif)