sailor2:8b-chat-q8_0 - Ollama 框架

Sailor2 是一個社群驅動的倡議，旨在為東南亞 (SEA) 帶來尖端的多語言語言模型。我們的研究強調，業界對於8B 和 20B 參數範圍的模型以及用於特定應用（例如推測性解碼和研究目的）的 1B 模型有強烈的需求。這些模型以 Apache 2.0 授權發布，提高了整個區域對先進語言技術的可近性。

Sailor2 以出色的多語言模型 Qwen 2.5 為基礎構建，並在 500B 個 tokens 上持續預訓練，以使用統一模型更好地支持 15 種語言。這些語言包括英語、中文、緬甸語、宿霧語、伊洛卡諾語、印尼語、爪哇語、高棉語、寮語、馬來語、巽他語、他加祿語、泰語、越南語和瓦雷語。透過解決對多樣化、穩健且可訪問的語言模型日益增長的需求，Sailor2 旨在透過開放、包容和可訪問的多語言 LLM 為東南亞地區服務不足的地區提供服務。Sailor2 模型有 1B、8B 和 20B 三種尺寸，它們分別從 Qwen2.5 的 0.5B、7B 和 14B 基礎模型擴展而來。

![logo](/assets/mchiang0610/sailor2/a76a9182-cc11-47e1-bb50-478ad4ccb157)

Sailor2 is a community-driven initiative that brings cutting-edge multilingual language models to South-East Asia (SEA). Our research highlights a strong demand for models in the **8B and 20B** parameter range for production use, alongside **1B models** for specialized applications, such as speculative decoding and research purposes. These models, released under the **Apache 2.0 license**, provide enhanced accessibility to advanced language technologies across the region.

Sailor2 builds upon the foundation of the awesome multilingual model Qwen 2.5 and is continuously pre-trained on 500B tokens to support 15 languages better with a unified model. These languages include English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray. By addressing the growing demand for diverse, robust, and accessible language models, Sailor2 seeks to serve the underserved in SEA areas with open, inclusive, and accessible multilingual LLMs. The Sailor2 model comes in three sizes, 1B, 8B, and 20B, which are expanded from the Qwen2.5 base models of 0.5B, 7B, and 14B, respectively.

貼上、拖曳或點擊以上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)