sailor2:20b-chat-fp16 - Ollama 框架

Sailor2 是一項社群驅動的倡議，旨在將尖端的多語言模型帶到東南亞 (SEA)。我們的研究強調了對於生產環境使用 8B 和 20B 參數範圍模型以及用於特定應用（例如推測解碼和研究目的）的 1B 模型的強烈需求。這些模型以 Apache 2.0 授權條款發布，旨在提升整個區域對於先進語言技術的可近性。

Sailor2 建立在出色的多語言模型 Qwen 2.5 的基礎之上，並在 500B 個 token 上持續預訓練，以透過統一模型更好地支援 15 種語言。這些語言包括英語、中文、緬甸語、宿霧語、伊洛卡諾語、印尼語、爪哇語、高棉語、寮語、馬來語、巽他語、他加祿語、泰語、越南語和瓦雷語。為了應對日益增長的對於多樣化、穩健且易於存取的語言模型的需求，Sailor2 旨在透過開放、包容且易於存取的多語言 LLM，服務東南亞地區服務不足的群體。Sailor2 模型提供三種尺寸：1B、8B 和 20B，這些尺寸分別是從 0.5B、7B 和 14B 的 Qwen2.5 基礎模型擴展而來。

![logo](/assets/mchiang0610/sailor2/a76a9182-cc11-47e1-bb50-478ad4ccb157)

Sailor2 is a community-driven initiative that brings cutting-edge multilingual language models to South-East Asia (SEA). Our research highlights a strong demand for models in the **8B and 20B** parameter range for production use, alongside **1B models** for specialized applications, such as speculative decoding and research purposes. These models, released under the **Apache 2.0 license**, provide enhanced accessibility to advanced language technologies across the region.

Sailor2 builds upon the foundation of the awesome multilingual model Qwen 2.5 and is continuously pre-trained on 500B tokens to support 15 languages better with a unified model. These languages include English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray. By addressing the growing demand for diverse, robust, and accessible language models, Sailor2 seeks to serve the underserved in SEA areas with open, inclusive, and accessible multilingual LLMs. The Sailor2 model comes in three sizes, 1B, 8B, and 20B, which are expanded from the Qwen2.5 base models of 0.5B, 7B, and 14B, respectively.

貼上、拖曳或點擊上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)