sailor2:8b-chat-q8_0 - Ollama 框架

Sailor2 是一项社区驱动的倡议，旨在为东南亚 (SEA) 带来最先进的多语种语言模型。我们的研究强调了对用于生产的8B 和 20B参数范围内的模型的强烈需求，以及用于推测性解码和研究等专门应用的1B 模型。这些模型根据 Apache 2.0 许可发布，为该地区的高级语言技术提供了更高的可访问性。

Sailor2 构建在出色的多语种模型 Qwen 2.5 的基础上，并持续在 500B 个 token 上进行预训练，以通过统一模型更好地支持 15 种语言。这些语言包括英语、中文、缅甸语、宿务语、伊洛卡诺语、印度尼西亚语、爪哇语、高棉语、老挝语、马来语、巽他语、他加禄语、泰语、越南语和瓦雷语。通过满足对多样化、强大且可访问的语言模型日益增长的需求，Sailor2 致力于为 SEA 地区服务不足的群体提供开放、包容且可访问的多语种 LLM。 Sailor2 模型有三种尺寸：1B、8B 和 20B，分别从 Qwen2.5 的 0.5B、7B 和 14B 基础模型扩展而来。

![logo](/assets/mchiang0610/sailor2/a76a9182-cc11-47e1-bb50-478ad4ccb157)

Sailor2 is a community-driven initiative that brings cutting-edge multilingual language models to South-East Asia (SEA). Our research highlights a strong demand for models in the **8B and 20B** parameter range for production use, alongside **1B models** for specialized applications, such as speculative decoding and research purposes. These models, released under the **Apache 2.0 license**, provide enhanced accessibility to advanced language technologies across the region.

Sailor2 builds upon the foundation of the awesome multilingual model Qwen 2.5 and is continuously pre-trained on 500B tokens to support 15 languages better with a unified model. These languages include English, Chinese, Burmese, Cebuano, Ilocano, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tagalog, Thai, Vietnamese, and Waray. By addressing the growing demand for diverse, robust, and accessible language models, Sailor2 seeks to serve the underserved in SEA areas with open, inclusive, and accessible multilingual LLMs. The Sailor2 model comes in three sizes, 1B, 8B, and 20B, which are expanded from the Qwen2.5 base models of 0.5B, 7B, and 14B, respectively.

粘贴、拖放或单击以上传图像 (.png, .jpeg, .jpg, .svg, .gif)