granite3.1-moe - Ollama 框架

granite3.1-moe

IBM Granite 1B 和 3B 模型是 IBM 的長上下文混合專家 (MoE) Granite 模型，專為低延遲使用而設計。

工具 1b 3b

20.6K 下載次數更新於 2 週前

更新於 2 週前

2 週前

b43d80d7fca7 · 2.0GB

quantizationQ4_K_M

知識截止日期：2024 年 4 月。您是 Granite，由 IBM 開發。

<|start_of_role|>system<|end_of_role|> {{- if and (gt (len .Messages) 0) (eq (index .Messages 0).Rol

Apache 2.0 許可證，2004 年 1 月

Readme

Granite 混合專家模型

IBM Granite 1B 和 3B 模型是 IBM 的長上下文混合專家 (MoE) Granite 模型，專為低延遲使用而設計。

這些模型在超過 10 兆個 tokens 的數據上進行訓練，Granite MoE 模型非常適合部署在設備端應用程式或需要即時推論的情況。

參數大小

1B

ollama run granite3.1-moe:1b

3B

ollama run granite3.1-moe:3b

支援語言

英文、德文、西班牙文、法文、日文、葡萄牙文、阿拉伯文、捷克文、義大利文、韓文、荷蘭文、中文（簡體）

功能

摘要
文本分類
文本提取
問答
檢索增強生成 (RAG)
程式碼相關任務
函數呼叫任務
多語言對話用例
長上下文任務，包括長文檔/會議摘要、長文檔問答等。

Granite 稠密模型

Granite 稠密模型提供 2B 和 8B 參數大小，旨在支援基於工具的用例和檢索增強生成 (RAG)，簡化程式碼生成、翻譯和錯誤修復。

查看模型頁面

了解更多

開發者： IBM Research
GitHub 儲存庫： ibm-granite/granite-language-models
網站： Granite 文件
發布日期：2024 年 12 月 18 日
許可證： Apache 2.0。

## Granite mixture of experts models

The IBM Granite **1B and 3B models** are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.

The models are trained on over 10 trillion tokens of data, the Granite MoE models are ideal for deployment in on-device applications or situations requiring instantaneous inference.

### Parameter Sizes

**1B:**
  
`ollama run granite3.1-moe:1b`

**3B:**

`ollama run granite3.1-moe:3b`

### Supported Languages
English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, Chinese (Simplified)

### Capabilities
* Summarization
* Text classification
* Text extraction
* Question-answering
* Retrieval Augmented Generation (RAG)
* Code related tasks
* Function-calling tasks
* Multilingual dialog use cases
* Long-context tasks including long document/meeting summarization, long document QA, etc.

## Granite dense models

The Granite dense models are available in **2B and 8B** parameter sizes designed to support tool-based use cases and for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

[See model page](https://ollama.dev.org.tw/library/granite3-dense)

## Learn more

- **Developers:** IBM Research
- **GitHub Repository:** [ibm-granite/granite-language-models](https://github.com/ibm-granite/granite-3.1-language-models)
- **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
- **Release Date**: December 18th, 2024
- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0).

貼上、拖放或點擊上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)