phi3:3.8b-mini-128k-instruct-q4_K_M

我們的模型並非專門針對所有下游目的而設計或評估。開發人員在選擇使用案例時應考慮語言模型的常見限制，並在使用於特定下游使用案例（特別是高風險情境）之前，評估並減輕準確性、安全性和公平性。開發人員應注意並遵守適用於其使用案例的相關法律或法規（包括隱私權、貿易合規法律等）。本模型卡中包含的任何內容均不應被解釋或視為對模型發布所依據授權條款的限制或修改。

負責任任的 AI 考量

與其他語言模型一樣，Phi 系列模型可能會以不公平、不可靠或冒犯性的方式運作。需要注意的一些限制行為包括

服務品質：Phi 模型主要以英文文本進行訓練。英語以外的語言效能會較差。訓練資料中代表性較少的英語語言變體，其效能可能比標準美式英語更差。
危害的呈現與刻板印象的延續：這些模型可能會過度或過低呈現某些人群，消除某些人群的代表性，或強化貶低或負面的刻板印象。儘管進行了安全後訓練，但由於不同群體代表性的程度不同，或反映真實世界模式和社會偏見的訓練資料中負面刻板印象示例的普遍性，這些限制可能仍然存在。
不當或冒犯性內容：這些模型可能會產生其他類型的不當或冒犯性內容，這可能會使其不適合在敏感環境中部署，除非採取針對特定使用案例的額外減輕措施。
資訊可靠性：語言模型可能會產生無意義的內容或捏造聽起來合理但不準確或過時的內容。
程式碼的有限範圍：Phi-3 訓練資料的大部分基於 Python，並使用常見的套件，例如「typing、math、random、collections、datetime、itertools」。如果模型產生使用其他套件或以其他語言編寫的腳本的 Python 腳本，我們強烈建議使用者手動驗證所有 API 用法。

開發人員應採用負責任任的 AI 最佳實務，並有責任確保特定使用案例符合相關法律和法規（例如隱私權、貿易等）。需要考量的重要領域包括：+ 分配：模型可能不適用於可能對法律地位或資源或生活機會分配產生重大影響的情境（例如：住房、就業、信用等），除非經過進一步評估和額外的去偏見技術。

高風險情境：開發人員應評估在高風險情境中使用模型的適用性，在這些情境中，不公平、不可靠或冒犯性的輸出可能代價極高或導致傷害。這包括在準確性和可靠性至關重要的敏感或專業領域提供建議（例如：法律或健康建議）。應根據部署環境在應用程式層級實施額外的安全措施。
錯誤資訊：模型可能會產生不準確的資訊。開發人員應遵循透明度最佳實務，並告知終端使用者他們正在與 AI 系統互動。在應用程式層級，開發人員可以建置回饋機制和管道，以將回應紮根於特定使用案例的上下文資訊中，這是一種稱為檢索增強生成 (RAG) 的技術。
有害內容的產生：開發人員應評估輸出的上下文，並使用可用的安全分類器或適合其使用案例的自訂解決方案。
誤用：可能發生其他形式的誤用，例如詐欺、垃圾郵件或惡意軟體製作，開發人員應確保其應用程式不違反適用的法律和法規。

訓練

模型

架構：Phi-3 Mini 具有 3.8B 參數，是一個密集的僅解碼器 Transformer 模型。該模型使用監督式微調 (SFT) 和直接偏好最佳化 (DPO) 進行微調，以確保與人類偏好和安全指南保持一致。
輸入：文字。最適合使用聊天格式的提示。
上下文長度：128K tokens
GPUS：512 H100-80G
訓練時間：7 天
訓練資料：3.3T tokens
輸出：產生的文字以回應輸入
日期：我們的模型在 2024 年 2 月至 4 月期間進行訓練
狀態：這是一個靜態模型，在離線資料集上進行訓練，截止日期為 2023 年 10 月。隨著我們改進模型，未來可能會發布經過調整的模型版本。

資料集

我們的訓練資料包含各種來源，總計 3.3 兆個 tokens，並且是以下項目的組合：1) 公開可用的文件，經過嚴格的品質篩選、選定的高品質教育資料和程式碼； 2) 新建立的合成「教科書式」資料，用於教授數學、編碼、常識推理、世界常識（科學、日常活動、心智理論等）； 3) 高品質聊天格式監督資料，涵蓋各種主題，以反映人類在不同方面的偏好，例如指令遵循、真實性、誠實和樂於助人。

軟體

授權條款

該模型根據 MIT 授權條款授權。

商標

此專案可能包含專案、產品或服務的商標或標誌。授權使用 Microsoft 商標或標誌必須遵守並遵循 Microsoft 商標與品牌指南。在此專案的修改版本中使用 Microsoft 商標或標誌不得造成混淆或暗示 Microsoft 贊助。任何使用第三方商標或標誌均受該第三方政策的約束。

資源

Phi-3 is a family of open AI models developed by Microsoft.

## Parameter sizes

- [Phi-3 Mini](https://ollama.dev.org.tw/library/phi3:mini) – 3B parameters – `ollama run phi3:mini`
- [Phi-3 Medium](https://ollama.dev.org.tw/library/phi3:medium) – 14B parameters – `ollama run phi3:medium`

## Context window sizes

> Note: the 128k version of this model requires [Ollama 0.1.39](https://github.com/ollama/ollama/releases/tag/v0.1.39) or later.

- 4k `ollama run phi3:mini` `ollama run phi3:medium`
- 128k `ollama run phi3:medium-128k`

![image.png](https://ollama.dev.org.tw/assets/library/phi3/83b3de66-82d8-4455-9117-256802c1b82e)

## Phi-3 Mini

Phi-3 Mini is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties.

The model has underwent a post-training process that incorporates both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.

When assessed against benchmarks testing common sense, language understanding, math, code, long context and logical reasoning, Phi-3 Mini-4K-Instruct showcased a robust and state-of-the-art performance among models with less than 13 billion parameters.

## Phi-3 Medium

Phi-3 Medium is a 14B parameter language model, and outperforms Gemini 1.0 Pro.

![image.png](https://ollama.dev.org.tw/assets/library/phi3/2868e29b-3bba-4c4a-a6ed-1a27fb102867)

## Intended Uses

**Primary use cases**

The model is intended for commercial and research use in English. The model provides uses for applications which require
1) memory/compute constrained environments
2) latency bound scenarios
3) strong reasoning (especially math and logic)
4) long context

Our model is designed to accelerate research on language and multimodal models, for use as a building block for generative AI powered features.

**Use case considerations**

Our models are not specifically designed or evaluated for all downstream purposes. Developers should consider common limitations of language models as they select use cases, and evaluate and mitigate for accuracy, safety, and fariness before using within a specific downstream use case, particularly for high risk scenarios.
Developers  should be aware of and adhere to applicable laws or regulations (including privacy, trade compliance laws, etc.) that are relevant to their use case.
Nothing contained in this Model Card should be interpreted as or deemed a restriction or modification to the license the model is released under.

## Responsible AI Considerations
Like other language models, the Phi series models can potentially behave in ways that are unfair, unreliable, or offensive. Some of the limiting behaviors to be aware of include:

+ Quality of Service: the Phi models are trained primarily on English text. Languages other than English will experience worse performance. English language varieties with less representation in the training data might experience worse performance than standard American English.

+ Representation of Harms & Perpetuation of Stereotypes: These models can over- or under-represent groups of people, erase representation of some groups, or reinforce demeaning or negative stereotypes. Despite safety post-training, these limitations may still be present due to differing levels of representation of different groups or prevalence of examples of negative stereotypes in training data that reflect real-world patterns and societal biases.

+ Inappropriate or Offensive Content: these models may produce other types of inappropriate or offensive content, which may make it inappropriate to deploy for sensitive contexts without additional mitigations that are specific to the use case.

+ Information Reliability: Language models can generate nonsensical content or fabricate content that might sound reasonable but is inaccurate or outdated.
+ Limited Scope for Code: Majority of Phi-3 training data is based in Python and use common packages such as "typing, math, random, collections, datetime, itertools". If the model generates Python scripts that utilize other packages or scripts in other languages, we strongly recommend users manually verify all API uses.

Developers should apply responsible AI best practices and are responsible for ensuring that a specific use case complies with relevant laws and regulations (e.g. privacy, trade, etc.). Important areas for consideration include:
+ Allocation: Models may not be suitable for scenarios that could have consequential impact on legal status or the allocation of resources or life opportunities (ex: housing, employment, credit, etc.) without further assessments and additional debiasing techniques.

+ High-Risk Scenarios: Developers should assess suitability of using models in high-risk scenarios where unfair, unreliable or offensive outputs might be extremely costly or lead to harm. This includes providing advice in sensitive or expert domains where accuracy and reliability are critical (ex: legal or health advice). Additional safeguards should be implemented at the application level according to the deployment context.

+ Misinformation: Models may produce inaccurate information. Developers should follow transparency best practices and inform end-users they are interacting with an AI system. At the application level, developers can build feedback mechanisms and pipelines to ground responses in use-case specific, contextual information, a technique known as Retrieval Augmented Generation (RAG).

+ Generation of Harmful Content: Developers should assess outputs for their context and use available safety classifiers or custom solutions appropriate for their use case.

+ Misuse: Other forms of misuse such as fraud, spam, or malware production may be possible, and developers should ensure that their applications do not violate applicable laws and regulations.

## Training

### Model

* Architecture: Phi-3 Mini has 3.8B parameters and is a dense decoder-only Transformer model. The model is fine-tuned with Supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) to ensure alignment with human preferences and safety guidelines.
* Inputs: Text. It is best suited for prompts using chat format.
* Context length: 128K tokens
* GPUS: 512 H100-80G
* Training time: 7 days
* Training data: 3.3T tokens
* Outputs: Generated text in response to the input
* Dates: Our models were trained between February and April 2024
* Status: This is a static model trained on an offline dataset with cutoff date October 2023. Future versions of the tuned models may be released as we improve models.

### Datasets
Our training data includes a wide variety of sources, totaling 3.3 trillion tokens, and is a combination of
1) publicly available documents filtered rigorously for quality, selected high-quality educational data, and code;
2) newly created synthetic, “textbook-like” data for the purpose of teaching math, coding, common sense reasoning, general knowledge of the world (science, daily activities, theory of mind, etc.);
3) high quality chat format supervised data covering various topics to reflect human preferences on different aspects such as instruct-following, truthfulness, honesty and helpfulness.

### Software

* [PyTorch](https://github.com/pytorch/pytorch)
* [DeepSpeed](https://github.com/microsoft/DeepSpeed)
* [Transformers](https://github.com/huggingface/transformers)
* [Flash-Attention](https://github.com/HazyResearch/flash-attention)

### License

The model is licensed under the [MIT license](https://ollama.dev.org.tw/library/phi3:latest#fa8235e5b48fENSE).

## Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow [Microsoft’s Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks). Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.

## Resources

+ [HuggingFace](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf)
+ [Phi-3 Microsoft Blog](https://aka.ms/phi3-blog)
+ [Phi-3 Technical Report](https://aka.ms/phi3-tech-report)
+ [Phi-3 on Azure AI Studio](https://aka.ms/phi3-azure-ai)
+ [Phi-3 on Hugging Face](https://aka.ms/phi3-hf)
+ Phi-3 ONNX: [4K](https://aka.ms/phi3-mini-4k-instruct-onnx) and [128K](https://aka.ms/phi3-mini-128k-instruct-onnx)

貼上、拖放或點擊以上傳圖片 (.png, .jpeg, .jpg, .svg, .gif)