基於 Phi-3 微調,針對資訊擷取在私有高品質合成資料集上進行微調的 3.8B 模型。

3.8b

20.1K 6 個月前

自述文件

NuMind 🔥 的結構化資訊擷取模型

NuExtract 是 phi-3-mini 的一個版本,針對資訊擷取在私有高品質合成資料集上進行微調。若要使用此模型,請提供輸入文字 (少於 2000 個 tokens) 以及描述您需要擷取之資訊的 JSON 模板。

注意:此模型純粹為擷取式,因此模型輸出的所有文字皆以原始文字的樣貌呈現。您也可以提供輸出格式範例,以更精確地協助模型理解您的任務。

使用方式

提示格式

當使用特定的提示格式來擷取文字時,此模型效果最佳

### Template:
{
    "Model": {
        "Name": "",
        "Number of parameters": "",
    },
    "Usage": {
        "Use case": [],
        "Licence": ""
    }
}
### Example:
{
    "Model": {
        "Name": "Llama3",
        "Number of parameters": "8 billion",
    },
    "Usage": {
        "Use case":[
			"chat",
			"code completion"
		],
        "Licence": "Meta Llama3"
    }
}
### Text:
We introduce Mistral 7B, a 7–billion-parameter language model engineered for superior performance and efficiency. Mistral 7B outperforms the best open 13B model (Llama 2) across all evaluated benchmarks, and the best released 34B model (Llama 1) in reasoning, mathematics, and code generation. Our model leverages grouped-query attention (GQA) for faster inference, coupled with sliding window attention (SWA) to effectively handle sequences of arbitrary length with a reduced inference cost. We also provide a model fine-tuned to follow instructions, Mistral 7B – Instruct, that surpasses Llama 2 13B – chat model both on human and automated benchmarks. Our models are released under the Apache 2.0 license. 

Code: https://github.com/mistralai/mistral-src 
Webpage: https://mistral.ai/news/announcing-mistral-7b/

參考文獻

Hugging Face