Mistral 7B Instruct v0.1
Model Overview
The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is an instruction fine-tuned version of the Mistral-7B-v0.1 generative text model, trained on a variety of publicly available conversation datasets.
- Paper: Mistral 7B
- Model Source: mistralai/Mistral-7B-Instruct-v0.1
QPC Configurations
| Precision | SoCs / Tensor slicing | NSP-Cores (per SoC) | Full Batch Size | Chunking Prompt Length | Context Length (CL) | QPC URL | QPC Size | QPC Download | Onnx URL | Onnx Download | Generation Date |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MXFP6 | 2 | 16 | 1 | 128 | 4096 | https://dc00tk1pxen80.cloudfront.net/SDK1.21.2/mistralai/Mistral-7B-Instruct-v0.1/mistralai_Mistral-7B-Instruct-v0.1_qpc_16cores_128pl_4096cl_1fbs_2devices_mxfp6_mxint8.tar.gz | 6.3GB | Download | https://dc00tk1pxen80.cloudfront.net/SDK1.21.2/mistralai/Mistral-7B-Instruct-v0.1/mistralai_Mistral-7B-Instruct-v0.1_ONNX.tar.gz | Download | 18-Mar-2026 |
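The QPC archive name in the table appears to encode the same settings as the table columns (cores, prompt length, context length, full batch size, device count, precision). The following is a minimal sketch that recovers those fields from the URL so they can be checked against the table. The naming pattern is an assumption inferred from the single file name listed here, not a documented convention, and `parse_qpc_url` is a hypothetical helper. The trailing `mxint8` token is not explained in the table, so it is kept as part of the precision string unchanged.

```python
import re
from pathlib import PurePosixPath
from urllib.parse import urlparse

# URL copied from the QPC Configurations table above.
QPC_URL = (
    "https://dc00tk1pxen80.cloudfront.net/SDK1.21.2/mistralai/"
    "Mistral-7B-Instruct-v0.1/"
    "mistralai_Mistral-7B-Instruct-v0.1_qpc_16cores_128pl_4096cl_"
    "1fbs_2devices_mxfp6_mxint8.tar.gz"
)

# Assumed naming pattern, inferred from the one archive name in the table:
#   ..._qpc_<N>cores_<N>pl_<N>cl_<N>fbs_<N>devices_<precision>.tar.gz
_QPC_NAME = re.compile(
    r"_qpc_(?P<cores>\d+)cores"
    r"_(?P<prompt_len>\d+)pl"
    r"_(?P<ctx_len>\d+)cl"
    r"_(?P<full_batch_size>\d+)fbs"
    r"_(?P<devices>\d+)devices"
    r"_(?P<precision>.+)\.tar\.gz$"
)


def parse_qpc_url(url: str) -> dict:
    """Extract configuration fields from a QPC archive URL (hypothetical helper)."""
    file_name = PurePosixPath(urlparse(url).path).name
    match = _QPC_NAME.search(file_name)
    if match is None:
        raise ValueError(f"unrecognized QPC archive name: {file_name}")
    fields = match.groupdict()
    # Keep the precision token as text; convert every other field to int.
    precision = fields.pop("precision")
    config = {key: int(value) for key, value in fields.items()}
    config["precision"] = precision
    return config


config = parse_qpc_url(QPC_URL)
for key, value in config.items():
    print(f"{key}: {value}")
```

Comparing the parsed values with the table row (16 NSP cores, prompt length 128, context length 4096, full batch size 1, 2 SoCs) is a quick sanity check before downloading the 6.3GB archive.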