Skip to content

Mistral 7B Instruct v0.1

Model Overview

The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is a instruct fine-tuned version of the Mistral-7B-v0.1 generative text model using a variety of publicly available conversation datasets.

QPC Configurations

Precision SoCs / Tensor slicing NSP-Cores (per SoC) Full Batch Size Chunking Prompt Length Context Length (CL) QPC URL QPC Size QPC Download Onnx URL Onnx Download Generation Date
MXFP6 2 16 1 128 4096 https://dc00tk1pxen80.cloudfront.net/SDK1.21.2/mistralai/Mistral-7B-Instruct-v0.1/mistralai_Mistral-7B-Instruct-v0.1_qpc_16cores_128pl_4096cl_1fbs_2devices_mxfp6_mxint8.tar.gz 6.3GB Download https://dc00tk1pxen80.cloudfront.net/SDK1.21.2/mistralai/Mistral-7B-Instruct-v0.1/mistralai_Mistral-7B-Instruct-v0.1_ONNX.tar.gz Download 18-Mar-2026