Meta-Llama-3.1-8B-Instruct-FP8-dynamic huggingface.co api & neuralmagic Meta-Llama-3.1-8B-Instruct-FP8-dynamic github AI Model

huggingface.co

neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16

Total runs: 526.7K

Run Growth: 518.7K

Growth Rate: 98.75%

Updated: 12月 17 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8

Total runs: 160.3K

Run Growth: -169.2K

Growth Rate: -106.31%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8

Total runs: 108.6K

Run Growth: 77.2K

Growth Rate: 65.59%

Updated: 2月 10 2025

huggingface.co

neuralmagic/Mistral-Nemo-Instruct-2407-FP8

Total runs: 34.0K

Run Growth: 1.7K

Growth Rate: 4.96%

Updated: 7月 19 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Total runs: 26.6K

Run Growth: 21.1K

Growth Rate: 79.07%

Updated: 2月 12 2025

huggingface.co

neuralmagic/Phi-3-medium-128k-instruct-quantized.w4a16

Total runs: 26.0K

Run Growth: -399.9K

Growth Rate: -1537.75%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Meta-Llama-3-70B-Instruct-FP8

Total runs: 24.1K

Run Growth: -9.4K

Growth Rate: -39.25%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-8B-quantized.w8a16

Total runs: 19.8K

Run Growth: 4.0K

Growth Rate: 20.18%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Llama-3.2-3B-Instruct-FP8

Total runs: 19.5K

Run Growth: 8.3K

Growth Rate: 45.11%

Updated: 10月 16 2024

huggingface.co

neuralmagic/pixtral-12b-FP8-dynamic

Total runs: 18.3K

Run Growth: 5.3K

Growth Rate: 64.95%

Updated: 2月 07 2025

huggingface.co

neuralmagic/Llama-3.2-1B-Instruct-FP8-dynamic

Total runs: 17.0K

Run Growth: 13.8K

Growth Rate: 87.06%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Mistral-Small-24B-Instruct-2501-FP8-Dynamic

Total runs: 14.3K

Run Growth: 14.0K

Growth Rate: 99.81%

Updated: 1月 31 2025

huggingface.co

neuralmagic/Meta-Llama-3-8B-Instruct-FP8

Total runs: 11.8K

Run Growth: 2.1K

Growth Rate: 18.02%

Updated: 7月 18 2024

huggingface.co

neuralmagic/bge-large-en-v1.5-quant

Total runs: 10.3K

Run Growth: 9.1K

Growth Rate: 93.65%

Updated: 11月 13 2023

huggingface.co

neuralmagic/Sparse-Llama-3.1-8B-2of4

Total runs: 9.9K

Run Growth: 8.9K

Growth Rate: 90.48%

Updated: 12月 16 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8

Total runs: 8.2K

Run Growth: 893

Growth Rate: 10.90%

Updated: 2月 11 2025

huggingface.co

neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Total runs: 7.2K

Run Growth: 3.9K

Growth Rate: 54.57%

Updated: 10月 23 2024

huggingface.co

neuralmagic/Llama-3.2-11B-Vision-Instruct-FP8-dynamic

Total runs: 6.9K

Run Growth: 1.3K

Growth Rate: 18.68%

Updated: 10月 02 2024

huggingface.co

neuralmagic/Llama-3.2-1B-Instruct-FP8

Total runs: 6.6K

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 16 2024

huggingface.co

neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV

Total runs: 6.0K

Run Growth: 1.9K

Growth Rate: 33.93%

Updated: 6月 19 2024

huggingface.co

neuralmagic/Qwen2-7B-Instruct-FP8

Total runs: 5.9K

Run Growth: 4.9K

Growth Rate: 84.02%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a16

Total runs: 5.8K

Run Growth: -718

Growth Rate: -12.38%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a8

Total runs: 5.6K

Run Growth: 4.9K

Growth Rate: 88.27%

Updated: 10月 09 2024

huggingface.co

neuralmagic/DeepSeek-Coder-V2-Lite-Instruct-FP8

Total runs: 5.6K

Run Growth: -2.3K

Growth Rate: -41.27%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8

Total runs: 4.6K

Run Growth: 3.3K

Growth Rate: 71.86%

Updated: 10月 09 2024

huggingface.co

neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin

Total runs: 4.3K

Run Growth: 1.8K

Growth Rate: 41.94%

Updated: 3月 06 2024

huggingface.co

neuralmagic/Llama-3.2-1B-Instruct-quantized.w8a8

Total runs: 4.2K

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 16 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8

Total runs: 3.1K

Run Growth: 1.4K

Growth Rate: 52.06%

Updated: 2月 12 2025

huggingface.co

neuralmagic/Meta-Llama-3.1-8B-FP8

Total runs: 2.8K

Run Growth: 305

Growth Rate: 10.75%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w4a16

Total runs: 2.4K

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 10 2024

huggingface.co

neuralmagic/llama-2-7b-chat-marlin

Total runs: 2.2K

Run Growth: 1.1K

Growth Rate: 51.19%

Updated: 1月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Total runs: 2.0K

Run Growth: 828

Growth Rate: 41.86%

Updated: 10月 19 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16

Total runs: 1.7K

Run Growth: -10.8K

Growth Rate: -619.24%

Updated: 10月 23 2024

huggingface.co

neuralmagic/Llama-3.2-90B-Vision-Instruct-FP8-dynamic

Total runs: 1.7K

Run Growth: -1.4K

Growth Rate: -85.07%

Updated: 10月 02 2024

huggingface.co

neuralmagic/Qwen2-0.5B-Instruct-FP8

Total runs: 1.5K

Run Growth: -45

Growth Rate: -3.05%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Qwen2-1.5B-Instruct-quantized.w8a8

Total runs: 1.4K

Run Growth: 308

Growth Rate: 21.34%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Llama-2-7b-ultrachat200k

Total runs: 1.4K

Run Growth: 188

Growth Rate: 15.26%

Updated: 5月 07 2024

huggingface.co

neuralmagic/Phi-3-medium-128k-instruct-FP8

Total runs: 1.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit

Total runs: 1.2K

Run Growth: -488

Growth Rate: -39.90%

Updated: 6月 10 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Llama-70B-quantized.w4a16

Total runs: 1.1K

Run Growth: 1.1K

Growth Rate: 98.55%

Updated: 2月 12 2025

huggingface.co

neuralmagic/Llama-3.2-3B-Instruct-FP8-dynamic

Total runs: 1.0K

Run Growth: 597

Growth Rate: 41.40%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Phi-3-medium-128k-instruct-quantized.w8a16

Total runs: 959

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Qwen2-1.5B-Instruct-FP8

Total runs: 934

Run Growth: 872

Growth Rate: 93.36%

Updated: 7月 18 2024

huggingface.co

neuralmagic/bge-small-en-v1.5-quant

Total runs: 910

Run Growth: -117

Growth Rate: -14.79%

Updated: 11月 13 2023

huggingface.co

neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic

Total runs: 906

Run Growth: -66.3K

Growth Rate: -7462.65%

Updated: 10月 17 2024

huggingface.co

neuralmagic/Mixtral-8x22B-Instruct-v0.1-FP8

Total runs: 905

Run Growth: 486

Growth Rate: 55.93%

Updated: 8月 12 2024

huggingface.co

neuralmagic/llama2.c-stories110M-pruned50

Total runs: 898

Run Growth: 40

Growth Rate: 4.68%

Updated: 3月 05 2024

huggingface.co

neuralmagic/Llama-3.2-3B-Instruct-quantized.w8a8

Total runs: 896

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/OpenHermes-2.5-Mistral-7B-marlin

Total runs: 824

Run Growth: 161

Growth Rate: 19.83%

Updated: 3月 06 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Qwen-32B-FP8-dynamic

Total runs: 822

Run Growth: 730

Growth Rate: 96.82%

Updated: 2月 11 2025

huggingface.co

neuralmagic/Llama-2-7b-chat-quantized.w8a8

Total runs: 786

Run Growth: 355

Growth Rate: 45.17%

Updated: 10月 09 2024

huggingface.co

neuralmagic/gemma-2-2b-it-quantized.w4a16

Total runs: 709

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/bge-base-en-v1.5-quant

Total runs: 693

Run Growth: 107

Growth Rate: 15.35%

Updated: 11月 13 2023

huggingface.co

neuralmagic/gemma-2-9b-it-FP8

Total runs: 660

Run Growth: 147

Growth Rate: 24.62%

Updated: 7月 18 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic

Total runs: 652

Run Growth: 562

Growth Rate: 94.30%

Updated: 2月 10 2025

huggingface.co

neuralmagic/Mistral-7B-Instruct-v0.3-quantized.w4a16

Total runs: 610

Run Growth: 342

Growth Rate: 56.07%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Mistral-Nemo-Instruct-2407-quantized.w4a16

Total runs: 605

Run Growth: 193

Growth Rate: 29.92%

Updated: 10月 09 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16

Total runs: 602

Run Growth: 507

Growth Rate: 94.77%

Updated: 2月 12 2025

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Qwen-14B-FP8-dynamic

Total runs: 591

Run Growth: 562

Growth Rate: 97.91%

Updated: 2月 11 2025

huggingface.co

neuralmagic/Meta-Llama-3-70B-Instruct-quantized.w8a16

Total runs: 579

Run Growth: 253

Growth Rate: 43.77%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Mistral-7B-Instruct-v0.3-quantized.w8a16

Total runs: 561

Run Growth: -31

Growth Rate: -5.53%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Phi-3-mini-128k-instruct-quantized.w8a8

Total runs: 545

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a16

Total runs: 502

Run Growth: 279

Growth Rate: 55.58%

Updated: 10月 09 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Llama-8B-FP8-dynamic

Total runs: 497

Run Growth: 409

Growth Rate: 83.30%

Updated: 2月 10 2025

huggingface.co

neuralmagic/Phi-3-mini-128k-instruct-quantized.w4a16

Total runs: 482

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Phi-3-medium-128k-instruct-quantized.w8a8

Total runs: 466

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a8

Total runs: 465

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 10 2024

huggingface.co

neuralmagic/Phi-3-mini-128k-instruct-FP8

Total runs: 456

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Mistral-7B-Instruct-v0.3-FP8

Total runs: 445

Run Growth: -48

Growth Rate: -10.79%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-70B-FP8

Total runs: 420

Run Growth: 92

Growth Rate: 21.90%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Mixtral-8x7B-Instruct-v0.1-FP8

Total runs: 389

Run Growth: -5.6K

Growth Rate: -1432.39%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Qwen2-72B-Instruct-FP8

Total runs: 372

Run Growth: -178

Growth Rate: -46.35%

Updated: 7月 18 2024

huggingface.co

neuralmagic/gemma-2-2b-it-FP8

Total runs: 362

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/oBERT-teacher-squadv1

Total runs: 353

Run Growth: 94

Growth Rate: 26.86%

Updated: 7月 31 2022

huggingface.co

neuralmagic/Mistral-7B-Instruct-v0.3-quantized.w8a8

Total runs: 349

Run Growth: 50

Growth Rate: 14.33%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Phi-3-small-128k-instruct-quantized.w8a16

Total runs: 344

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Qwen2-57B-A14B-Instruct-FP8

Total runs: 337

Run Growth: -39

Growth Rate: -11.57%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a16

Total runs: 334

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8

Total runs: 323

Run Growth: 71

Growth Rate: 22.54%

Updated: 2月 12 2025

huggingface.co

neuralmagic/bge-small-en-v1.5-sparse

Total runs: 277

Run Growth: -9

Growth Rate: -3.21%

Updated: 11月 13 2023

huggingface.co

neuralmagic/Qwen2-0.5B-Instruct-quantized.w8a8

Total runs: 272

Run Growth: -18

Growth Rate: -6.62%

Updated: 10月 09 2024

huggingface.co

neuralmagic/DeepSeek-Coder-V2-Instruct-FP8

Total runs: 271

Run Growth: -641

Growth Rate: -236.53%

Updated: 7月 22 2024

huggingface.co

neuralmagic/bge-base-en-v1.5-sparse

Total runs: 260

Run Growth: -17

Growth Rate: -6.37%

Updated: 11月 13 2023

huggingface.co

neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8-dynamic

Total runs: 251

Run Growth: 42

Growth Rate: 16.54%

Updated: 10月 19 2024

huggingface.co

neuralmagic/bge-large-en-v1.5-sparse

Total runs: 250

Run Growth: -16

Growth Rate: -6.37%

Updated: 11月 13 2023

huggingface.co

neuralmagic/zephyr-7b-beta-marlin

Total runs: 240

Run Growth: -304

Growth Rate: -124.59%

Updated: 3月 06 2024

huggingface.co

neuralmagic/Llama-2-7b-chat-hf-FP8

Total runs: 222

Run Growth: -156

Growth Rate: -70.27%

Updated: 7月 18 2024

huggingface.co

neuralmagic/gemma-2-9b-it-quantized.w4a16

Total runs: 216

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w4a16

Total runs: 202

Run Growth: -18

Growth Rate: -8.91%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Qwen2-0.5B-Instruct-quantized.w4a16

Total runs: 200

Run Growth: 8

Growth Rate: 4.00%

Updated: 7月 18 2024

huggingface.co

neuralmagic/DeepSeek-R1-Distill-Qwen-7B-FP8-dynamic

Total runs: 199

Run Growth: 175

Growth Rate: 94.59%

Updated: 2月 11 2025

huggingface.co

neuralmagic/Qwen2-1.5B-Instruct-quantized.w4a16

Total runs: 197

Run Growth: 14

Growth Rate: 7.11%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Phi-3.5-mini-instruct-FP8-KV

Total runs: 190

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 01 2024

huggingface.co

neuralmagic/Qwen2-7B-Instruct-quantized.w4a16

Total runs: 186

Run Growth: -92

Growth Rate: -49.46%

Updated: 7月 18 2024

huggingface.co

neuralmagic/Qwen2-7B-Instruct-quantized.w8a8

Total runs: 186

Run Growth: -351

Growth Rate: -188.71%

Updated: 10月 09 2024

huggingface.co

neuralmagic/starcoder2-15b-FP8

Total runs: 185

Run Growth: -485

Growth Rate: -262.16%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Llama-3.2-1B-FP8

Total runs: 178

Run Growth: 0

Growth Rate: 0.00%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Llama-2-7b-chat-quantized.w8a16

Total runs: 162

Run Growth: 24

Growth Rate: 14.81%

Updated: 7月 18 2024

huggingface.co

neuralmagic/SmolLM-135M-Instruct-quantized.w8a8

Total runs: 157

Run Growth: -2

Growth Rate: -1.27%

Updated: 10月 09 2024

huggingface.co

neuralmagic/Llama-2-7b-gsm8k

Total runs: 157

Run Growth: -124

Growth Rate: -82.67%

Updated: 6月 20 2024

Benchmark	Meta-Llama-3.1-8B-Instruct	Meta-Llama-3.1-8B-Instruct-FP8-dynamic(this model)	Recovery
MMLU (5-shot)	67.94	68.09	100.2%
ARC Challenge (0-shot)	83.11	82.34	99.07%
GSM-8K (CoT, 8-shot, strict-match)	82.03	82.34	100.3%
Hellaswag (10-shot)	80.01	79.68	99.59%
Winogrande (5-shot)	77.90	77.03	98.88%
TruthfulQA (0-shot, mc2)	54.04	53.37	98.76%
Average	74.17	73.81	99.48%

neuralmagic / Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Introduction of Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Model Details of Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Model Overview

Model Optimizations

Deployment

Use with vLLM

Creation

Evaluation

Accuracy

Open LLM Leaderboard evaluation scores

Reproduction

MMLU

ARC-Challenge

GSM-8K

Hellaswag

Winogrande

TruthfulQA

Runs of neuralmagic Meta-Llama-3.1-8B-Instruct-FP8-dynamic on huggingface.co

More Information About Meta-Llama-3.1-8B-Instruct-FP8-dynamic huggingface.co Model

More Meta-Llama-3.1-8B-Instruct-FP8-dynamic license Visit here:

Meta-Llama-3.1-8B-Instruct-FP8-dynamic huggingface.co

Meta-Llama-3.1-8B-Instruct-FP8-dynamic huggingface.co Url

neuralmagic Meta-Llama-3.1-8B-Instruct-FP8-dynamic online free

neuralmagic Meta-Llama-3.1-8B-Instruct-FP8-dynamic online free url in huggingface.co:

Meta-Llama-3.1-8B-Instruct-FP8-dynamic install

Meta-Llama-3.1-8B-Instruct-FP8-dynamic install url in huggingface.co:

Url of Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Meta-Llama-3.1-8B-Instruct-FP8-dynamic huggingface.co Url

Provider of Meta-Llama-3.1-8B-Instruct-FP8-dynamic huggingface.co

Other API from neuralmagic