nvidia Llama-3.1-70B-Instruct-FP8 AI model on huggingface.co

huggingface.co

nvidia/speakerverification_en_titanet_large

Total runs: 1.3M

Run Growth: 264.8K

Growth Rate: 20.12%

Updated: November 14 2023

huggingface.co

nvidia/segformer-b1-finetuned-ade-512-512

Total runs: 1.2M

Run Growth: 159.2K

Growth Rate: 12.72%

Updated: August 06 2022

huggingface.co

nvidia/dragon-multiturn-context-encoder

Total runs: 759.5K

Run Growth: 3.2K

Growth Rate: 0.42%

Updated: May 24 2024

huggingface.co

nvidia/dragon-multiturn-query-encoder

Total runs: 758.5K

Run Growth: 45.7K

Growth Rate: 6.01%

Updated: May 24 2024

huggingface.co

nvidia/parakeet-rnnt-0.6b

Total runs: 742.7K

Run Growth: 599.5K

Growth Rate: 82.56%

Updated: January 03 2024

huggingface.co

nvidia/bigvgan_v2_22khz_80band_256x

Total runs: 575.4K

Run Growth: -1.2M

Growth Rate: -175.69%

Updated: September 05 2024

huggingface.co

nvidia/bigvgan_v2_44khz_128band_512x

Total runs: 333.2K

Run Growth: 221.1K

Growth Rate: 65.56%

Updated: September 05 2024

huggingface.co

nvidia/NV-Embed-v2

Total runs: 273.2K

Run Growth: 58.7K

Growth Rate: 23.39%

Updated: November 30 2024

huggingface.co

nvidia/MambaVision-B-1K

Total runs: 258.5K

Run Growth: 28.8K

Growth Rate: 9.56%

Updated: July 25 2024

huggingface.co

nvidia/MambaVision-S-1K

Total runs: 250.2K

Run Growth: 40.8K

Growth Rate: 13.60%

Updated: July 25 2024

huggingface.co

nvidia/Cosmos-1.0-Diffusion-7B-Text2World

Total runs: 233.3K

Run Growth: 225.7K

Growth Rate: 99.31%

Updated: January 10 2025

huggingface.co

nvidia/parakeet-tdt-1.1b

Total runs: 178.8K

Run Growth: 30.2K

Growth Rate: 17.21%

Updated: April 30 2024

huggingface.co

nvidia/Aegis-AI-Content-Safety-LlamaGuard-Defensive-1.0

Total runs: 169.4K

Run Growth: 105.7K

Growth Rate: 62.34%

Updated: January 24 2025

huggingface.co

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Total runs: 154.2K

Run Growth: -194.0K

Growth Rate: -116.99%

Updated: October 25 2024

huggingface.co

nvidia/domain-classifier

Total runs: 127.8K

Run Growth: 10.5K

Growth Rate: 18.58%

Updated: January 24 2025

huggingface.co

nvidia/parakeet-ctc-1.1b

Total runs: 119.0K

Run Growth: -14.1K

Growth Rate: -11.98%

Updated: January 13 2024

huggingface.co

nvidia/segformer-b5-finetuned-ade-640-640

Total runs: 119.0K

Run Growth: 6.6K

Growth Rate: 5.50%

Updated: August 06 2022

huggingface.co

nvidia/Llama-3_1-Nemotron-51B-Instruct

Total runs: 97.1K

Run Growth: 7.6K

Growth Rate: 7.71%

Updated: October 13 2024

huggingface.co

nvidia/Cosmos-1.0-Diffusion-14B-Text2World

Total runs: 91.9K

Run Growth: 89.5K

Growth Rate: 98.98%

Updated: January 10 2025

huggingface.co

nvidia/segformer-b0-finetuned-ade-512-512

Total runs: 63.2K

Run Growth: 5.3K

Growth Rate: 8.51%

Updated: January 14 2024

huggingface.co

nvidia/bigvgan_v2_24khz_100band_256x

Total runs: 53.8K

Run Growth: 47.3K

Growth Rate: 88.70%

Updated: September 05 2024

huggingface.co

nvidia/NVLM-D-72B

Total runs: 49.4K

Run Growth: 41.0K

Growth Rate: 85.79%

Updated: January 14 2025

huggingface.co

nvidia/mit-b0

Total runs: 43.9K

Run Growth: -119.7K

Growth Rate: -284.27%

Updated: November 15 2023

huggingface.co

nvidia/parakeet-rnnt-1.1b

Total runs: 41.5K

Run Growth: -1.7M

Growth Rate: -4176.66%

Updated: January 03 2024

huggingface.co

nvidia/stt_en_conformer_transducer_xlarge

Total runs: 39.3K

Run Growth: -23.6K

Growth Rate: -54.22%

Updated: October 29 2022

huggingface.co

nvidia/mit-b1

Total runs: 37.2K

Run Growth: 3.6K

Growth Rate: 9.10%

Updated: August 06 2022

huggingface.co

nvidia/segformer-b2-finetuned-ade-512-512

Total runs: 36.9K

Run Growth: 19.3K

Growth Rate: 54.30%

Updated: August 06 2022

huggingface.co

nvidia/mit-b5

Total runs: 20.6K

Run Growth: 1.3K

Growth Rate: 5.90%

Updated: August 06 2022

huggingface.co

nvidia/mit-b2

Total runs: 20.3K

Run Growth: 1.6K

Growth Rate: 7.71%

Updated: August 06 2022

huggingface.co

nvidia/Cosmos-1.0-Diffusion-7B-Video2World

Total runs: 19.8K

Run Growth: 6.7K

Growth Rate: 84.20%

Updated: February 08 2025

huggingface.co

nvidia/segformer-b5-finetuned-cityscapes-1024-1024

Total runs: 17.8K

Run Growth: -46.2K

Growth Rate: -288.52%

Updated: August 09 2022

huggingface.co

nvidia/canary-1b

Total runs: 17.3K

Run Growth: 7.5K

Growth Rate: 44.69%

Updated: May 08 2024

huggingface.co

nvidia/quality-classifier-deberta

Total runs: 16.5K

Run Growth: 12.6K

Growth Rate: 84.30%

Updated: January 31 2025

huggingface.co

nvidia/Mistral-NeMo-Minitron-8B-Base

Total runs: 15.4K

Run Growth: 8.3K

Growth Rate: 52.42%

Updated: August 22 2024

huggingface.co

nvidia/segformer-b1-finetuned-cityscapes-1024-1024

Total runs: 12.0K

Run Growth: 4.5K

Growth Rate: 39.17%

Updated: August 09 2022

huggingface.co

nvidia/Cosmos-1.0-Diffusion-14B-Video2World

Total runs: 11.7K

Run Growth: 4.7K

Growth Rate: 84.35%

Updated: February 08 2025

huggingface.co

nvidia/Llama3-ChatQA-1.5-8B

Total runs: 10.9K

Run Growth: 1.5K

Growth Rate: 13.28%

Updated: May 24 2024

huggingface.co

nvidia/segformer-b4-finetuned-ade-512-512

Total runs: 10.5K

Run Growth: 687

Growth Rate: 6.63%

Updated: August 06 2022

huggingface.co

nvidia/stt_en_conformer_ctc_large

Total runs: 9.8K

Run Growth: 1.7K

Growth Rate: 17.84%

Updated: October 28 2022

huggingface.co

nvidia/mit-b4

Total runs: 9.2K

Run Growth: 6.3K

Growth Rate: 69.92%

Updated: August 06 2022

huggingface.co

nvidia/parakeet-tdt_ctc-110m

Total runs: 8.7K

Run Growth: -15.6K

Growth Rate: -181.03%

Updated: October 22 2024

huggingface.co

nvidia/segformer-b3-finetuned-ade-512-512

Total runs: 8.5K

Run Growth: 2.7K

Growth Rate: 32.42%

Updated: August 06 2022

huggingface.co

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Total runs: 5.9K

Run Growth: -6.0K

Growth Rate: -92.12%

Updated: October 15 2024

huggingface.co

nvidia/Hymba-1.5B-Base

Total runs: 5.4K

Run Growth: 7.9K

Growth Rate: 84.99%

Updated: January 02 2025

huggingface.co

nvidia/NV-Embed-v1

Total runs: 5.2K

Run Growth: -1.5K

Growth Rate: -26.67%

Updated: November 30 2024

huggingface.co

nvidia/Cosmos-1.0-Prompt-Upsampler-12B-Text2World

Total runs: 5.1K

Run Growth: 4.0K

Growth Rate: 75.24%

Updated: January 10 2025

huggingface.co

nvidia/Cosmos-1.0-Tokenizer-CV8x8x8

Total runs: 5.1K

Run Growth: 3.6K

Growth Rate: 68.06%

Updated: January 12 2025

huggingface.co

nvidia/segformer-b0-finetuned-cityscapes-1024-1024

Total runs: 5.1K

Run Growth: 3.5K

Growth Rate: 68.58%

Updated: August 08 2022

huggingface.co

nvidia/Cosmos-1.0-Guardrail

Total runs: 4.9K

Run Growth: 3.3K

Growth Rate: 65.24%

Updated: January 10 2025

huggingface.co

nvidia/parakeet-ctc-0.6b

Total runs: 4.7K

Run Growth: 3.5K

Growth Rate: 76.54%

Updated: August 22 2024

huggingface.co

nvidia/diar_sortformer_4spk-v1

Total runs: 4.0K

Run Growth: 3.5K

Growth Rate: 95.78%

Updated: February 03 2025

huggingface.co

nvidia/bigvgan_v2_22khz_80band_fmax8k_256x

Total runs: 3.8K

Run Growth: -1.2K

Growth Rate: -29.61%

Updated: September 05 2024

huggingface.co

nvidia/Mistral-NeMo-Minitron-8B-Instruct

Total runs: 3.7K

Run Growth: 697

Growth Rate: 18.85%

Updated: October 09 2024

huggingface.co

nvidia/Eagle2-1B

Total runs: 3.4K

Run Growth: 3.2K

Growth Rate: 91.94%

Updated: January 28 2025

huggingface.co

nvidia/Eagle2-9B

Total runs: 3.2K

Run Growth: 3.0K

Growth Rate: 95.56%

Updated: January 28 2025

huggingface.co

nvidia/MambaVision-T-1K

Total runs: 3.1K

Run Growth: 71

Growth Rate: 2.21%

Updated: July 25 2024

huggingface.co

nvidia/Cosmos-0.1-Tokenizer-CV4x8x8

Total runs: 2.9K

Run Growth: 2.5K

Growth Rate: 86.28%

Updated: November 11 2024

huggingface.co

nvidia/groupvit-gcc-yfcc

Total runs: 2.8K

Run Growth: 852

Growth Rate: 31.19%

Updated: September 26 2022

huggingface.co

nvidia/prompt-task-and-complexity-classifier

Total runs: 2.7K

Run Growth: 2.0K

Growth Rate: 76.87%

Updated: January 24 2025

huggingface.co

nvidia/segformer-b2-finetuned-cityscapes-1024-1024

Total runs: 2.6K

Run Growth: 866

Growth Rate: 35.09%

Updated: August 09 2022

huggingface.co

nvidia/audio-codec-44khz

Total runs: 2.5K

Run Growth: 2.4K

Growth Rate: 98.87%

Updated: December 06 2024

huggingface.co

nvidia/mit-b3

Total runs: 2.5K

Run Growth: 844

Growth Rate: 35.96%

Updated: August 06 2022

huggingface.co

nvidia/OpenMath2-Llama3.1-8B

Total runs: 2.5K

Run Growth: 1.3K

Growth Rate: 51.99%

Updated: November 25 2024

huggingface.co

nvidia/stt_en_conformer_ctc_small

Total runs: 2.3K

Run Growth: -4.6K

Growth Rate: -213.78%

Updated: June 12 2023

huggingface.co

nvidia/Cosmos-1.0-Diffusion-7B-Decoder-DV8x16x16ToCV8x8x8

Total runs: 2.3K

Run Growth: 1.9K

Growth Rate: 79.12%

Updated: January 10 2025

huggingface.co

nvidia/stt_fr_fastconformer_hybrid_large_pc

Total runs: 2.1K

Run Growth: 1.7K

Growth Rate: 76.11%

Updated: September 12 2023

huggingface.co

nvidia/stt_en_fastconformer_transducer_large

Total runs: 2.1K

Run Growth: 546

Growth Rate: 25.29%

Updated: June 08 2023

huggingface.co

nvidia/Hymba-1.5B-Instruct

Total runs: 2.1K

Run Growth: -2.8K

Growth Rate: -133.41%

Updated: January 02 2025

huggingface.co

nvidia/Llama3-ChatQA-2-8B

Total runs: 2.0K

Run Growth: -551

Growth Rate: -27.58%

Updated: September 10 2024

huggingface.co

nvidia/parakeet-tdt_ctc-1.1b

Total runs: 1.9K

Run Growth: 122

Growth Rate: 6.18%

Updated: August 26 2024

huggingface.co

nvidia/C-RADIO

Total runs: 1.9K

Run Growth: 520

Growth Rate: 27.72%

Updated: December 18 2024

huggingface.co

nvidia/bigvgan_v2_44khz_128band_256x

Total runs: 1.9K

Run Growth: 1.4K

Growth Rate: 71.65%

Updated: September 05 2024

huggingface.co

nvidia/AceMath-1.5B-Instruct

Total runs: 1.7K

Run Growth: 1.5K

Growth Rate: 100.00%

Updated: January 17 2025

huggingface.co

nvidia/AceMath-7B-Instruct

Total runs: 1.7K

Run Growth: 1.7K

Growth Rate: 100.00%

Updated: January 17 2025

huggingface.co

nvidia/segformer-b0-finetuned-cityscapes-768-768

Total runs: 1.7K

Run Growth: 1.2K

Growth Rate: 73.76%

Updated: August 09 2022

huggingface.co

nvidia/Cosmos-0.1-Tokenizer-CI16x16

Total runs: 1.7K

Run Growth: -202

Growth Rate: -12.18%

Updated: December 25 2024

huggingface.co

nvidia/Cosmos-0.1-Tokenizer-CI8x8

Total runs: 1.6K

Run Growth: -92

Growth Rate: -5.63%

Updated: November 11 2024

huggingface.co

nvidia/segformer-b3-finetuned-cityscapes-1024-1024

Total runs: 1.5K

Run Growth: 132

Growth Rate: 9.31%

Updated: August 09 2022

huggingface.co

nvidia/stt_en_citrinet_256_ls

Total runs: 1.4K

Run Growth: 921

Growth Rate: 67.87%

Updated: July 15 2022

huggingface.co

nvidia/Cosmos-1.0-Autoregressive-4B

Total runs: 1.4K

Run Growth: 785

Growth Rate: 53.58%

Updated: February 11 2025

huggingface.co

nvidia/Cosmos-1.0-Autoregressive-5B-Video2World

Total runs: 1.3K

Run Growth: 776

Growth Rate: 60.82%

Updated: February 08 2025

huggingface.co

nvidia/Cosmos-1.0-Tokenizer-DV8x16x16

Total runs: 1.2K

Run Growth: 578

Growth Rate: 47.73%

Updated: January 12 2025

huggingface.co

nvidia/Cosmos-0.1-Tokenizer-CV8x16x16

Total runs: 1.2K

Run Growth: 930

Growth Rate: 76.99%

Updated: November 11 2024

huggingface.co

nvidia/MM-Embed

Total runs: 1.2K

Run Growth: 57

Growth Rate: 5.14%

Updated: November 06 2024

huggingface.co

nvidia/low-frame-rate-speech-codec-22khz

Total runs: 1.2K

Run Growth: 249

Growth Rate: 22.78%

Updated: December 12 2024

huggingface.co

nvidia/stt_ru_conformer_transducer_large

Total runs: 1.1K

Run Growth: -3.6K

Growth Rate: -313.71%

Updated: November 01 2022

huggingface.co

nvidia/bigvgan_22khz_80band

Total runs: 1.1K

Run Growth: 236

Growth Rate: 22.08%

Updated: July 22 2024

huggingface.co

nvidia/stt_ru_fastconformer_hybrid_large_pc

Total runs: 1.0K

Run Growth: 609

Growth Rate: 60.48%

Updated: May 26 2023

huggingface.co

nvidia/segformer-b4-finetuned-cityscapes-1024-1024

Total runs: 991

Run Growth: 366

Growth Rate: 40.40%

Updated: April 24 2023

huggingface.co

nvidia/RADIO-H

Total runs: 981

Run Growth: -979

Growth Rate: -85.80%

Updated: December 02 2024

huggingface.co

nvidia/segformer-b0-finetuned-cityscapes-512-1024

Total runs: 946

Run Growth: 628

Growth Rate: 73.28%

Updated: August 09 2022

huggingface.co

nvidia/Llama-3.1-8B-Instruct-FP8

Total runs: 946

Run Growth: 285

Growth Rate: 30.61%

Updated: January 10 2025

huggingface.co

nvidia/stt_fr_conformer_ctc_large

Total runs: 912

Run Growth: 696

Growth Rate: 74.60%

Updated: October 29 2022

huggingface.co

nvidia/stt_en_fastconformer_ctc_large

Total runs: 872

Run Growth: -2.5K

Growth Rate: -290.80%

Updated: January 02 2024

huggingface.co

nvidia/RADIO

Total runs: 846

Run Growth: 99

Growth Rate: 11.99%

Updated: December 10 2024

huggingface.co

nvidia/Eagle2-2B

Total runs: 807

Run Growth: 776

Growth Rate: 96.64%

Updated: January 28 2025

huggingface.co

nvidia/Nemotron-4-Minitron-8B-Base

Total runs: 778

Run Growth: 0

Growth Rate: 0.00%

Updated: August 15 2024

huggingface.co

nvidia/Cosmos-1.0-Autoregressive-13B-Video2World

Total runs: 741

Run Growth: 433

Growth Rate: 56.60%

Updated: February 08 2025

huggingface.co

nvidia/Cosmos-1.0-Autoregressive-12B

Total runs: 732

Run Growth: 434

Growth Rate: 60.87%

Updated: February 11 2025

huggingface.co

nvidia/Llama-3.1-405B-Instruct-FP8

Total runs: 719

Run Growth: 429

Growth Rate: 61.11%

Updated: January 10 2025

Precision	MMLU	TPS
FP16	82.5	1356.92
FP8	82.3	2040.30
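
The two rows above can be compared directly. The following is a minimal sketch, not part of the model card: it copies the four numbers from the table and computes the FP8-to-FP16 ratio for the TPS column and the difference in MMLU score. The dictionary layout and the helper name `compare` are illustrative assumptions, and TPS is treated only as "the throughput figure the table reports", since the measurement setup is not given here.

```python
# Values copied verbatim from the Precision / MMLU / TPS table above.
# The structure and names below are illustrative, not from the model card.
results = {
    "FP16": {"mmlu": 82.5, "tps": 1356.92},
    "FP8": {"mmlu": 82.3, "tps": 2040.30},
}


def compare(baseline: str, candidate: str) -> dict:
    """Return the TPS ratio and the MMLU difference of candidate vs. baseline."""
    base, cand = results[baseline], results[candidate]
    return {
        # Ratio > 1 means the candidate reports a higher TPS figure.
        "tps_speedup": round(cand["tps"] / base["tps"], 3),
        # Negative means the candidate scores lower on MMLU.
        "mmlu_delta": round(cand["mmlu"] - base["mmlu"], 2),
    }


summary = compare("FP16", "FP8")
print(summary)
```

On these figures the FP8 row reports roughly 1.5 times the TPS of the FP16 row, with an MMLU score 0.2 points lower.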

nvidia / Llama-3.1-70B-Instruct-FP8

Introduction of Llama-3.1-70B-Instruct-FP8

Model Details of Llama-3.1-70B-Instruct-FP8

Model Overview

Description:

Third-Party Community Consideration

License/Terms of Use:

Model Architecture:

Input:

Output:

Software Integration:

Model Version(s):

Datasets:

Inference:

Post Training Quantization

Usage

Deploy with TensorRT-LLM

Evaluation

Deploy with vLLM

Runs of nvidia Llama-3.1-70B-Instruct-FP8 on huggingface.co

More Information About Llama-3.1-70B-Instruct-FP8 huggingface.co Model

Llama-3.1-70B-Instruct-FP8 huggingface.co

Llama-3.1-70B-Instruct-FP8 huggingface.co Url

Url of Llama-3.1-70B-Instruct-FP8

Provider of Llama-3.1-70B-Instruct-FP8 huggingface.co

Other API from nvidia