Eagle2-9B (nvidia/Eagle2-9B on huggingface.co)


Eagle-2

[📂 GitHub] [📜 Eagle2 Tech Report] [🗨️ Chat Demo] [🤗 HF Demo]

Introduction

We are thrilled to release our latest Eagle2 series Vision-Language Model. Open-source Vision-Language Models (VLMs) have made significant strides in narrowing the gap with proprietary models. However, critical details about data strategies and implementation are often missing, limiting reproducibility and innovation. In this project, we focus on VLM post-training from a data-centric perspective, sharing insights into building effective data strategies from scratch. By combining these strategies with robust training recipes and model design, we introduce Eagle2, a family of performant VLMs. Our work aims to empower the open-source community to develop competitive VLMs with transparent processes.

In this repo, we are open-sourcing Eagle2-9B, which balances performance against inference speed.

Model Zoo

We provide the following models:

model name	LLM	Vision	Max Length	HF Link
Eagle2-1B	Qwen2.5-0.5B-Instruct	Siglip	16K	🤗 link
Eagle2-2B	Qwen2.5-1.5B-Instruct	Siglip	16K	🤗 link
Eagle2-9B	Qwen2.5-7B-Instruct	Siglip+ConvNext	16K	🤗 link

Benchmark Results

Benchmark	MiniCPM-Llama3-V-2_5	InternVL-Chat-V1-5	InternVL2-8B	QwenVL2-7B	Eagle2-9B
Model Size	8.5B	25.5B	8.1B	8.3B	8.9B

DocVQA _test	84.8	90.9	91.6	94.5	92.6
ChartQA _test	-	83.8	83.3	83.0	86.4
InfoVQA _test	-	72.5	74.8	74.3	77.2
TextVQA _val	76.6	80.6	77.4	84.3	83.0
OCRBench	725	724	794	845	868
MME _sum	2024.6	2187.8	2210.3	2326.8	2260
RealWorldQA	63.5	66.0	64.4	70.1	69.3
AI2D _test	78.4	80.7	83.8	-	83.9
MMMU _val	45.8	45.2 / 46.8	49.3 / 51.8	54.1	56.1
MMBench_V11 _test	-	-	79.5	79.4	80.6
MMVet _GPT-4-Turbo	52.8	55.4	54.2	62.0	62.2
SEED-Image	72.3	76.0	76.2	-	77.1
HallBench _avg	42.4	49.3	45.2	50.6	49.3
MathVista _testmini	54.3	53.5	58.3	58.2	63.8
MMstar	-	-	60.9	60.7	62.6

Quick Start

We provide a demo inference script to help you quickly start using the model. We support different input types:

pure text input
single image input
multiple image input
video input

0. Install the dependencies

pip install transformers==4.37.2
pip install flash-attn

Note: The latest version of transformers is not compatible with the model, so install the pinned version shown above.
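Because the pinned version matters, a quick check before loading the model can catch a mismatch early. The following is a minimal sketch, assuming Python 3.8+ so that the standard-library `importlib.metadata` is available; the helper names `is_pinned_version` and `check_transformers` are hypothetical and not part of the Eagle2 code.

```python
from importlib import metadata

REQUIRED_TRANSFORMERS = "4.37.2"  # version pinned in the install step above


def is_pinned_version(installed: str, required: str = REQUIRED_TRANSFORMERS) -> bool:
    """Return True when the installed version string matches the pin exactly."""
    return installed.strip() == required


def check_transformers() -> str:
    """Describe whether the installed transformers matches the pin (hypothetical helper)."""
    try:
        installed = metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return "transformers is not installed"
    if is_pinned_version(installed):
        return f"transformers {installed} matches the pinned version"
    return f"transformers {installed} found, but {REQUIRED_TRANSFORMERS} is expected"


print(check_transformers())
```

Running this before the model worker script gives a readable message instead of a failure deep inside model loading.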

1. Prepare the Model worker


"""
A model worker executes the model.
Copied and modified from https://github.com/OpenGVLab/InternVL/blob/main/streamlit_demo/model_worker.py
"""
# Importing torch before transformers can cause `segmentation fault`
from transformers import AutoModel, AutoTokenizer, TextIteratorStreamer, AutoConfig

import argparse
import base64
import json
import os
import decord
import threading
import time
from io import BytesIO
from threading import Thread
import math
import requests
import torch
import torchvision.transforms as T
from PIL import Image
from torchvision.transforms.functional import InterpolationMode
import numpy as np


IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)

SIGLIP_MEAN = (0.5, 0.5, 0.5)
SIGLIP_STD = (0.5, 0.5, 0.5)


def get_seq_frames(total_num_frames, desired_num_frames=-1, stride=-1):
    """
    Calculate the indices of frames to extract from a video.

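    Parameters:
    total_num_frames (int): Total number of frames in the video.
    desired_num_frames (int): Desired number of frames to extract (uniform sampling).
    stride (int): Step between extracted frames (strided sampling).
    Exactly one of desired_num_frames and stride must be positive.

    Returns:
    list: List of indices of frames to extract.
    """

    # Parenthesized so that "exactly one of the two is set" is what actually gets checked
    assert (desired_num_frames > 0 or stride > 0) and not (desired_num_frames > 0 and stride > 0)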

    if stride > 0:
        return list(range(0, total_num_frames, stride))
    
    # Calculate the size of each segment from which a frame will be extracted
    seg_size = float(total_num_frames - 1) / desired_num_frames

    seq = []
    for i in range(desired_num_frames):
        # Calculate the start and end indices of each segment
        start = int(np.round(seg_size * i))
        end = int(np.round(seg_size * (i + 1)))

        # Append the middle index of the segment to the list
        seq.append((start + end) // 2)

    return seq

def build_video_prompt(meta_list, num_frames, time_position=False):
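    # If time_position is True, each frame's timestamp is included in the prompt.
    # It can be set by 1. passing time_position, or 2. the TIME_POSITION env variable.
    # Environment values are strings, so parse them instead of relying on truthiness
    # (otherwise TIME_POSITION=0 would still enable timestamps).
    env_value = os.environ.get("TIME_POSITION")
    if env_value is not None:
        time_position = env_value.strip().lower() in ("1", "true", "yes")
    prefix = "This is a video:\n"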
    for i in range(num_frames):
        if time_position:
            frame_txt = f"Frame {i+1} sampled at {meta_list[i]:.2f} seconds: <image>\n"
        else:
            frame_txt = f"Frame {i+1}: <image>\n"
        prefix += frame_txt
    return prefix

def load_video(video_path, num_frames=64, frame_cache_root=None):
    # frame_cache_root is accepted for interface compatibility but is not used here.
    if isinstance(video_path, str):
        video = decord.VideoReader(video_path)
    else:
        # Only a file path is supported; dict inputs such as {"video_path": ...} are not.
        raise ValueError(f'Unsupported video input, expected a file path: {video_path}')
    fps = video.get_avg_fps()
    sampled_frames = get_seq_frames(len(video), num_frames)
    sampled_timestamps = [i / fps for i in sampled_frames]
    frames = video.get_batch(sampled_frames).asnumpy()
    images = [Image.fromarray(frame) for frame in frames]

    return images, build_video_prompt(sampled_timestamps, len(images), time_position=True)

def load_image(image):
    if isinstance(image, str) and os.path.exists(image):
        return Image.open(image)
    elif isinstance(image, dict):
        if 'disk_path' in image:
            return Image.open(image['disk_path'])
        elif 'base64' in image:
            return Image.open(BytesIO(base64.b64decode(image['base64'])))
        elif 'url' in image:
            response = requests.get(image['url'])
            return Image.open(BytesIO(response.content))
        elif 'bytes' in image:
            return Image.open(BytesIO(image['bytes']))
        else:
            raise ValueError(f'Invalid image: {image}')
    else:
        raise ValueError(f'Invalid image: {image}')

def build_transform(input_size, norm_type='imagenet'):
    if norm_type == 'imagenet':
        MEAN, STD = IMAGENET_MEAN, IMAGENET_STD
    elif norm_type == 'siglip':
        MEAN, STD = SIGLIP_MEAN, SIGLIP_STD
    else:
        raise ValueError(f'Unknown norm_type: {norm_type}')

    transform = T.Compose([
        T.Lambda(lambda img: img.convert('RGB') if img.mode != 'RGB' else img),
        T.Resize((input_size, input_size), interpolation=InterpolationMode.BICUBIC),
        T.ToTensor(),
        T.Normalize(mean=MEAN, std=STD)
    ])
    return transform


def find_closest_aspect_ratio(aspect_ratio, target_ratios, width, height, image_size):
    """
    Earlier versions focused mainly on the aspect ratio.
    This version also takes the area ratio into account.
    """
    best_factor = float('-inf')
    best_ratio = (1, 1)
    area = width * height
    for ratio in target_ratios:
        target_aspect_ratio = ratio[0] / ratio[1]
        area_ratio = (ratio[0] * ratio[1] * image_size * image_size) / area
        # A tiled area above 60% of the original image area is enough,
        # so the area term is capped at 0.6.
        factor_based_on_area_n_ratio = min(area_ratio, 0.6) * \
                                     min(target_aspect_ratio / aspect_ratio, aspect_ratio / target_aspect_ratio)
        
        if factor_based_on_area_n_ratio > best_factor:
            best_factor = factor_based_on_area_n_ratio
            best_ratio = ratio
        
    return best_ratio


def dynamic_preprocess(image, min_num=1, max_num=6, image_size=448, use_thumbnail=False):
    orig_width, orig_height = image.size
    aspect_ratio = orig_width / orig_height

    # calculate the existing image aspect ratio
    target_ratios = set(
        (i, j) for n in range(min_num, max_num + 1) for i in range(1, n + 1) for j in range(1, n + 1) if
        i * j <= max_num and i * j >= min_num)
    target_ratios = sorted(target_ratios, key=lambda x: x[0] * x[1])

    # find the closest aspect ratio to the target
    target_aspect_ratio = find_closest_aspect_ratio(
        aspect_ratio, target_ratios, orig_width, orig_height, image_size)

    # calculate the target width and height
    target_width = image_size * target_aspect_ratio[0]
    target_height = image_size * target_aspect_ratio[1]
    blocks = target_aspect_ratio[0] * target_aspect_ratio[1]

    # resize the image
    resized_img = image.resize((target_width, target_height))
    processed_images = []
    for i in range(blocks):
        box = (
            (i % (target_width // image_size)) * image_size,
            (i // (target_width // image_size)) * image_size,
            ((i % (target_width // image_size)) + 1) * image_size,
            ((i // (target_width // image_size)) + 1) * image_size
        )
        # split the image
        split_img = resized_img.crop(box)
        processed_images.append(split_img)
    assert len(processed_images) == blocks
    if use_thumbnail and len(processed_images) != 1:
        thumbnail_img = image.resize((image_size, image_size))
        processed_images.append(thumbnail_img)
    return processed_images

def split_model(model_path, device):

    device_map = {}
    world_size = torch.cuda.device_count()
    config = AutoConfig.from_pretrained(model_path, trust_remote_code=True)
    num_layers = config.llm_config.num_hidden_layers

    print('world_size', world_size)
    num_layers_per_gpu_ = math.floor(num_layers / (world_size - 1))
    num_layers_per_gpu = [num_layers_per_gpu_] * world_size
    num_layers_per_gpu[device] = num_layers - num_layers_per_gpu_ * (world_size-1)
    print(num_layers_per_gpu)
    layer_cnt = 0
    for i, num_layer in enumerate(num_layers_per_gpu):
        for j in range(num_layer):
            device_map[f'language_model.model.layers.{layer_cnt}'] = i
            layer_cnt += 1
    device_map['vision_model'] = device
    device_map['mlp1'] = device
    device_map['language_model.model.tok_embeddings'] = device
    device_map['language_model.model.embed_tokens'] = device
    device_map['language_model.output'] = device
    device_map['language_model.model.norm'] = device
    device_map['language_model.lm_head'] = device
    device_map['language_model.model.rotary_emb'] = device
    device_map[f'language_model.model.layers.{num_layers - 1}'] = device
    return device_map

class ModelWorker:
    def __init__(self, model_path, model_name,
                 load_8bit, device):

        if model_path.endswith('/'):
            model_path = model_path[:-1]
        if model_name is None:
            model_paths = model_path.split('/')
            if model_paths[-1].startswith('checkpoint-'):
                self.model_name = model_paths[-2] + '_' + model_paths[-1]
            else:
                self.model_name = model_paths[-1]
        else:
            self.model_name = model_name

        print(f'Loading the model {self.model_name}')

        tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True, use_fast=False)
        tokens_to_keep = ['<box>', '</box>', '<ref>', '</ref>']
        tokenizer.additional_special_tokens = [item for item in tokenizer.additional_special_tokens if item not in tokens_to_keep]
        self.tokenizer = tokenizer
        config = AutoConfig.from_pretrained(model_path, trust_remote_code=True)
        model_type = config.vision_config.model_type
        self.device = torch.cuda.current_device()
        if model_type == 'siglip_vision_model':
            self.norm_type = 'siglip'
        elif model_type == 'MOB':
            self.norm_type = 'siglip'
        else:
            self.norm_type = 'imagenet'

        if any(x in model_path.lower() for x in ['34b']):
            device_map = split_model(model_path, self.device)
        else:
            device_map = None
        
        if device_map is not None:    
            self.model = AutoModel.from_pretrained(model_path, torch_dtype=torch.bfloat16,
                                               low_cpu_mem_usage=True,
                                               device_map=device_map, 
                                               trust_remote_code=True,
                                               load_in_8bit=load_8bit).eval()
        else:
            self.model = AutoModel.from_pretrained(model_path, torch_dtype=torch.bfloat16,
                                               trust_remote_code=True,
                                               load_in_8bit=load_8bit).eval()  

        if not load_8bit and device_map is None:
            self.model = self.model.to(device)
        self.load_8bit = load_8bit
        # Stored because reload_model reads self.device_map
        self.device_map = device_map
        
        self.model_path = model_path
        self.image_size = self.model.config.force_image_size
        self.context_len = tokenizer.model_max_length
        self.per_tile_len = 256

    def reload_model(self):
        del self.model
        torch.cuda.empty_cache()
        if self.device == 'auto':
            os.environ['CUDA_LAUNCH_BLOCKING'] = '1'
            # This can make distributed deployment work properly
            self.model = AutoModel.from_pretrained(
                self.model_path,
                load_in_8bit=self.load_8bit,
                torch_dtype=torch.bfloat16,
                device_map=self.device_map,
                trust_remote_code=True).eval()
        else:
            self.model = AutoModel.from_pretrained(
                self.model_path,
                load_in_8bit=self.load_8bit,
                torch_dtype=torch.bfloat16,
                trust_remote_code=True).eval()
        if not self.load_8bit and not self.device == 'auto':
            self.model = self.model.cuda()

    @torch.inference_mode()
    def generate(self, params):
        system_message = params['prompt'][0]['content']
        send_messages = params['prompt'][1:]
        max_input_tiles = params['max_input_tiles']
        temperature = params['temperature']
        top_p = params['top_p']
        max_new_tokens = params['max_new_tokens']
        repetition_penalty = params['repetition_penalty']
        video_frame_num = params.get('video_frame_num', 64)
        do_sample = temperature > 0.0

        global_image_cnt = 0
        history, pil_images, max_input_tile_list = [], [], []
        for message in send_messages:
            if message['role'] == 'user':
                prefix = ''
                if 'image' in message:
                    for image_data in message['image']:
                        pil_images.append(load_image(image_data))
                        prefix = prefix + f'<image {global_image_cnt + 1}><image>\n'
                        global_image_cnt += 1
                        max_input_tile_list.append(max_input_tiles)
                if 'video' in message:
                    for video_data in message['video']:
                        video_frames, tmp_prefix = load_video(video_data, num_frames=video_frame_num)
                        pil_images.extend(video_frames)
                        prefix = prefix + tmp_prefix
                        global_image_cnt += len(video_frames)
                        max_input_tile_list.extend([1] * len(video_frames))
                content = prefix + message['content']
                history.append([content, ])
            else:
                history[-1].append(message['content'])
        question, history = history[-1][0], history[:-1]

        if global_image_cnt == 1:
            question = question.replace('<image 1><image>\n', '<image>\n')
            history = [[item[0].replace('<image 1><image>\n', '<image>\n'), item[1]] for item in history]


        if len(max_input_tile_list) != len(pil_images):
            raise ValueError(
                'The number of max_input_tile_list and pil_images should be the same, '
                f'got {len(max_input_tile_list)} and {len(pil_images)}.')

        old_system_message = self.model.system_message
        self.model.system_message = system_message
        
        transform = build_transform(input_size=self.image_size, norm_type=self.norm_type)
        if len(pil_images) > 0:
            max_input_tiles_limited_by_context = params['max_input_tiles']
            while True:
                image_tiles = []
                for current_max_input_tiles, pil_image in zip(max_input_tile_list, pil_images):
                    if self.model.config.dynamic_image_size:
                        tiles = dynamic_preprocess(
                            pil_image, image_size=self.image_size, max_num=min(current_max_input_tiles, max_input_tiles_limited_by_context),
                            use_thumbnail=self.model.config.use_thumbnail)
                    else:
                        tiles = [pil_image]
                    image_tiles += tiles
                # Stop once the visual tokens fit into the context window
                if (len(image_tiles) * self.per_tile_len < self.context_len):
                    break
                else:
                    # Otherwise lower the per-image tile cap and retry
                    max_input_tiles_limited_by_context -= 2

                if max_input_tiles_limited_by_context < 1:
                    break
                    
            pixel_values = [transform(item) for item in image_tiles]
            pixel_values = torch.stack(pixel_values).to(self.model.device, dtype=torch.bfloat16)
            print(f'Split images to {pixel_values.shape}')
        else:
            pixel_values = None

        generation_config = dict(
            num_beams=1,
            max_new_tokens=max_new_tokens,
            do_sample=do_sample,
            temperature=temperature,
            repetition_penalty=repetition_penalty,
            max_length=self.context_len,
            top_p=top_p,
        )

        response = self.model.chat(
            tokenizer=self.tokenizer,
            pixel_values=pixel_values,
            question=question,
            history=history,
            return_history=False,
            generation_config=generation_config,
        )
        self.model.system_message = old_system_message
        return {'text': response, 'error_code': 0}





if __name__ == '__main__':
    parser = argparse.ArgumentParser()
    parser.add_argument('--model-path', type=str, default='nvidia/Eagle2-9B')
    parser.add_argument('--model-name', type=str, default='Eagle2-9B')
    parser.add_argument('--device', type=str, default='cuda')
    parser.add_argument('--load-8bit', action='store_true')
    args = parser.parse_args()
    print(f'args: {args}')

    worker = ModelWorker(
                         args.model_path,
                         args.model_name,
                         args.load_8bit,
                         args.device)
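To make the tiling step in the worker easier to follow, here is a dependency-free sketch of how `dynamic_preprocess` and `find_closest_aspect_ratio` choose a tile grid. It mirrors the scoring in the script above (area coverage capped at 0.6, multiplied by aspect-ratio agreement). The function names `candidate_grids` and `pick_grid` are illustrative and not part of the Eagle2 code.

```python
def candidate_grids(min_num=1, max_num=6):
    """All (cols, rows) grids whose tile count lies in [min_num, max_num]."""
    grids = {
        (i, j)
        for i in range(1, max_num + 1)
        for j in range(1, max_num + 1)
        if min_num <= i * j <= max_num
    }
    # Smaller grids come first, so ties are resolved in favor of fewer tiles.
    return sorted(grids, key=lambda g: g[0] * g[1])


def pick_grid(width, height, image_size=448, min_num=1, max_num=6):
    """Score each grid by capped area coverage times aspect-ratio agreement."""
    aspect_ratio = width / height
    area = width * height
    best_factor, best_grid = float("-inf"), (1, 1)
    for cols, rows in candidate_grids(min_num, max_num):
        grid_ratio = cols / rows
        # Covering more than 60% of the original area earns no extra credit.
        area_term = min(cols * rows * image_size * image_size / area, 0.6)
        # 1.0 when the grid matches the image aspect ratio, smaller otherwise.
        ratio_term = min(grid_ratio / aspect_ratio, aspect_ratio / grid_ratio)
        factor = area_term * ratio_term
        if factor > best_factor:
            best_factor, best_grid = factor, (cols, rows)
    return best_grid


print(pick_grid(448, 448))    # (1, 1): a single tile is enough
print(pick_grid(1344, 896))   # (3, 2): six tiles for a 3:2 image
print(pick_grid(448, 1792))   # (1, 4): a tall image gets a tall grid
```

In the worker, the chosen grid determines the resize target (`cols * image_size` by `rows * image_size`) and the number of crops; a thumbnail tile is appended only when `use_thumbnail` is set and more than one tile was produced.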

2. Prepare the Prompt

Single image input

prompt = [
        {'role': 'system', 'content': 'You are a helpful assistant.'},
        {'role': 'user', 'content': 'Describe this image in detail.', 
            'image':[
                {'url': 'https://www.nvidia.com/content/dam/en-zz/Solutions/about-nvidia/logo-and-brand/01-nvidia-logo-vert-500x200-2c50-d@2x.png'}
            ],
        }
    ]
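Besides `url`, the `load_image` function in the worker also reads entries keyed by `disk_path`, `base64`, or `bytes`. The sketch below builds a single-image prompt from in-memory bytes using the `base64` form. The helper names are hypothetical, and the placeholder bytes stand in for a real image file that you would read yourself.

```python
import base64


def image_entry_from_bytes(raw: bytes) -> dict:
    """Wrap raw image bytes as a base64 entry, one of the dict forms load_image reads."""
    return {'base64': base64.b64encode(raw).decode('ascii')}


def single_image_prompt(question: str, image_entry: dict) -> list:
    """Build a system + user prompt in the same shape as the example above."""
    return [
        {'role': 'system', 'content': 'You are a helpful assistant.'},
        {'role': 'user', 'content': question, 'image': [image_entry]},
    ]


# Placeholder bytes; in practice use something like open(path, 'rb').read().
placeholder_bytes = b'\x89PNG\r\n\x1a\n'
prompt = single_image_prompt('Describe this image in detail.',
                             image_entry_from_bytes(placeholder_bytes))
print(list(prompt[1]['image'][0].keys()))
```

For a file already on disk, passing `{'disk_path': 'path/to/image.png'}` as the entry avoids the encoding step.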

Multiple image input

prompt = [
        {'role': 'system', 'content': 'You are a helpful assistant.'},
        {'role': 'user', 'content': 'Describe these two images in detail.', 
            'image':[
                {'url': 'https://www.nvidia.com/content/dam/en-zz/Solutions/about-nvidia/logo-and-brand/01-nvidia-logo-vert-500x200-2c50-d@2x.png'},
                {'url': 'https://www.nvidia.com/content/dam/en-zz/Solutions/about-nvidia/logo-and-brand/01-nvidia-logo-vert-500x200-2c50-d@2x.png'}
            ],
        }
    ]
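With several images, the worker's `generate` method prepends one numbered placeholder per image to the user message, and drops the numbering when the whole conversation contains exactly one image. A minimal sketch of that text handling follows; the function names are illustrative and not taken from the Eagle2 code.

```python
def build_user_content(text, num_images, start_index=0):
    """Prefix a user message with one numbered placeholder per attached image."""
    prefix = ''
    for offset in range(num_images):
        prefix += f'<image {start_index + offset + 1}><image>\n'
    return prefix + text


def simplify_single_image(content, total_images):
    """With exactly one image in the whole conversation, drop the numbering."""
    if total_images == 1:
        return content.replace('<image 1><image>\n', '<image>\n')
    return content


two = build_user_content('Describe these two images in detail.', 2)
one = simplify_single_image(build_user_content('Describe this image.', 1), 1)
print(repr(two))
print(repr(one))
```

The numbering continues across turns (`start_index` plays the role of the running image counter), so a second user message with one more image would start at `<image 3>`.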

Video input

prompt = [
        {'role': 'system', 'content': 'You are a helpful assistant.'},
        {'role': 'user', 'content': 'Describe this video in detail.', 
            'video':[
                'path/to/your/video.mp4'
            ],
        }
    ]
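For video input, the worker samples a fixed number of frames (64 by default, or `video_frame_num` in `params`) by splitting the video into equal segments and taking the middle frame of each. The sketch below reproduces that index calculation from `get_seq_frames` without NumPy; it uses Python's built-in `round`, which rounds halves to even in the same way as `np.round`. The function name and the 25 fps value are assumptions for illustration.

```python
def sample_frame_indices(total_num_frames, desired_num_frames):
    """Split the video into equal segments and take the middle frame of each."""
    seg_size = (total_num_frames - 1) / desired_num_frames
    indices = []
    for i in range(desired_num_frames):
        start = int(round(seg_size * i))
        end = int(round(seg_size * (i + 1)))
        indices.append((start + end) // 2)
    return indices


fps = 25.0  # assumed frame rate, only used to turn indices into timestamps
indices = sample_frame_indices(101, 4)
print(indices)                     # [12, 37, 62, 87]
print([i / fps for i in indices])  # [0.48, 1.48, 2.48, 3.48]
```

Each sampled frame is passed to the model as a single tile, and its timestamp is written into the prompt as `Frame N sampled at T seconds`. When a video has fewer frames than requested, some indices repeat.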

3. Generate the response

params = {
    'prompt': prompt,
    'max_input_tiles': 24,
    'temperature': 0.7,
    'top_p': 1.0,
    'max_new_tokens': 4096,
    'repetition_penalty': 1.0,
    }
response = worker.generate(params)
print(response['text'])
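The `max_input_tiles` value is an upper bound: inside `generate`, the worker lowers the per-image tile cap until the visual tokens fit the context window, counting 256 tokens per tile. The arithmetic can be sketched as follows, assuming a context length of 16384 tokens. That figure is an assumption based on the 16K max length in the model zoo table; the worker reads the real value from `tokenizer.model_max_length`.

```python
PER_TILE_LEN = 256            # visual tokens per tile, as set in ModelWorker
ASSUMED_CONTEXT_LEN = 16384   # assumption: 16K, see the model zoo table


def fits_context(num_tiles, context_len=ASSUMED_CONTEXT_LEN, per_tile_len=PER_TILE_LEN):
    """True when the visual tokens alone stay under the context length."""
    return num_tiles * per_tile_len < context_len


def max_tiles_that_fit(context_len=ASSUMED_CONTEXT_LEN, per_tile_len=PER_TILE_LEN):
    """Largest total tile count that passes the worker's strict '<' check."""
    return (context_len - 1) // per_tile_len


print(max_tiles_that_fit())   # 63 under the assumed context length
print(fits_context(24))       # True: one image at 24 tiles fits easily
print(fits_context(3 * 25))   # False: three images at 24 tiles plus a thumbnail each
```

Under this assumption, a request with several large images will end up with fewer tiles per image than `max_input_tiles` asks for, which trades resolution for fitting within the context.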

TODO

Support vLLM Inference
Provide AWQ Quantization Weights
Provide fine-tuning scripts

License/Terms of Use

The code is released under the Apache 2.0 license as found in the LICENSE file.
The pretrained model weights are released under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.
The service is a research preview intended for non-commercial use only, and is subject to the following licenses and terms:
- Model License of Qwen2.5-7B-Instruct: Apache-2.0
- Model License of PaliGemma: Gemma license

Citation

Ethical Considerations

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report security vulnerabilities or NVIDIA AI Concerns here.


More Information About Eagle2-9B

Model page: https://huggingface.co/nvidia/Eagle2-9B
Weights license (CC BY-NC 4.0): https://choosealicense.com/licenses/cc-by-nc-4.0


huggingface.co

nvidia/stt_ru_conformer_transducer_large

Total runs: 1.1K

Run Growth: -3.6K

Growth Rate: -313.71%

Updated: November 01 2022

huggingface.co

nvidia/bigvgan_22khz_80band

Total runs: 1.1K

Run Growth: 236

Growth Rate: 22.08%

Updated: July 22 2024

huggingface.co

nvidia/stt_ru_fastconformer_hybrid_large_pc

Total runs: 1.0K

Run Growth: 609

Growth Rate: 60.48%

Updated: May 26 2023

huggingface.co

nvidia/segformer-b4-finetuned-cityscapes-1024-1024

Total runs: 991

Run Growth: 366

Growth Rate: 40.40%

Updated: April 24 2023

huggingface.co

nvidia/RADIO-H

Total runs: 981

Run Growth: -979

Growth Rate: -85.80%

Updated: December 02 2024

huggingface.co

nvidia/segformer-b0-finetuned-cityscapes-512-1024

Total runs: 946

Run Growth: 628

Growth Rate: 73.28%

Updated: August 09 2022

huggingface.co

nvidia/Llama-3.1-8B-Instruct-FP8

Total runs: 946

Run Growth: 285

Growth Rate: 30.61%

Updated: January 10 2025

huggingface.co

nvidia/stt_fr_conformer_ctc_large

Total runs: 912

Run Growth: 696

Growth Rate: 74.60%

Updated: October 29 2022

huggingface.co

nvidia/stt_en_fastconformer_ctc_large

Total runs: 872

Run Growth: -2.5K

Growth Rate: -290.80%

Updated: January 02 2024

huggingface.co

nvidia/RADIO

Total runs: 846

Run Growth: 99

Growth Rate: 11.99%

Updated: December 10 2024

huggingface.co

nvidia/Llama-3.1-70B-Instruct-FP8

Total runs: 816

Run Growth: 509

Growth Rate: 70.69%

Updated: January 10 2025

huggingface.co

nvidia/Eagle2-2B

Total runs: 807

Run Growth: 776

Growth Rate: 96.64%

Updated: January 28 2025

huggingface.co

nvidia/Nemotron-4-Minitron-8B-Base

Total runs: 778

Run Growth: 0

Growth Rate: 0.00%

Updated: August 15 2024

huggingface.co

nvidia/Cosmos-1.0-Autoregressive-13B-Video2World

Total runs: 741

Run Growth: 433

Growth Rate: 56.60%

Updated: February 08 2025

huggingface.co

nvidia/Cosmos-1.0-Autoregressive-12B

Total runs: 732

Run Growth: 434

Growth Rate: 60.87%

Updated: February 11 2025

huggingface.co

nvidia/Llama-3.1-405B-Instruct-FP8

Total runs: 719

Run Growth: 429

Growth Rate: 61.11%

Updated: January 10 2025
