SegGPT employs a decoder-only (GPT-like) Transformer that can generate a segmentation mask given an input image, a prompt image and its corresponding prompt mask.
The model achieves remarkable one-shot results with 56.1 mIoU on COCO-20 and 85.6 mIoU on FSS-1000.
Intended uses & limitations
You can use the raw model for one-shot image segmentation.
How to use
Here's how to use the model for one-shot semantic segmentation:
import torch
from datasets import load_dataset
from transformers import SegGptImageProcessor, SegGptForImageSegmentation
model_id = "BAAI/seggpt-vit-large"
image_processor = SegGptImageProcessor.from_pretrained(checkpoint)
model = SegGptForImageSegmentation.from_pretrained(checkpoint)
dataset_id = "EduardoPacheco/FoodSeg103"
ds = load_dataset(dataset_id, split="train")
# Number of labels in FoodSeg103 (not including background)
num_labels = 103
image_input = ds[4]["image"]
ground_truth = ds[4]["label"]
image_prompt = ds[29]["image"]
mask_prompt = ds[29]["label"]
inputs = image_processor(
images=image_input,
prompt_images=image_prompt,
prompt_masks=mask_prompt,
num_labels=num_labels,
return_tensors="pt"
)
with torch.no_grad():
outputs = model(**inputs)
target_sizes = [image_input.size[::-1]]
mask = image_processor.post_process_semantic_segmentation(outputs, target_sizes, num_labels=num_labels)[0]
BibTeX entry and citation info
@misc{wang2023seggpt,
title={SegGPT: Segmenting Everything In Context},
author={Xinlong Wang and Xiaosong Zhang and Yue Cao and Wen Wang and Chunhua Shen and Tiejun Huang},
year={2023},
eprint={2304.03284},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
Runs of BAAI seggpt-vit-large on huggingface.co
4.5K
Total runs
417
24-hour runs
744
3-day runs
927
7-day runs
-9.5K
30-day runs
More Information About seggpt-vit-large huggingface.co Model
seggpt-vit-large huggingface.co is an AI model on huggingface.co that provides seggpt-vit-large's model effect (), which can be used instantly with this BAAI seggpt-vit-large model. huggingface.co supports a free trial of the seggpt-vit-large model, and also provides paid use of the seggpt-vit-large. Support call seggpt-vit-large model through api, including Node.js, Python, http.
seggpt-vit-large huggingface.co is an online trial and call api platform, which integrates seggpt-vit-large's modeling effects, including api services, and provides a free online trial of seggpt-vit-large, you can try seggpt-vit-large online for free by clicking the link below.
BAAI seggpt-vit-large online free url in huggingface.co:
seggpt-vit-large is an open source model from GitHub that offers a free installation service, and any user can find seggpt-vit-large on GitHub to install. At the same time, huggingface.co provides the effect of seggpt-vit-large install, users can directly use seggpt-vit-large installed effect in huggingface.co for debugging and trial. It also supports api for free installation.