google / siglip-so400m-patch14-224

huggingface.co
Total runs: 224.3K
24-hour runs: -6.4K
7-day runs: 35.9K
30-day runs: 189.7K
Model's Last Updated: August 23 2024
zero-shot-image-classification

Introduction of siglip-so400m-patch14-224

Model Details of siglip-so400m-patch14-224

SigLIP (shape-optimized model)

SigLIP model pre-trained on WebLi at resolution 224x224. It was introduced in the paper Sigmoid Loss for Language Image Pre-Training by Zhai et al. and first released in this repository .

This model has the SoViT-400m architecture, which is the shape-optimized version as presented in Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design by Alabdulmohsin et al.

Disclaimer: The team releasing SigLIP did not write a model card for this model so this model card has been written by the Hugging Face team.

Model description

SigLIP is CLIP , a multimodal model, with a better loss function. The sigmoid loss operates solely on image-text pairs and does not require a global view of the pairwise similarities for normalization. This allows further scaling up the batch size, while also performing better at smaller batch sizes.

A TLDR of SigLIP by one of the authors can be found here .

Intended uses & limitations

You can use the raw model for tasks like zero-shot image classification and image-text retrieval. See the model hub to look for other versions on a task that interests you.

How to use

Here is how to use this model to perform zero-shot image classification:

from PIL import Image
import requests
from transformers import AutoProcessor, AutoModel
import torch

model = AutoModel.from_pretrained("google/siglip-so400m-patch14-224")
processor = AutoProcessor.from_pretrained("google/siglip-so400m-patch14-224")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

texts = ["a photo of 2 cats", "a photo of 2 dogs"]
inputs = processor(text=texts, images=image, padding="max_length", return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

logits_per_image = outputs.logits_per_image
probs = torch.sigmoid(logits_per_image) # these are the probabilities
print(f"{probs[0][0]:.1%} that image 0 is '{texts[0]}'")

Alternatively, one can leverage the pipeline API which abstracts away the complexity for the user:

from transformers import pipeline
from PIL import Image
import requests

# load pipe
image_classifier = pipeline(task="zero-shot-image-classification", model="google/siglip-so400m-patch14-224")

# load image
url = 'http://images.cocodataset.org/val2017/000000039769.jpg'
image = Image.open(requests.get(url, stream=True).raw)

# inference
outputs = image_classifier(image, candidate_labels=["2 cats", "a plane", "a remote"])
outputs = [{"score": round(output["score"], 4), "label": output["label"] } for output in outputs]
print(outputs)

For more code examples, we refer to the documentation .

Training procedure
Training data

SigLIP is pre-trained on the WebLI dataset (Chen et al., 2023) .

Preprocessing

Images are resized/rescaled to the same resolution (384x384) and normalized across the RGB channels with mean (0.5, 0.5, 0.5) and standard deviation (0.5, 0.5, 0.5).

Texts are tokenized and padded to the same length (64 tokens).

Compute

The model was trained on 16 TPU-v4 chips for three days.

Evaluation results

Evaluation of SigLIP compared to CLIP is shown below (taken from the paper).

drawing

BibTeX entry and citation info
@misc{zhai2023sigmoid,
      title={Sigmoid Loss for Language Image Pre-Training}, 
      author={Xiaohua Zhai and Basil Mustafa and Alexander Kolesnikov and Lucas Beyer},
      year={2023},
      eprint={2303.15343},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Runs of google siglip-so400m-patch14-224 on huggingface.co

224.3K
Total runs
-6.4K
24-hour runs
3.0K
3-day runs
35.9K
7-day runs
189.7K
30-day runs

More Information About siglip-so400m-patch14-224 huggingface.co Model

More siglip-so400m-patch14-224 license Visit here:

https://choosealicense.com/licenses/apache-2.0

siglip-so400m-patch14-224 huggingface.co

siglip-so400m-patch14-224 huggingface.co is an AI model on huggingface.co that provides siglip-so400m-patch14-224's model effect (), which can be used instantly with this google siglip-so400m-patch14-224 model. huggingface.co supports a free trial of the siglip-so400m-patch14-224 model, and also provides paid use of the siglip-so400m-patch14-224. Support call siglip-so400m-patch14-224 model through api, including Node.js, Python, http.

siglip-so400m-patch14-224 huggingface.co Url

https://huggingface.co/google/siglip-so400m-patch14-224

google siglip-so400m-patch14-224 online free

siglip-so400m-patch14-224 huggingface.co is an online trial and call api platform, which integrates siglip-so400m-patch14-224's modeling effects, including api services, and provides a free online trial of siglip-so400m-patch14-224, you can try siglip-so400m-patch14-224 online for free by clicking the link below.

google siglip-so400m-patch14-224 online free url in huggingface.co:

https://huggingface.co/google/siglip-so400m-patch14-224

siglip-so400m-patch14-224 install

siglip-so400m-patch14-224 is an open source model from GitHub that offers a free installation service, and any user can find siglip-so400m-patch14-224 on GitHub to install. At the same time, huggingface.co provides the effect of siglip-so400m-patch14-224 install, users can directly use siglip-so400m-patch14-224 installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

siglip-so400m-patch14-224 install url in huggingface.co:

https://huggingface.co/google/siglip-so400m-patch14-224

Url of siglip-so400m-patch14-224

siglip-so400m-patch14-224 huggingface.co Url

Provider of siglip-so400m-patch14-224 huggingface.co

google
ORGANIZATIONS

Other API from google

huggingface.co

Total runs: 2.2M
Run Growth: -11.4M
Growth Rate: -527.13%
Updated: August 08 2024
huggingface.co

Total runs: 2.1M
Run Growth: -159.2K
Growth Rate: -7.43%
Updated: January 25 2023
huggingface.co

Total runs: 1.7M
Run Growth: -1.7M
Growth Rate: -97.41%
Updated: February 29 2024
huggingface.co

Total runs: 1.5M
Run Growth: -78.3K
Growth Rate: -5.39%
Updated: April 29 2024
huggingface.co

Total runs: 1.4M
Run Growth: 539.6K
Growth Rate: 38.86%
Updated: January 25 2023
huggingface.co

Total runs: 1.3M
Run Growth: 241.2K
Growth Rate: 18.23%
Updated: July 17 2023
huggingface.co

Total runs: 750.1K
Run Growth: 112.0K
Growth Rate: 14.93%
Updated: August 28 2024
huggingface.co

Total runs: 641.9K
Run Growth: 117.7K
Growth Rate: 18.33%
Updated: July 17 2023
huggingface.co

Total runs: 635.2K
Run Growth: 82.3K
Growth Rate: 12.96%
Updated: July 27 2023
huggingface.co

Total runs: 623.4K
Run Growth: 411.5K
Growth Rate: 66.01%
Updated: August 14 2024
huggingface.co

Total runs: 600.3K
Run Growth: 585.1K
Growth Rate: 97.46%
Updated: August 08 2024
huggingface.co

Total runs: 507.3K
Run Growth: 192.7K
Growth Rate: 37.99%
Updated: October 11 2023
huggingface.co

Total runs: 396.4K
Run Growth: -124.5K
Growth Rate: -31.40%
Updated: September 27 2024
huggingface.co

Total runs: 317.0K
Run Growth: -125.8K
Growth Rate: -39.69%
Updated: August 28 2024
huggingface.co

Total runs: 288.0K
Run Growth: 286.9K
Growth Rate: 99.63%
Updated: August 03 2023
huggingface.co

Total runs: 233.6K
Run Growth: 110.5K
Growth Rate: 47.33%
Updated: January 25 2023
huggingface.co

Total runs: 215.0K
Run Growth: -25.7K
Growth Rate: -11.95%
Updated: August 28 2024
huggingface.co

Total runs: 213.3K
Run Growth: -395.1K
Growth Rate: -185.19%
Updated: January 25 2023
huggingface.co

Total runs: 208.4K
Run Growth: 66.4K
Growth Rate: 31.85%
Updated: November 07 2023
huggingface.co

Total runs: 192.1K
Run Growth: 54.9K
Growth Rate: 28.57%
Updated: November 28 2023
huggingface.co

Total runs: 153.1K
Run Growth: -4.5K
Growth Rate: -2.94%
Updated: January 25 2023
huggingface.co

Total runs: 146.1K
Run Growth: -46.2K
Growth Rate: -31.63%
Updated: September 07 2023
huggingface.co

Total runs: 129.8K
Run Growth: -206.2K
Growth Rate: -158.89%
Updated: June 27 2024
huggingface.co

Total runs: 121.3K
Run Growth: -5.4K
Growth Rate: -4.44%
Updated: September 18 2023
huggingface.co

Total runs: 111.2K
Run Growth: 26.3K
Growth Rate: 23.62%
Updated: January 25 2023
huggingface.co

Total runs: 101.7K
Run Growth: -215.2K
Growth Rate: -211.67%
Updated: January 25 2023
huggingface.co

Total runs: 98.8K
Run Growth: 22.3K
Growth Rate: 22.57%
Updated: January 25 2023
huggingface.co

Total runs: 93.5K
Run Growth: 1.2K
Growth Rate: 1.26%
Updated: September 27 2024
huggingface.co

Total runs: 76.3K
Run Growth: -89.4K
Growth Rate: -117.09%
Updated: August 08 2024
huggingface.co

Total runs: 64.1K
Run Growth: 6.0K
Growth Rate: 9.28%
Updated: January 25 2023
huggingface.co

Total runs: 62.5K
Run Growth: 11.9K
Growth Rate: 19.10%
Updated: January 25 2023
huggingface.co

Total runs: 44.3K
Run Growth: 26.3K
Growth Rate: 59.25%
Updated: January 25 2023
huggingface.co

Total runs: 43.2K
Run Growth: 41.4K
Growth Rate: 95.95%
Updated: November 29 2021
huggingface.co

Total runs: 26.1K
Run Growth: -12.6K
Growth Rate: -48.26%
Updated: November 27 2023
huggingface.co

Total runs: 23.2K
Run Growth: 620
Growth Rate: 2.68%
Updated: January 25 2023
huggingface.co

Total runs: 11.6K
Run Growth: 1.8K
Growth Rate: 17.34%
Updated: July 06 2023
huggingface.co

Total runs: 9.9K
Run Growth: 3.1K
Growth Rate: 30.72%
Updated: April 29 2024
huggingface.co

Total runs: 8.9K
Run Growth: -31.2K
Growth Rate: -351.34%
Updated: September 07 2023
huggingface.co

Total runs: 7.8K
Run Growth: -4.0K
Growth Rate: -51.52%
Updated: January 25 2023