larger_clap_music_and_speech huggingface.co api & laion larger_clap_music_and_speech github AI Model

Introduction of larger_clap_music_and_speech

Model Details of larger_clap_music_and_speech

Model

TL;DR

CLAP is to audio what CLIP is to image. This is an improved CLAP checkpoint, specifically trained on music and speech.

Description

CLAP (Contrastive Language-Audio Pretraining) is a neural network trained on a variety of (audio, text) pairs. It can be instructed in to predict the most relevant text snippet, given an audio, without directly optimizing for the task. The CLAP model uses a SWINTransformer to get audio features from a log-Mel spectrogram input, and a RoBERTa model to get text features. Both the text and audio features are then projected to a latent space with identical dimension. The dot product between the projected audio and text features is then used as a similar score.

Usage

You can use this model for zero shot audio classification or extracting audio and/or textual features.

Uses

Perform zero-shot audio classification

Using `pipeline`

from datasets import load_dataset
from transformers import pipeline

dataset = load_dataset("ashraq/esc50")
audio = dataset["train"]["audio"][-1]["array"]

audio_classifier = pipeline(task="zero-shot-audio-classification", model="laion/larger_clap_music_and_speech")
output = audio_classifier(audio, candidate_labels=["Sound of a dog", "Sound of vaccum cleaner"])
print(output)
>>> [{"score": 0.999, "label": "Sound of a dog"}, {"score": 0.001, "label": "Sound of vaccum cleaner"}]

Run the model:

You can also get the audio and text embeddings using ClapModel

Run the model on CPU:

from datasets import load_dataset
from transformers import ClapModel, ClapProcessor

librispeech_dummy = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")
audio_sample = librispeech_dummy[0]

model = ClapModel.from_pretrained("laion/larger_clap_music_and_speech")
processor = ClapProcessor.from_pretrained("laion/larger_clap_music_and_speech")

inputs = processor(audios=audio_sample["audio"]["array"], return_tensors="pt")
audio_embed = model.get_audio_features(**inputs)

Run the model on GPU:

from datasets import load_dataset
from transformers import ClapModel, ClapProcessor

librispeech_dummy = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")
audio_sample = librispeech_dummy[0]

model = ClapModel.from_pretrained("laion/larger_clap_music_and_speech").to(0)
processor = ClapProcessor.from_pretrained("laion/larger_clap_music_and_speech")

inputs = processor(audios=audio_sample["audio"]["array"], return_tensors="pt").to(0)
audio_embed = model.get_audio_features(**inputs)

Citation

If you are using this model for your work, please consider citing the original paper:

@misc{https://doi.org/10.48550/arxiv.2211.06687,
  doi = {10.48550/ARXIV.2211.06687},
  url = {https://arxiv.org/abs/2211.06687},
  author = {Wu, Yusong and Chen, Ke and Zhang, Tianyu and Hui, Yuchen and Berg-Kirkpatrick, Taylor and Dubnov, Shlomo},
  keywords = {Sound (cs.SD), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering},
  title = {Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation},
  publisher = {arXiv},
  year = {2022},
  copyright = {Creative Commons Attribution 4.0 International}
}

Runs of laion larger_clap_music_and_speech on huggingface.co

14.7K

Total runs

24-hour runs

-33

3-day runs

1.9K

7-day runs

6.7K

30-day runs

More Information About larger_clap_music_and_speech huggingface.co Model

More larger_clap_music_and_speech license Visit here:

https://choosealicense.com/licenses/apache-2.0

larger_clap_music_and_speech huggingface.co

larger_clap_music_and_speech huggingface.co is an AI model on huggingface.co that provides larger_clap_music_and_speech's model effect (), which can be used instantly with this laion larger_clap_music_and_speech model. huggingface.co supports a free trial of the larger_clap_music_and_speech model, and also provides paid use of the larger_clap_music_and_speech. Support call larger_clap_music_and_speech model through api, including Node.js, Python, http.

larger_clap_music_and_speech huggingface.co Url

https://huggingface.co/laion/larger_clap_music_and_speech

laion larger_clap_music_and_speech online free

larger_clap_music_and_speech huggingface.co is an online trial and call api platform, which integrates larger_clap_music_and_speech's modeling effects, including api services, and provides a free online trial of larger_clap_music_and_speech, you can try larger_clap_music_and_speech online for free by clicking the link below.

laion larger_clap_music_and_speech online free url in huggingface.co:

https://huggingface.co/laion/larger_clap_music_and_speech

larger_clap_music_and_speech install

larger_clap_music_and_speech is an open source model from GitHub that offers a free installation service, and any user can find larger_clap_music_and_speech on GitHub to install. At the same time, huggingface.co provides the effect of larger_clap_music_and_speech install, users can directly use larger_clap_music_and_speech installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

larger_clap_music_and_speech install url in huggingface.co:

https://huggingface.co/laion/larger_clap_music_and_speech

huggingface.co

laion/CLIP-ViT-B-32-laion2B-s34B-b79K

Total runs: 2.1M

Run Growth: 298.1K

Growth Rate: 14.51%

Updated: 2025年1月22日

huggingface.co

laion/CLIP-ViT-bigG-14-laion2B-39B-b160k

Total runs: 1.3M

Run Growth: -524.6K

Growth Rate: -40.85%

Updated: 2025年1月22日

huggingface.co

laion/CLIP-ViT-B-16-laion2B-s34B-b88K

Total runs: 1.1M

Run Growth: -661.9K

Growth Rate: -58.23%

Updated: 2023年4月19日

huggingface.co

laion/CLIP-ViT-H-14-laion2B-s32B-b79K

Total runs: 947.8K

Run Growth: -356.8K

Growth Rate: -37.64%

Updated: 2025年1月22日

huggingface.co

laion/CLIP-ViT-B-32-roberta-base-laion2B-s12B-b32k

Total runs: 839.8K

Run Growth: 743.2K

Growth Rate: 88.50%

Updated: 2022年11月13日

huggingface.co

laion/CLIP-convnext_base_w-laion2B-s13B-b82K-augreg

Total runs: 723.7K

Run Growth: 129.0K

Growth Rate: 17.82%

Updated: 2023年4月18日

huggingface.co

laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft-soup

Total runs: 271.2K

Run Growth: -238.3K

Growth Rate: -90.91%

Updated: 2025年1月22日

huggingface.co

laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K

Total runs: 158.9K

Run Growth: -1.6K

Growth Rate: -0.98%

Updated: 2023年5月16日

huggingface.co

laion/larger_clap_general

Total runs: 137.3K

Run Growth: -68.7K

Growth Rate: -50.01%

Updated: 2023年10月31日

huggingface.co

laion/CLIP-ViT-L-14-laion2B-s32B-b82K

Total runs: 121.2K

Run Growth: 36.9K

Growth Rate: 30.42%

Updated: 2024年1月16日

huggingface.co

laion/CLIP-ViT-B-32-DataComp.XL-s13B-b90K

Total runs: 100.0K

Run Growth: 37.2K

Growth Rate: 37.18%

Updated: 2023年9月29日

huggingface.co

laion/CLIP-convnext_large_d.laion2B-s26B-b102K-augreg

Total runs: 98.0K

Run Growth: -259.2K

Growth Rate: -264.56%

Updated: 2023年4月18日

huggingface.co

laion/CLIP-ViT-B-16-DataComp.XL-s13B-b90K

Total runs: 89.0K

Run Growth: -19.7K

Growth Rate: -22.20%

Updated: 2023年9月29日

huggingface.co

laion/clap-htsat-unfused

Total runs: 83.8K

Run Growth: -65.8K

Growth Rate: -78.50%

Updated: 2023年4月24日

huggingface.co

laion/CLIP-convnext_base_w-laion2B-s13B-b82K

Total runs: 52.6K

Run Growth: 30.9K

Growth Rate: 58.70%

Updated: 2023年4月18日

huggingface.co

laion/CLIP-ViT-g-14-laion2B-s12B-b42K

Total runs: 50.5K

Run Growth: 8.4K

Growth Rate: 16.59%

Updated: 2024年2月23日

huggingface.co

laion/CLIP-ViT-B-32-xlm-roberta-base-laion5B-s13B-b90k

Total runs: 48.4K

Run Growth: -6.8K

Growth Rate: -14.05%

Updated: 2022年11月14日

huggingface.co

laion/CLIP-convnext_base-laion400M-s13B-b51K

Total runs: 47.9K

Run Growth: 28.2K

Growth Rate: 58.92%

Updated: 2023年1月14日

huggingface.co

laion/mscoco_finetuned_CoCa-ViT-L-14-laion2B-s13B-b90k

Total runs: 45.1K

Run Growth: 11.9K

Growth Rate: 26.41%

Updated: 2024年1月16日

huggingface.co

laion/CLIP-ViT-g-14-laion2B-s34B-b88K

Total runs: 31.1K

Run Growth: 1.3K

Growth Rate: 4.05%

Updated: 2025年1月22日

huggingface.co

laion/CLIP-convnext_xxlarge-laion2B-s34B-b82K-augreg

Total runs: 29.1K

Run Growth: -26.2K

Growth Rate: -89.95%

Updated: 2023年4月18日

huggingface.co

laion/CLIP-convnext_base_w-laion_aesthetic-s13B-b82K

Total runs: 28.1K

Run Growth: 11.9K

Growth Rate: 42.16%

Updated: 2023年4月18日

huggingface.co

laion/clap-htsat-fused

Total runs: 17.0K

Run Growth: 3.2K

Growth Rate: 19.03%

Updated: 2023年4月24日

huggingface.co

laion/CLIP-convnext_xxlarge-laion2B-s34B-b82K-augreg-soup

Total runs: 15.9K

Run Growth: -36.9K

Growth Rate: -233.55%

Updated: 2023年4月18日

huggingface.co

laion/CLIP-ViT-L-14-CommonPool.XL-s13B-b90K

Total runs: 15.2K

Run Growth: -469

Growth Rate: -3.08%

Updated: 2024年11月12日

huggingface.co

laion/CLIP-ViT-B-16-DataComp.L-s1B-b8K

Total runs: 10.9K

Run Growth: 151

Growth Rate: 1.38%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-convnext_base_w_320-laion_aesthetic-s13B-b82K

Total runs: 10.0K

Run Growth: -681

Growth Rate: -6.84%

Updated: 2023年4月18日

huggingface.co

laion/larger_clap_music

Total runs: 8.0K

Run Growth: 2.1K

Growth Rate: 26.18%

Updated: 2023年10月30日

huggingface.co

laion/CLIP-ViT-H-14-frozen-xlm-roberta-large-laion5B-s13B-b90k

Total runs: 4.3K

Run Growth: -8.8K

Growth Rate: -212.87%

Updated: 2022年11月18日

huggingface.co

laion/CLIP-ViT-B-32-256x256-DataComp-s34B-b86K

Total runs: 3.5K

Run Growth: -577

Growth Rate: -16.38%

Updated: 2023年11月13日

huggingface.co

laion/CLIP-ViT-L-14-CommonPool.XL.clip-s13B-b90K

Total runs: 3.2K

Run Growth: -7.3K

Growth Rate: -228.06%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-L-14-CommonPool.XL.laion-s13B-b90K

Total runs: 3.1K

Run Growth: -6.7K

Growth Rate: -214.07%

Updated: 2023年4月26日

huggingface.co

laion/mscoco_finetuned_CoCa-ViT-B-32-laion2B-s13B-b90k

Total runs: 2.4K

Run Growth: 1.0K

Growth Rate: 42.49%

Updated: 2023年2月3日

huggingface.co

laion/CLIP-convnext_base_w_320-laion_aesthetic-s13B-b82K-augreg

Total runs: 2.3K

Run Growth: -614

Growth Rate: -27.08%

Updated: 2023年4月18日

huggingface.co

laion/CoCa-ViT-L-14-laion2B-s13B-b90k

Total runs: 1.7K

Run Growth: -1.9K

Growth Rate: -112.98%

Updated: 2023年2月1日

huggingface.co

laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft

Total runs: 1.6K

Run Growth: -1.0K

Growth Rate: -62.70%

Updated: 2023年4月18日

huggingface.co

laion/CoCa-ViT-B-32-laion2B-s13B-b90k

Total runs: 1.0K

Run Growth: -3.2K

Growth Rate: -311.24%

Updated: 2023年1月29日

huggingface.co

laion/CLIP-ViT-B-16-CommonPool.L-s1B-b8K

Total runs: 429

Run Growth: -2.8K

Growth Rate: -652.21%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-convnext_xxlarge-laion2B-s34B-b82K-augreg-rewind

Total runs: 201

Run Growth: -3.1K

Growth Rate: -1654.26%

Updated: 2023年4月18日

huggingface.co

laion/CLIP-ViT-B-32-DataComp.M-s128M-b4K

Total runs: 162

Run Growth: -720

Growth Rate: -444.44%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.M-s128M-b4K

Total runs: 156

Run Growth: 107

Growth Rate: 68.59%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-16-CommonPool.L.clip-s1B-b8K

Total runs: 156

Run Growth: 98

Growth Rate: 62.82%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-16-CommonPool.L.basic-s1B-b8K

Total runs: 147

Run Growth: 66

Growth Rate: 44.90%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.M.clip-s128M-b4K

Total runs: 145

Run Growth: 89

Growth Rate: 61.38%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.S-s13M-b4K

Total runs: 131

Run Growth: 86

Growth Rate: 65.65%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-16-CommonPool.L.image-s1B-b8K

Total runs: 129

Run Growth: 78

Growth Rate: 60.47%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.S.laion-s13M-b4K

Total runs: 121

Run Growth: 82

Growth Rate: 67.77%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-DataComp.S-s13M-b4K

Total runs: 113

Run Growth: -107

Growth Rate: -94.69%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-16-CommonPool.L.laion-s1B-b8K

Total runs: 110

Run Growth: 59

Growth Rate: 53.64%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-16-CommonPool.L.text-s1B-b8K

Total runs: 108

Run Growth: 27

Growth Rate: 25.00%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.M.laion-s128M-b4K

Total runs: 101

Run Growth: 57

Growth Rate: 56.44%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.M.image-s128M-b4K

Total runs: 98

Run Growth: 54

Growth Rate: 55.10%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.M.basic-s128M-b4K

Total runs: 94

Run Growth: 53

Growth Rate: 56.38%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.S.clip-s13M-b4K

Total runs: 92

Run Growth: 50

Growth Rate: 54.35%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.M.text-s128M-b4K

Total runs: 92

Run Growth: 53

Growth Rate: 57.61%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.S.image-s13M-b4K

Total runs: 82

Run Growth: 26

Growth Rate: 31.71%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.S.basic-s13M-b4K

Total runs: 82

Run Growth: 45

Growth Rate: 54.88%

Updated: 2023年4月26日

huggingface.co

laion/CLIP-ViT-B-32-CommonPool.S.text-s13M-b4K

Total runs: 74

Run Growth: 39

Growth Rate: 52.70%

Updated: 2023年4月26日

huggingface.co

laion/LLaVA-Video-7B-Qwen2_openvino_int8

Total runs: 24

Run Growth: 20

Growth Rate: 83.33%

Updated: 2025年1月20日

huggingface.co

laion/anh-xglm-7.5b-cross-lingual

Total runs: 20

Run Growth: 4

Growth Rate: 20.00%

Updated: 2023年4月1日

huggingface.co

laion/bge-small-en-v1.5_openvino_int8

Total runs: 14

Run Growth: 9

Growth Rate: 64.29%

Updated: 2025年1月20日

huggingface.co

laion/anh-bloomz-7b1-mt-cross-lingual

Total runs: 11

Run Growth: -5

Growth Rate: -45.45%

Updated: 2023年4月6日

huggingface.co

laion/llava-v1.6-mistral-7b_openvino_int8

Total runs: 8

Run Growth: 2

Growth Rate: 25.00%

Updated: 2025年1月20日

huggingface.co

laion/Mantis-8B-siglip-llama3_openvino_int8

Total runs: 6

Run Growth: 1

Growth Rate: 16.67%

Updated: 2025年1月20日

huggingface.co

laion/distil-whisper-large-v3_openvino_int8

Total runs: 5

Run Growth: -4

Growth Rate: -80.00%

Updated: 2025年1月20日

huggingface.co

laion/t5-base_openvino_int8

Total runs: 4

Run Growth: -1

Growth Rate: -25.00%

Updated: 2025年1月20日

huggingface.co

laion/ongo

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年6月7日

huggingface.co

laion/xclip-base-patch16-zero-shot_openvino_int8

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2025年1月20日

huggingface.co

laion/larger_clap_general_openvino_int8

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2025年1月20日

huggingface.co

laion/clip-vit-base-patch16_openvino_int8

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2025年1月20日

huggingface.co

laion/erlich

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年5月25日

huggingface.co

laion/puck

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年6月25日

huggingface.co

laion/DALLE2-PyTorch

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年11月16日

huggingface.co

laion/scaling-laws-openclip

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年7月23日

huggingface.co

laion/wav2vec2-large-960h-lv60-self_openvino_int8

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2025年1月20日

laion / larger_clap_music_and_speech

Introduction of larger_clap_music_and_speech

Model Details of larger_clap_music_and_speech

Model

TL;DR

Description

Usage

Uses

Perform zero-shot audio classification

Using `pipeline`

Run the model:

Run the model on CPU:

Run the model on GPU:

Citation

Runs of laion larger_clap_music_and_speech on huggingface.co

More Information About larger_clap_music_and_speech huggingface.co Model

More larger_clap_music_and_speech license Visit here:

larger_clap_music_and_speech huggingface.co

larger_clap_music_and_speech huggingface.co Url

laion larger_clap_music_and_speech online free

laion larger_clap_music_and_speech online free url in huggingface.co:

larger_clap_music_and_speech install

larger_clap_music_and_speech install url in huggingface.co:

Url of larger_clap_music_and_speech

larger_clap_music_and_speech huggingface.co Url

Provider of larger_clap_music_and_speech huggingface.co

Other API from laion

laion / larger_clap_music_and_speech

Introduction of larger_clap_music_and_speech

Model Details of larger_clap_music_and_speech

Model

TL;DR

Description

Usage

Uses

Perform zero-shot audio classification

Using pipeline

Run the model:

Run the model on CPU:

Run the model on GPU:

Citation

Runs of laion larger_clap_music_and_speech on huggingface.co

More Information About larger_clap_music_and_speech huggingface.co Model

More larger_clap_music_and_speech license Visit here:

larger_clap_music_and_speech huggingface.co

larger_clap_music_and_speech huggingface.co Url

laion larger_clap_music_and_speech online free

laion larger_clap_music_and_speech online free url in huggingface.co:

larger_clap_music_and_speech install

larger_clap_music_and_speech install url in huggingface.co:

Url of larger_clap_music_and_speech

larger_clap_music_and_speech huggingface.co Url

Provider of larger_clap_music_and_speech huggingface.co

Other API from laion

Using `pipeline`