FRIDA: ai-forever text embedding model on huggingface.co

Model Card for FRIDA

FRIDA is a full-scale fine-tuned general text embedding model inspired by the denoising architecture of T5. The model is based on the encoder part of the FRED-T5 model and continues our research on text embedding models (ruMTEB, ru-en-RoSBERTa). It was pre-trained on a Russian-English dataset and then fine-tuned for improved performance on the target tasks.

For more model details please refer to our technical report [TODO].

Usage

The model can be used as is with prefixes. It is recommended to use CLS pooling. The choice of prefix and pooling depends on the task.

We use the following basic rules to choose a prefix:

"search_query: " and "search_document: " prefixes are for answer or relevant-paragraph retrieval
"paraphrase: " prefix is for symmetric paraphrasing-related tasks (STS, paraphrase mining, deduplication)
"categorize: " prefix is for asymmetric matching of a document title and body (e.g. news, scientific papers, social posts)
"categorize_sentiment: " prefix is for any task that relies on sentiment features (e.g. hate, toxicity, emotion)
"categorize_topic: " prefix is for tasks where you need to group texts by topic
"categorize_entailment: " prefix is for textual entailment tasks (NLI)

To better tailor the model to your needs, you can fine-tune it with relevant high-quality Russian and English datasets.
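
The prefix rules listed above can be wrapped in a small helper so that the right prefix is always prepended before encoding. This is only a sketch: the task names (`query`, `document`, `title_body`, and so on) and the `add_prefix` function are hypothetical and not part of the model or any library; only the prefix strings themselves come from the rules above.

```python
# Hypothetical mapping from an illustrative task name to a FRIDA prefix.
# The keys are made up for this example; the values are the documented prefixes.
TASK_PREFIXES = {
    "query": "search_query: ",
    "document": "search_document: ",
    "paraphrase": "paraphrase: ",
    "title_body": "categorize: ",
    "sentiment": "categorize_sentiment: ",
    "topic": "categorize_topic: ",
    "entailment": "categorize_entailment: ",
}


def add_prefix(text: str, task: str) -> str:
    """Prepend the prefix for the given task to a raw text."""
    if task not in TASK_PREFIXES:
        known = ", ".join(sorted(TASK_PREFIXES))
        raise ValueError(f"unknown task {task!r}; expected one of: {known}")
    return TASK_PREFIXES[task] + text


print(add_prefix("Женщину спасают врачи.", "entailment"))
# categorize_entailment: Женщину спасают врачи.
```

Keeping the prefixes in one place makes it harder to encode a query with a document prefix by accident, which would likely hurt retrieval quality.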

Below are examples of encoding texts with the Transformers and SentenceTransformers libraries.

Transformers

import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, T5EncoderModel


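def pool(hidden_state, mask, pooling_method="cls"):
    """Reduce token-level hidden states to one embedding per input."""
    if pooling_method == "mean":
        # average over real tokens only; padding positions are masked out
        s = torch.sum(hidden_state * mask.unsqueeze(-1).float(), dim=1)
        d = mask.sum(dim=1, keepdim=True).float()
        return s / d
    elif pooling_method == "cls":
        # take the hidden state of the first token
        return hidden_state[:, 0]
    raise ValueError(f"unknown pooling_method: {pooling_method!r}")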

inputs = [
    # first text of each pair
    "paraphrase: В Ярославской области разрешили работу бань, но без посетителей",
    "categorize_entailment: Женщину доставили в больницу, за ее жизнь сейчас борются врачи.",
    "search_query: Сколько программистов нужно, чтобы вкрутить лампочку?",
    # second text of each pair (same order as above)
    "paraphrase: Ярославским баням разрешили работать без посетителей",
    "categorize_entailment: Женщину спасают врачи.",
    "search_document: Чтобы вкрутить лампочку, требуется три программиста: один напишет программу извлечения лампочки, другой — вкручивания лампочки, а третий проведет тестирование."
]

tokenizer = AutoTokenizer.from_pretrained("ai-forever/FRIDA")
model = T5EncoderModel.from_pretrained("ai-forever/FRIDA")

tokenized_inputs = tokenizer(inputs, max_length=512, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    outputs = model(**tokenized_inputs)
    
embeddings = pool(
    outputs.last_hidden_state, 
    tokenized_inputs["attention_mask"],
    pooling_method="cls" # or try "mean"
)

embeddings = F.normalize(embeddings, p=2, dim=1)
sim_scores = embeddings[:3] @ embeddings[3:].T
print(sim_scores.diag().tolist())
# [0.9360030293464661, 0.8591322302818298, 0.728583037853241]
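
The example above normalizes the embeddings and then takes dot products. For unit-length vectors the dot product equals the cosine similarity, which is why no separate cosine step is needed. A minimal pure-Python sketch of that equivalence follows; the 3-dimensional vectors are toy values, not real FRIDA embeddings, and the helper names are illustrative.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def l2_normalize(vec):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(dot(vec, vec))
    return [x / norm for x in vec]


def cosine(a, b):
    """Cosine similarity computed directly from unnormalized vectors."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


# Toy vectors standing in for two embeddings (assumption for illustration).
a = [1.0, 2.0, 2.0]
b = [2.0, 0.0, 1.0]

score_normalized = dot(l2_normalize(a), l2_normalize(b))
score_cosine = cosine(a, b)
print(math.isclose(score_normalized, score_cosine))  # True
```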

SentenceTransformers

from sentence_transformers import SentenceTransformer

inputs = [
    # first text of each pair
    "paraphrase: В Ярославской области разрешили работу бань, но без посетителей",
    "categorize_entailment: Женщину доставили в больницу, за ее жизнь сейчас борются врачи.",
    "search_query: Сколько программистов нужно, чтобы вкрутить лампочку?",
    # second text of each pair (same order as above)
    "paraphrase: Ярославским баням разрешили работать без посетителей",
    "categorize_entailment: Женщину спасают врачи.",
    "search_document: Чтобы вкрутить лампочку, требуется три программиста: один напишет программу извлечения лампочки, другой — вкручивания лампочки, а третий проведет тестирование."
]

# loads model with CLS pooling
model = SentenceTransformer("ai-forever/FRIDA")

# embeddings are normalized by default
embeddings = model.encode(inputs, convert_to_tensor=True)

sim_scores = embeddings[:3] @ embeddings[3:].T
print(sim_scores.diag().tolist())
# [0.9360026717185974, 0.8591331243515015, 0.7285830974578857]

Alternatively, use prompts (requires sentence-transformers>=2.4.0):

from sentence_transformers import SentenceTransformer

# loads model with CLS pooling
model = SentenceTransformer("ai-forever/FRIDA")

paraphrase = model.encode(["В Ярославской области разрешили работу бань, но без посетителей", "Ярославским баням разрешили работать без посетителей"], prompt_name="paraphrase")
print(paraphrase[0] @ paraphrase[1].T) # 0.9360032

categorize_entailment = model.encode(["Женщину доставили в больницу, за ее жизнь сейчас борются врачи.", "Женщину спасают врачи."], prompt_name="categorize_entailment")
print(categorize_entailment[0] @ categorize_entailment[1].T) # 0.8591322

query_embedding = model.encode("Сколько программистов нужно, чтобы вкрутить лампочку?", prompt_name="search_query")
document_embedding = model.encode("Чтобы вкрутить лампочку, требуется три программиста: один напишет программу извлечения лампочки, другой — вкручивания лампочки, а третий проведет тестирование.", prompt_name="search_document")
print(query_embedding @ document_embedding.T) # 0.7285831
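
For retrieval, the same query/document pattern extends to ranking several candidate documents against one query. The sketch below is an assumption-laden illustration: `rank_documents` and `toy_encode` are hypothetical names, and `toy_encode` is a word-count stand-in so the example runs without downloading the model. With the real model, the `encode` argument would presumably wrap `model.encode(text, prompt_name=...)` as in the snippet above.

```python
import math


def rank_documents(encode, query, documents):
    """Return (document, score) pairs sorted by descending similarity.

    `encode(text, prompt_name)` must return a normalized vector.
    """
    q = encode(query, "search_query")
    scored = []
    for doc in documents:
        d = encode(doc, "search_document")
        scored.append((doc, sum(x * y for x, y in zip(q, d))))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy stand-in encoder (NOT FRIDA): counts a few fixed words and normalizes.
VOCAB = ["bulb", "programmer", "bath", "doctor"]


def toy_encode(text, prompt_name):
    # prompt_name is ignored here; a real encoder would select a prefix with it
    words = text.lower().split()
    vec = [float(words.count(w)) for w in VOCAB]
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


docs = ["the bath is open", "a programmer changes the bulb", "the doctor is in"]
ranking = rank_documents(toy_encode, "which programmer changes a bulb", docs)
print(ranking[0][0])  # a programmer changes the bulb

# With the real model this could look like (assumption, not run here):
#   encode = lambda text, prompt: model.encode(text, prompt_name=prompt)
```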

Authors

SaluteDevices AI for B2C RnD Team.
Artem Snegirev: HF profile, Github;
Anna Maksimova: HF profile;
Aleksandr Abramov: HF profile, Github, Kaggle Competitions Master

Citation

@misc{TODO
}

Limitations

The model is designed to process texts in Russian; its quality on English texts is unknown. The maximum input length is 512 tokens.
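
One common workaround for a fixed input limit, not described by the authors, is to split a long token sequence into windows that each fit the limit and encode them separately. The sketch below shows only the splitting step; `chunk_token_ids` and the `overlap` parameter are hypothetical. In practice the prefix and any special tokens added by the tokenizer presumably count toward the 512 tokens as well, so the usable budget per chunk would be somewhat smaller.

```python
def chunk_token_ids(token_ids, max_length=512, overlap=0):
    """Split a list of token ids into windows of at most max_length items.

    Consecutive windows share `overlap` tokens so that context at the
    boundaries is not lost entirely.
    """
    if max_length <= 0:
        raise ValueError("max_length must be positive")
    if not 0 <= overlap < max_length:
        raise ValueError("overlap must satisfy 0 <= overlap < max_length")
    step = max_length - overlap
    chunks = []
    for start in range(0, len(token_ids), step):
        chunks.append(token_ids[start:start + max_length])
        if start + max_length >= len(token_ids):
            break  # the last window already reaches the end of the input
    return chunks


ids = list(range(1200))  # stand-in for real tokenizer output
chunks = chunk_token_ids(ids, max_length=512, overlap=64)
print([len(c) for c in chunks])  # [512, 512, 304]
```

How to combine the per-chunk embeddings (for example, averaging them or keeping the best-scoring chunk) is a separate design choice that the source does not address.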


License

MIT: https://choosealicense.com/licenses/mit

Model page on huggingface.co

https://huggingface.co/ai-forever/FRIDA


