sage-mt5-large huggingface.co api & ai-forever sage-mt5-large github AI Model

Introduction of sage-mt5-large

Model Details of sage-mt5-large

sage-mt5-large

Summary

The model corrects spelling errors and typos in both Russian and English languages by bringing all the words in the text to the norm of the language. Corrector had been trained based on the model mT5-large architecture. An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the library SAGE .

Public references

SAGE library announcement , DataFest 2023
Paper about synthetic error generation methods , Dialogue 2023
SAGE EACL 2024 paper

Examples

Input	Output
Перведи мне текст на аглиском: "Screw you kuys, I am goin hme (c).	Переведи мне текст на английском: "Screw you guys, I am going home" (c).
И не чсно прохожим в этот день непогожйи почему я веселый такйо	И мне ясно прохожим в этот день непогожий, почему я веселый такой
If you bought something goregous, you well be very happy.	If you bought something gorgeous, you will be very happy.

Metrics

Quality

Below are automatic metrics for determining the correctness of the spell checkers. We compare our solution with both open automatic spell checkers and the ChatGPT family of models on all six available datasets:

RUSpellRU : texts collected from ( LiveJournal ), with manually corrected typos and errors;
MultidomainGold : examples from 7 text sources, including the open web, news, social media, reviews, subtitles, policy documents and literary works;
MedSpellChecker : texts with errors from medical anamnesis;
GitHubTypoCorpusRu : spelling errors and typos in commits from GitHub ;
BEA60K : English spelling errors collected from several domains;
JFLEG : 1601 sentences in English, which contain about 2 thousand spelling errors;

RUSpellRU, MultidomainGold, MedSpellChecker, GitHubTypoCorpusRu are datasets for the Russian spellchecking and BEA60K and JFLEG are those for the English language.

RUSpellRU

Model	Precision	Recall	F1
sage-mt5-large	55.7	68.5	61.4
sage-mt5-large (ft.)	88.4	71.6	79.1
sage-ai-service	93.5	82.4	87.6
gpt-3.5-turbo	39.6	62.3	48.5
gpt-4	69.5	81.0	74.8

MultidomainGold

Model	Precision	Recall	F1
sage-mt5-large	35.4	57.9	43.9
sage-mt5-large (ft.)	65.3	62.7	63.9
sage-ai-service	70.9	68.8	69.9
gpt-3.5-turbo	17.8	56.1	27.0
gpt-4	31.1	78.1	44.5

MedSpellChecker

Model	Precision	Recall	F1
sage-mt5-large	35.1	70.8	47.0
sage-mt5-large (ft.)	77.7	77.5	77.6
sage-ai-service	73.4	76.2	74.9
gpt-3.5-turbo	15.1	53.6	23.5
gpt-4	48.9	88.7	63.1

GitHubTypoCorpusRu

Model	Precision	Recall	F1
sage-mt5-large	47.4	53.8	50.4
sage-mt5-large (ft.)	69.5	46.0	55.3
sage-ai-service	76.1	51.2	61.2
gpt-3.5-turbo	23.7	43.9	30.8
gpt-4	34.7	60.5	44.1

BEA60K

Model	Precision	Recall	F1
sage-mt5-large	64.7	83.8	73.0
gpt-3.5-turbo	66.9	84.1	74.5
gpt-4	68.6	85.2	76.0
Bert ( https://github.com/neuspell/neuspell )	65.8	79.6	72.0
SC-LSTM ( https://github.com/neuspell/neuspell )	62.2	80.3	72.0

JFLEG

Model	Precision	Recall	F1
sage-mt5-large	74.9	88.4	81.1
gpt-3.5-turbo	77.8	88.6	82.9
gpt-4	77.9	88.3	82.8
Bert ( https://github.com/neuspell/neuspell )	78.5	85.4	81.8
SC-LSTM ( https://github.com/neuspell/neuspell )	80.6	86.1	83.2

How to use

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("ai-forever/sage-mt5-large")
model = AutoModelForSeq2SeqLM.from_pretrained("ai-forever/sage-mt5-large", device_map='cuda')

sentence = "Перведи мне текст на аглиском: \"Screw you kuys, I am goin hme (c)."
inputs = tokenizer(sentence, max_length=None, padding="longest", truncation=False, return_tensors="pt")
outputs = model.generate(**inputs.to(model.device), max_length = inputs["input_ids"].size(1) * 1.5)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))

# ["Переведи мне текст на английском: "Screw you guys, I am going home" (c)."]

Limitations

For the Russian language the model is intended to be fine-tuned for better performance.

Resources

SAGE library , GitHub
sage-fredt5-large , HuggingFace
sage-fredt5-distilled-95m , HuggingFace
sage-m2m100-1.2B , HuggingFace
sage-mt5-large , HuggingFace

License

Model mT5-large , on the basis of which our solution is made, and its source code are supplied under the Apache-2.0 license. Our solution comes with MIT license.

Specifications

File size: 5 Gb;
Framework: pytorch
Version: v1.0
Developer: SberDevices, AGI NLP

Contacts

nikita.martynov.98@list.ru

Runs of ai-forever sage-mt5-large on huggingface.co

265

Total runs

24-hour runs

3-day runs

7-day runs

201

30-day runs

More Information About sage-mt5-large huggingface.co Model

More sage-mt5-large license Visit here:

https://choosealicense.com/licenses/mit

sage-mt5-large huggingface.co

sage-mt5-large huggingface.co is an AI model on huggingface.co that provides sage-mt5-large's model effect (), which can be used instantly with this ai-forever sage-mt5-large model. huggingface.co supports a free trial of the sage-mt5-large model, and also provides paid use of the sage-mt5-large. Support call sage-mt5-large model through api, including Node.js, Python, http.

sage-mt5-large huggingface.co Url

https://huggingface.co/ai-forever/sage-mt5-large

ai-forever sage-mt5-large online free

sage-mt5-large huggingface.co is an online trial and call api platform, which integrates sage-mt5-large's modeling effects, including api services, and provides a free online trial of sage-mt5-large, you can try sage-mt5-large online for free by clicking the link below.

ai-forever sage-mt5-large online free url in huggingface.co:

https://huggingface.co/ai-forever/sage-mt5-large

sage-mt5-large install

sage-mt5-large is an open source model from GitHub that offers a free installation service, and any user can find sage-mt5-large on GitHub to install. At the same time, huggingface.co provides the effect of sage-mt5-large install, users can directly use sage-mt5-large installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

sage-mt5-large install url in huggingface.co:

https://huggingface.co/ai-forever/sage-mt5-large

huggingface.co

ai-forever/sbert_large_nlu_ru

Total runs: 1.4M

Run Growth: 436.8K

Growth Rate: 31.83%

Updated: 2024年10月7日

huggingface.co

ai-forever/ruBert-base

Total runs: 525.5K

Run Growth: 507.4K

Growth Rate: 96.56%

Updated: 2023年11月3日

huggingface.co

ai-forever/rugpt3large_based_on_gpt2

Total runs: 34.1K

Run Growth: 24.6K

Growth Rate: 72.34%

Updated: 2023年12月4日

huggingface.co

ai-forever/sage-fredt5-distilled-95m

Total runs: 25.8K

Run Growth: 24.3K

Growth Rate: 94.29%

Updated: 2024年4月18日

huggingface.co

ai-forever/rugpt3small_based_on_gpt2

Total runs: 24.7K

Run Growth: 6.1K

Growth Rate: 24.60%

Updated: 2023年12月5日

huggingface.co

ai-forever/ruRoberta-large

Total runs: 20.8K

Run Growth: 8.2K

Growth Rate: 39.22%

Updated: 2023年11月3日

huggingface.co

ai-forever/mGPT

Total runs: 10.6K

Run Growth: 1.5K

Growth Rate: 13.75%

Updated: 2023年12月5日

huggingface.co

ai-forever/FRIDA

Total runs: 8.2K

Run Growth: 5.1K

Growth Rate: 59.45%

Updated: 2024年12月29日

huggingface.co

ai-forever/ru-en-RoSBERTa

Total runs: 7.2K

Run Growth: -3.4K

Growth Rate: -46.53%

Updated: 2024年9月26日

huggingface.co

ai-forever/ruT5-base

Total runs: 5.9K

Run Growth: 3.5K

Growth Rate: 60.26%

Updated: 2023年12月11日

huggingface.co

ai-forever/rugpt3medium_based_on_gpt2

Total runs: 5.3K

Run Growth: 1.0K

Growth Rate: 18.82%

Updated: 2023年12月5日

huggingface.co

ai-forever/ruclip-vit-base-patch32-384

Total runs: 4.6K

Run Growth: 3.3K

Growth Rate: 72.52%

Updated: 2022年1月10日

huggingface.co

ai-forever/FRED-T5-large-spell

Total runs: 3.7K

Run Growth: -723

Growth Rate: -19.64%

Updated: 2024年8月2日

huggingface.co

ai-forever/ruGPT-3.5-13B

Total runs: 3.4K

Run Growth: 975

Growth Rate: 28.43%

Updated: 2023年12月5日

huggingface.co

ai-forever/T5-large-spell

Total runs: 2.6K

Run Growth: 96

Growth Rate: 3.76%

Updated: 2024年8月2日

huggingface.co

ai-forever/mGPT-13B

Total runs: 2.3K

Run Growth: -408

Growth Rate: -17.86%

Updated: 2023年12月5日

huggingface.co

ai-forever/FRED-T5-1.7B

Total runs: 2.2K

Run Growth: 716

Growth Rate: 32.59%

Updated: 2023年12月5日

huggingface.co

ai-forever/ruBert-large

Total runs: 2.1K

Run Growth: 444

Growth Rate: 21.11%

Updated: 2023年11月3日

huggingface.co

ai-forever/ruT5-large

Total runs: 1.8K

Run Growth: 5

Growth Rate: 0.28%

Updated: 2023年12月28日

huggingface.co

ai-forever/rugpt2large

Total runs: 1.3K

Run Growth: -6.3K

Growth Rate: -493.39%

Updated: 2023年12月5日

huggingface.co

ai-forever/sbert_large_mt_nlu_ru

Total runs: 994

Run Growth: -183

Growth Rate: -18.41%

Updated: 2024年6月13日

huggingface.co

ai-forever/RuM2M100-418M

Total runs: 907

Run Growth: 763

Growth Rate: 84.12%

Updated: 2024年8月2日

huggingface.co

ai-forever/FRED-T5-large

Total runs: 845

Run Growth: 480

Growth Rate: 56.80%

Updated: 2023年12月5日

huggingface.co

ai-forever/RuM2M100-1.2B

Total runs: 766

Run Growth: 716

Growth Rate: 93.47%

Updated: 2024年8月2日

huggingface.co

ai-forever/sage-fredt5-large

Total runs: 488

Run Growth: -537

Growth Rate: -110.04%

Updated: 2024年4月3日

huggingface.co

ai-forever/sage-v1.1.0

Total runs: 470

Run Growth: 267

Growth Rate: 56.69%

Updated: 2024年11月19日

huggingface.co

ai-forever/mGPT-1.3B-persian

Total runs: 387

Run Growth: 207

Growth Rate: 53.49%

Updated: 2023年8月11日

huggingface.co

ai-forever/ruSciBERT

Total runs: 315

Run Growth: 158

Growth Rate: 50.16%

Updated: 2023年1月26日

huggingface.co

ai-forever/ruElectra-medium

Total runs: 300

Run Growth: 206

Growth Rate: 68.67%

Updated: 2023年11月3日

huggingface.co

ai-forever/ruElectra-small

Total runs: 287

Run Growth: -195

Growth Rate: -67.94%

Updated: 2023年11月3日

huggingface.co

ai-forever/ruElectra-large

Total runs: 251

Run Growth: 148

Growth Rate: 58.96%

Updated: 2023年11月3日

huggingface.co

ai-forever/mGPT-1.3B-uzbek

Total runs: 235

Run Growth: 64

Growth Rate: 27.23%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-romanian

Total runs: 200

Run Growth: -6

Growth Rate: -3.00%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-armenian

Total runs: 188

Run Growth: 132

Growth Rate: 70.21%

Updated: 2022年8月31日

huggingface.co

ai-forever/mGPT-1.3B-kazakh

Total runs: 177

Run Growth: 100

Growth Rate: 56.50%

Updated: 2023年8月11日

huggingface.co

ai-forever/sage-m2m100-1.2B

Total runs: 144

Run Growth: 26

Growth Rate: 18.06%

Updated: 2024年4月3日

huggingface.co

ai-forever/mGPT-1.3B-azerbaijan

Total runs: 113

Run Growth: -87

Growth Rate: -76.99%

Updated: 2023年8月11日

huggingface.co

ai-forever/ruclip-vit-base-patch32-224

Total runs: 96

Run Growth: 80

Growth Rate: 83.33%

Updated: 2022年1月9日

huggingface.co

ai-forever/ruclip-vit-large-patch14-336

Total runs: 91

Run Growth: -750

Growth Rate: -824.18%

Updated: 2022年1月9日

huggingface.co

ai-forever/mGPT-1.3B-armenian

Total runs: 76

Run Growth: 18

Growth Rate: 23.68%

Updated: 2023年8月11日

huggingface.co

ai-forever/kandinsky3_controlnet_v2_depth

Total runs: 58

Run Growth: 6

Growth Rate: 10.17%

Updated: 2024年11月26日

huggingface.co

ai-forever/mGPT-1.3B-tatar

Total runs: 51

Run Growth: 40

Growth Rate: 78.43%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-georgian

Total runs: 49

Run Growth: 18

Growth Rate: 36.73%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-ukranian

Total runs: 48

Run Growth: 33

Growth Rate: 68.75%

Updated: 2023年8月14日

huggingface.co

ai-forever/mGPT-1.3B-turkmen

Total runs: 43

Run Growth: 31

Growth Rate: 72.09%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-bulgarian

Total runs: 33

Run Growth: -39

Growth Rate: -118.18%

Updated: 2023年8月11日

huggingface.co

ai-forever/bert-base-NER-reptile-5-datasets

Total runs: 30

Run Growth: 4

Growth Rate: 13.33%

Updated: 2022年2月4日

huggingface.co

ai-forever/ruclip-vit-base-patch16-224

Total runs: 29

Run Growth: 25

Growth Rate: 86.21%

Updated: 2022年1月9日

huggingface.co

ai-forever/mGPT-1.3B-mongol

Total runs: 29

Run Growth: 13

Growth Rate: 44.83%

Updated: 2023年8月11日

huggingface.co

ai-forever/ruclip-vit-base-patch16-384

Total runs: 22

Run Growth: 13

Growth Rate: 59.09%

Updated: 2022年1月11日

huggingface.co

ai-forever/mGPT-1.3B-tajik

Total runs: 16

Run Growth: -32

Growth Rate: -200.00%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-kalmyk

Total runs: 14

Run Growth: -44

Growth Rate: -314.29%

Updated: 2023年8月11日

huggingface.co

ai-forever/kandinsky-4-v2a

Total runs: 13

Run Growth: -41

Growth Rate: -256.25%

Updated: 2024年12月13日

huggingface.co

ai-forever/mGPT-1.3B-mari

Total runs: 13

Run Growth: -5

Growth Rate: -38.46%

Updated: 2023年8月11日

huggingface.co

ai-forever/kandinsky3_ip_adapter

Total runs: 13

Run Growth: 14

Growth Rate: 100.00%

Updated: 2024年4月4日

huggingface.co

ai-forever/mGPT-1.3B-kirgiz

Total runs: 13

Run Growth: -2

Growth Rate: -15.38%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-bashkir

Total runs: 12

Run Growth: -6

Growth Rate: -50.00%

Updated: 2023年8月11日

huggingface.co

ai-forever/ruclip-vit-large-patch14-224

Total runs: 12

Run Growth: 4

Growth Rate: 33.33%

Updated: 2022年1月9日

huggingface.co

ai-forever/mGPT-1.3B-chuvash

Total runs: 12

Run Growth: -16

Growth Rate: -133.33%

Updated: 2023年8月11日

huggingface.co

ai-forever/kandinsky3_controlnet_v2_scribble

Total runs: 11

Run Growth: 1

Growth Rate: 8.33%

Updated: 2024年12月20日

huggingface.co

ai-forever/mGPT-1.3B-yakut

Total runs: 10

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-buryat

Total runs: 9

Run Growth: -3

Growth Rate: -33.33%

Updated: 2023年8月11日

huggingface.co

ai-forever/kandinsky4-Audio

Total runs: 9

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年12月11日

huggingface.co

ai-forever/kandinsky3-diffusers

Total runs: 9

Run Growth: -8

Growth Rate: -88.89%

Updated: 2024年2月20日

huggingface.co

ai-forever/mGPT-1.3B-ossetian

Total runs: 9

Run Growth: -18

Growth Rate: -200.00%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-tuvan

Total runs: 5

Run Growth: -8

Growth Rate: -160.00%

Updated: 2023年8月11日

huggingface.co

ai-forever/mGPT-1.3B-belorussian

Total runs: 3

Run Growth: -32

Growth Rate: -1066.67%

Updated: 2023年8月11日

huggingface.co

ai-forever/kandinsky3_controlnet_hed

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年4月4日

huggingface.co

ai-forever/Kandinsky_2.0

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年11月24日

huggingface.co

ai-forever/rudalle-Malevich

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年1月11日

huggingface.co

ai-forever/ru-clip

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2021年12月24日

huggingface.co

ai-forever/MoVQGAN

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年6月8日

huggingface.co

ai-forever/RUDOLPH-2.7B-FBC2

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年10月16日

huggingface.co

ai-forever/KandinskyVideo_1_1

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年5月27日

huggingface.co

ai-forever/Kandinsky3.1

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年11月14日

huggingface.co

ai-forever/Real-ESRGAN

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年9月25日

huggingface.co

ai-forever/rugpt3xl

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2021年9月21日

huggingface.co

ai-forever/rudalle-utils

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年9月25日

huggingface.co

ai-forever/ReadingPipeline-notebooks

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年2月6日

huggingface.co

ai-forever/RUDOLPH-350M

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年10月9日

huggingface.co

ai-forever/paper_persi_chat

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年10月4日

huggingface.co

ai-forever/kandinsky4

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年12月11日

huggingface.co

ai-forever/scrabblegan-peter

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年10月17日

huggingface.co

ai-forever/ReadingPipeline-Peter

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年11月23日

huggingface.co

ai-forever/Kandinsky_2.1

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年4月5日

huggingface.co

ai-forever/fbc3_baseline

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年9月28日

huggingface.co

ai-forever/KandiSuperRes

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年8月21日

huggingface.co

ai-forever/RUDOLPH-1.3B

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年10月6日

huggingface.co

ai-forever/Sber-VQGAN

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2021年11月8日

huggingface.co

ai-forever/Kandinsky3.0

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年11月22日

huggingface.co

ai-forever/scrabblegan-notebooks

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年10月18日

huggingface.co

ai-forever/rudalle-Emojich

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2021年12月2日

huggingface.co

ai-forever/kandinsky-4-t2v-flash

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2024年12月13日

huggingface.co

ai-forever/RUDOLPH-2.7B

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2022年10月6日

huggingface.co

ai-forever/KandinskyVideo

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年11月22日

huggingface.co

ai-forever/tags-generation

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: 2023年3月24日

ai-forever / sage-mt5-large

Introduction of sage-mt5-large

Model Details of sage-mt5-large

sage-mt5-large

Summary

Public references

Examples

Metrics

Quality

How to use

Limitations

Resources

License

Specifications

Contacts

Runs of ai-forever sage-mt5-large on huggingface.co

More Information About sage-mt5-large huggingface.co Model

More sage-mt5-large license Visit here:

sage-mt5-large huggingface.co

sage-mt5-large huggingface.co Url

ai-forever sage-mt5-large online free

ai-forever sage-mt5-large online free url in huggingface.co:

sage-mt5-large install

sage-mt5-large install url in huggingface.co:

Url of sage-mt5-large

sage-mt5-large huggingface.co Url

Provider of sage-mt5-large huggingface.co

Other API from ai-forever