SeamlessM4T
is our foundational all-in-one
M
assively
M
ultilingual and
M
ultimodal
M
achine
T
ranslation model delivering high-quality translation for speech and text in nearly 100 languages.
SeamlessM4T models support the tasks of:
Speech-to-speech translation (S2ST)
Speech-to-text translation (S2TT)
Text-to-speech translation (T2ST)
Text-to-text translation (T2TT)
Automatic speech recognition (ASR).
SeamlessM4T models support:
🎤 101 languages for speech input.
💬 96 Languages for text input/output.
🔊 35 languages for speech output.
🌟 We are releasing SeamlessM4T v2, an updated version with our novel
UnitY2
architecture.
This new model improves over SeamlessM4T v1 in quality as well as inference speed in speech generation tasks.
The v2 version of SeamlessM4T is a multitask adaptation of our novel
UnitY2
architecture.
Unity2
with its hierarchical character-to-unit upsampling and non-autoregressive text-to-unit decoding considerably improves over SeamlessM4T v1 in quality and inference speed.
We provide the extensive evaluation results of seamlessM4T-Large and SeamlessM4T-Medium reported in the paper (as averages) in the
metrics
files above.
The evaluation data ids for FLEURS, CoVoST2 and CVSS-C can be found
here
Evaluating SeamlessM4T models
To reproduce our results or to evaluate using the same metrics over your own test sets, please check out the
Evaluation README here
.
For more details on using the SeamlessM4T model for inference using the 🤗 Transformers library, refer to the
SeamlessM4T v2 docs
or to this
hands-on
Google Colab
.
Supported Languages:
Listed below, are the languages supported by SeamlessM4T-large (v1/v2).
The
source
column specifies whether a language is supported as source speech (
Sp
) and/or source text (
Tx
).
The
target
column specifies whether a language is supported as target speech (
Sp
) and/or target text (
Tx
).
code
language
script
Source
Target
afr
Afrikaans
Latn
Sp, Tx
Tx
amh
Amharic
Ethi
Sp, Tx
Tx
arb
Modern Standard Arabic
Arab
Sp, Tx
Sp, Tx
ary
Moroccan Arabic
Arab
Sp, Tx
Tx
arz
Egyptian Arabic
Arab
Sp, Tx
Tx
asm
Assamese
Beng
Sp, Tx
Tx
ast
Asturian
Latn
Sp
--
azj
North Azerbaijani
Latn
Sp, Tx
Tx
bel
Belarusian
Cyrl
Sp, Tx
Tx
ben
Bengali
Beng
Sp, Tx
Sp, Tx
bos
Bosnian
Latn
Sp, Tx
Tx
bul
Bulgarian
Cyrl
Sp, Tx
Tx
cat
Catalan
Latn
Sp, Tx
Sp, Tx
ceb
Cebuano
Latn
Sp, Tx
Tx
ces
Czech
Latn
Sp, Tx
Sp, Tx
ckb
Central Kurdish
Arab
Sp, Tx
Tx
cmn
Mandarin Chinese
Hans
Sp, Tx
Sp, Tx
cmn_Hant
Mandarin Chinese
Hant
Sp, Tx
Sp, Tx
cym
Welsh
Latn
Sp, Tx
Sp, Tx
dan
Danish
Latn
Sp, Tx
Sp, Tx
deu
German
Latn
Sp, Tx
Sp, Tx
ell
Greek
Grek
Sp, Tx
Tx
eng
English
Latn
Sp, Tx
Sp, Tx
est
Estonian
Latn
Sp, Tx
Sp, Tx
eus
Basque
Latn
Sp, Tx
Tx
fin
Finnish
Latn
Sp, Tx
Sp, Tx
fra
French
Latn
Sp, Tx
Sp, Tx
fuv
Nigerian Fulfulde
Latn
Sp, Tx
Tx
gaz
West Central Oromo
Latn
Sp, Tx
Tx
gle
Irish
Latn
Sp, Tx
Tx
glg
Galician
Latn
Sp, Tx
Tx
guj
Gujarati
Gujr
Sp, Tx
Tx
heb
Hebrew
Hebr
Sp, Tx
Tx
hin
Hindi
Deva
Sp, Tx
Sp, Tx
hrv
Croatian
Latn
Sp, Tx
Tx
hun
Hungarian
Latn
Sp, Tx
Tx
hye
Armenian
Armn
Sp, Tx
Tx
ibo
Igbo
Latn
Sp, Tx
Tx
ind
Indonesian
Latn
Sp, Tx
Sp, Tx
isl
Icelandic
Latn
Sp, Tx
Tx
ita
Italian
Latn
Sp, Tx
Sp, Tx
jav
Javanese
Latn
Sp, Tx
Tx
jpn
Japanese
Jpan
Sp, Tx
Sp, Tx
kam
Kamba
Latn
Sp
--
kan
Kannada
Knda
Sp, Tx
Tx
kat
Georgian
Geor
Sp, Tx
Tx
kaz
Kazakh
Cyrl
Sp, Tx
Tx
kea
Kabuverdianu
Latn
Sp
--
khk
Halh Mongolian
Cyrl
Sp, Tx
Tx
khm
Khmer
Khmr
Sp, Tx
Tx
kir
Kyrgyz
Cyrl
Sp, Tx
Tx
kor
Korean
Kore
Sp, Tx
Sp, Tx
lao
Lao
Laoo
Sp, Tx
Tx
lit
Lithuanian
Latn
Sp, Tx
Tx
ltz
Luxembourgish
Latn
Sp
--
lug
Ganda
Latn
Sp, Tx
Tx
luo
Luo
Latn
Sp, Tx
Tx
lvs
Standard Latvian
Latn
Sp, Tx
Tx
mai
Maithili
Deva
Sp, Tx
Tx
mal
Malayalam
Mlym
Sp, Tx
Tx
mar
Marathi
Deva
Sp, Tx
Tx
mkd
Macedonian
Cyrl
Sp, Tx
Tx
mlt
Maltese
Latn
Sp, Tx
Sp, Tx
mni
Meitei
Beng
Sp, Tx
Tx
mya
Burmese
Mymr
Sp, Tx
Tx
nld
Dutch
Latn
Sp, Tx
Sp, Tx
nno
Norwegian Nynorsk
Latn
Sp, Tx
Tx
nob
Norwegian Bokmål
Latn
Sp, Tx
Tx
npi
Nepali
Deva
Sp, Tx
Tx
nya
Nyanja
Latn
Sp, Tx
Tx
oci
Occitan
Latn
Sp
--
ory
Odia
Orya
Sp, Tx
Tx
pan
Punjabi
Guru
Sp, Tx
Tx
pbt
Southern Pashto
Arab
Sp, Tx
Tx
pes
Western Persian
Arab
Sp, Tx
Sp, Tx
pol
Polish
Latn
Sp, Tx
Sp, Tx
por
Portuguese
Latn
Sp, Tx
Sp, Tx
ron
Romanian
Latn
Sp, Tx
Sp, Tx
rus
Russian
Cyrl
Sp, Tx
Sp, Tx
slk
Slovak
Latn
Sp, Tx
Sp, Tx
slv
Slovenian
Latn
Sp, Tx
Tx
sna
Shona
Latn
Sp, Tx
Tx
snd
Sindhi
Arab
Sp, Tx
Tx
som
Somali
Latn
Sp, Tx
Tx
spa
Spanish
Latn
Sp, Tx
Sp, Tx
srp
Serbian
Cyrl
Sp, Tx
Tx
swe
Swedish
Latn
Sp, Tx
Sp, Tx
swh
Swahili
Latn
Sp, Tx
Sp, Tx
tam
Tamil
Taml
Sp, Tx
Tx
tel
Telugu
Telu
Sp, Tx
Sp, Tx
tgk
Tajik
Cyrl
Sp, Tx
Tx
tgl
Tagalog
Latn
Sp, Tx
Sp, Tx
tha
Thai
Thai
Sp, Tx
Sp, Tx
tur
Turkish
Latn
Sp, Tx
Sp, Tx
ukr
Ukrainian
Cyrl
Sp, Tx
Sp, Tx
urd
Urdu
Arab
Sp, Tx
Sp, Tx
uzn
Northern Uzbek
Latn
Sp, Tx
Sp, Tx
vie
Vietnamese
Latn
Sp, Tx
Sp, Tx
xho
Xhosa
Latn
Sp
--
yor
Yoruba
Latn
Sp, Tx
Tx
yue
Cantonese
Hant
Sp, Tx
Tx
zlm
Colloquial Malay
Latn
Sp
--
zsm
Standard Malay
Latn
Tx
Tx
zul
Zulu
Latn
Sp, Tx
Tx
Note that seamlessM4T-medium supports 200 languages in the text modality, and is based on NLLB-200 (see full list in
asset card
)
Citation
For SeamlessM4T v2, please cite :
@inproceedings{seamless2023,
title="Seamless: Multilingual Expressive and Streaming Speech Translation",
author="{Seamless Communication}, Lo{\"i}c Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-juss{\`a}, Maha Elbayad, Hongyu Gong, Francisco Guzm{\'a}n, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson",
journal={ArXiv},
year={2023}
}
Runs of facebook seamless-m4t-v2-large on huggingface.co
120.7K
Total runs
812
24-hour runs
108
3-day runs
5.8K
7-day runs
61.1K
30-day runs
More Information About seamless-m4t-v2-large huggingface.co Model
seamless-m4t-v2-large huggingface.co is an AI model on huggingface.co that provides seamless-m4t-v2-large's model effect (), which can be used instantly with this facebook seamless-m4t-v2-large model. huggingface.co supports a free trial of the seamless-m4t-v2-large model, and also provides paid use of the seamless-m4t-v2-large. Support call seamless-m4t-v2-large model through api, including Node.js, Python, http.
seamless-m4t-v2-large huggingface.co is an online trial and call api platform, which integrates seamless-m4t-v2-large's modeling effects, including api services, and provides a free online trial of seamless-m4t-v2-large, you can try seamless-m4t-v2-large online for free by clicking the link below.
facebook seamless-m4t-v2-large online free url in huggingface.co:
seamless-m4t-v2-large is an open source model from GitHub that offers a free installation service, and any user can find seamless-m4t-v2-large on GitHub to install. At the same time, huggingface.co provides the effect of seamless-m4t-v2-large install, users can directly use seamless-m4t-v2-large installed effect in huggingface.co for debugging and trial. It also supports api for free installation.
seamless-m4t-v2-large install url in huggingface.co: