This model is a distilled version of the BERT base multilingual model. The code for the distillation process can be found here. This model is cased: it does make a difference between english and English.
The model is trained on the concatenation of Wikipedia in 104 different languages listed here.
The model has 6 layers, 768 dimensions and 12 heads, totaling 134M parameters (compared to 177M parameters for mBERT-base).
On average, this model, referred to as DistilmBERT, is twice as fast as mBERT-base.
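The architecture figures above can be checked directly from the published configuration. The snippet below is a minimal sketch assuming the transformers library is installed; it simply reads the config and counts parameters.

from transformers import AutoConfig, AutoModel

# Read the released configuration to confirm the architecture described above.
config = AutoConfig.from_pretrained("distilbert-base-multilingual-cased")
print(config.n_layers, config.dim, config.n_heads)  # expected: 6 768 12

# Loading the weights allows a rough parameter count (about 134M).
model = AutoModel.from_pretrained("distilbert-base-multilingual-cased")
print(f"{model.num_parameters():,} parameters")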
We encourage potential users of this model to check out the BERT base multilingual model card to learn more about usage, limitations and potential biases.
Developed by: Victor Sanh, Lysandre Debut, Julien Chaumond, Thomas Wolf (Hugging Face)
Model type: Transformer-based language model
Language(s) (NLP): 104 languages; see the full list here
You can use the raw model for either masked language modeling or next sentence prediction, but it's mostly intended to be fine-tuned on a downstream task. See the model hub to look for fine-tuned versions on a task that interests you.
Note that this model is primarily aimed at being fine-tuned on tasks that use the whole sentence (potentially masked) to make decisions, such as sequence classification, token classification or question answering. For tasks such as text generation, you should look at models like GPT-2.
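As a minimal sketch of that fine-tuning path, the snippet below loads the checkpoint with a freshly initialized sequence-classification head; num_labels=2 and the French input sentence are illustrative choices, not part of the original card.

from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Load the multilingual checkpoint with a new (untrained) classification head.
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-multilingual-cased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-multilingual-cased", num_labels=2
)

# Tokenize an example sentence and run a forward pass.
inputs = tokenizer("Ceci est un exemple.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits.shape)  # torch.Size([1, 2]); logits are meaningless until the head is fine-tuned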
Out of Scope Use
The model should not be used to intentionally create hostile or alienating environments for people. The model was not trained to produce factual or true representations of people or events, so using it to generate such content is out of scope for its abilities.
Bias, Risks, and Limitations
Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al. (2021) and Bender et al. (2021)). Predictions generated by the model may include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups.
Recommendations
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.
Training Details
The model was pretrained with the supervision of bert-base-multilingual-cased on the concatenation of Wikipedia in 104 different languages. The model has 6 layers, 768 dimensions and 12 heads, totaling 134M parameters.
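The full distillation code lives in the repository linked above. As a rough conceptual sketch only (not the authors' implementation, which also combines a masked-language-modeling term and a cosine embedding loss), the student can be trained to match the teacher's temperature-softened predictions with a KL-divergence loss:

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soft-target loss: KL divergence between temperature-softened distributions.
    # Illustrative sketch only; see the linked repository for the actual training objective.
    s = F.log_softmax(student_logits / temperature, dim=-1)
    t = F.softmax(teacher_logits / temperature, dim=-1)
    return F.kl_div(s, t, reduction="batchmean") * temperature ** 2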
Evaluation
The model developers report the following accuracy results for DistilmBERT (see the GitHub Repo):
Here are the results on the test sets for 6 of the languages available in XNLI. The results are computed in the zero-shot setting (trained on the English portion and evaluated on the target-language portion):
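As a hedged illustration of that zero-shot protocol (the fine-tuned checkpoint path below is a placeholder; only the base checkpoint is published), one could fine-tune on the English XNLI portion and then evaluate directly on another language:

import torch
from datasets import load_dataset
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Placeholder path: a DistilmBERT checkpoint fine-tuned on the English portion of XNLI.
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-multilingual-cased")
model = AutoModelForSequenceClassification.from_pretrained("path/to/distilmbert-finetuned-en-xnli")
model.eval()

# Evaluate zero-shot on the French test split.
xnli_fr = load_dataset("xnli", "fr", split="test")
ex = xnli_fr[0]
inputs = tokenizer(ex["premise"], ex["hypothesis"], return_tensors="pt", truncation=True)
with torch.no_grad():
    pred = model(**inputs).logits.argmax(dim=-1).item()
print(pred, ex["label"])  # XNLI labels: 0 = entailment, 1 = neutral, 2 = contradiction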
Citation
BibTeX
@article{Sanh2019DistilBERTAD,
title={DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter},
author={Victor Sanh and Lysandre Debut and Julien Chaumond and Thomas Wolf},
journal={ArXiv},
year={2019},
volume={abs/1910.01108}
}
APA
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
How to Get Started With the Model
You can use the model directly with a pipeline for masked language modeling:
>>> from transformers import pipeline
>>> unmasker = pipeline('fill-mask', model='distilbert-base-multilingual-cased')
>>> unmasker("Hello I'm a [MASK] model.")
[{'score': 0.040800247341394424,
'sequence': "Hello I'm a virtual model.",
'token': 37859,
'token_str': 'virtual'},
{'score': 0.020015988498926163,
'sequence': "Hello I'm a big model.",
'token': 22185,
'token_str': 'big'},
{'score': 0.018680453300476074,
'sequence': "Hello I'm a Hello model.",
'token': 31178,
'token_str': 'Hello'},
{'score': 0.017396586015820503,
'sequence': "Hello I'm a model model.",
'token': 13192,
'token_str': 'model'},
{'score': 0.014229810796678066,
'sequence': "Hello I'm a perfect model.",
'token': 43477,
'token_str': 'perfect'}]
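Because the checkpoint is multilingual, the same pipeline can be used with prompts in other languages; the French prompt below is an illustrative example (its output is not shown and will differ from the English example above):

>>> unmasker("Paris est la [MASK] de la France.")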