deepset / tinybert-6l-768d-squad2

huggingface.co
Total runs: 292
24-hour runs: 0
7-day runs: 155
30-day runs: -31
Model's Last Updated: September 26 2024
question-answering

Introduction of tinybert-6l-768d-squad2

Model Details of tinybert-6l-768d-squad2

Overview

Language model: deepset/tinybert-6L-768D-squad2
Language: English
Training data: SQuAD 2.0 training set x 20 augmented + SQuAD 2.0 training set without augmentation
Eval data: SQuAD 2.0 dev set
Infrastructure : 1x V100 GPU
Published : Dec 8th, 2021

Details
  • haystack's intermediate layer and prediction layer distillation features were used for training (based on TinyBERT ). deepset/bert-base-uncased-squad2 was used as the teacher model and huawei-noah/TinyBERT_General_6L_768D was used as the student model.
Hyperparameters
Intermediate layer distillation
batch_size = 26
n_epochs = 5
max_seq_len = 384
learning_rate = 5e-5
lr_schedule = LinearWarmup
embeds_dropout_prob = 0.1
temperature = 1
Prediction layer distillation
batch_size = 26
n_epochs = 5
max_seq_len = 384
learning_rate = 3e-5
lr_schedule = LinearWarmup
embeds_dropout_prob = 0.1
temperature = 1
distillation_loss_weight = 1.0
Performance
"exact": 71.87736882001179
"f1": 76.36111895973675
Authors
  • Timo Möller: timo.moeller [at] deepset.ai
  • Julian Risch: julian.risch [at] deepset.ai
  • Malte Pietsch: malte.pietsch [at] deepset.ai
  • Michel Bartels: michel.bartels [at] deepset.ai
About us

deepset logo We bring NLP to the industry via open source!
Our focus: Industry specific language models & large scale QA systems.

Some of our work:

Get in touch: Twitter | LinkedIn | Discord | GitHub Discussions | Website

By the way: we're hiring!

Runs of deepset tinybert-6l-768d-squad2 on huggingface.co

292
Total runs
0
24-hour runs
22
3-day runs
155
7-day runs
-31
30-day runs

More Information About tinybert-6l-768d-squad2 huggingface.co Model

More tinybert-6l-768d-squad2 license Visit here:

https://choosealicense.com/licenses/mit

tinybert-6l-768d-squad2 huggingface.co

tinybert-6l-768d-squad2 huggingface.co is an AI model on huggingface.co that provides tinybert-6l-768d-squad2's model effect (), which can be used instantly with this deepset tinybert-6l-768d-squad2 model. huggingface.co supports a free trial of the tinybert-6l-768d-squad2 model, and also provides paid use of the tinybert-6l-768d-squad2. Support call tinybert-6l-768d-squad2 model through api, including Node.js, Python, http.

tinybert-6l-768d-squad2 huggingface.co Url

https://huggingface.co/deepset/tinybert-6l-768d-squad2

deepset tinybert-6l-768d-squad2 online free

tinybert-6l-768d-squad2 huggingface.co is an online trial and call api platform, which integrates tinybert-6l-768d-squad2's modeling effects, including api services, and provides a free online trial of tinybert-6l-768d-squad2, you can try tinybert-6l-768d-squad2 online for free by clicking the link below.

deepset tinybert-6l-768d-squad2 online free url in huggingface.co:

https://huggingface.co/deepset/tinybert-6l-768d-squad2

tinybert-6l-768d-squad2 install

tinybert-6l-768d-squad2 is an open source model from GitHub that offers a free installation service, and any user can find tinybert-6l-768d-squad2 on GitHub to install. At the same time, huggingface.co provides the effect of tinybert-6l-768d-squad2 install, users can directly use tinybert-6l-768d-squad2 installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

tinybert-6l-768d-squad2 install url in huggingface.co:

https://huggingface.co/deepset/tinybert-6l-768d-squad2

Url of tinybert-6l-768d-squad2

tinybert-6l-768d-squad2 huggingface.co Url

Provider of tinybert-6l-768d-squad2 huggingface.co

deepset
ORGANIZATIONS

Other API from deepset

huggingface.co

Total runs: 51.3K
Run Growth: 42.2K
Growth Rate: 82.10%
Updated: September 26 2024
huggingface.co

Total runs: 27.6K
Run Growth: 3.7K
Growth Rate: 13.44%
Updated: September 26 2024
huggingface.co

Total runs: 679
Run Growth: 297
Growth Rate: 43.74%
Updated: September 26 2024