castorini / azbert-base

huggingface.co
Total runs: 123
24-hour runs: 2
7-day runs: 9
30-day runs: 119
Model's Last Updated: November 05 2021
fill-mask

Introduction of azbert-base

Model Details of azbert-base

About

Here we share a pretrained BERT model that is aware of math tokens. Math tokens are treated specially and tokenized using pya0, which adds only a small number of new tokens for LaTeX markup (the total vocabulary is just 31,061).

This model was trained on 4 x 2 Tesla V100 GPUs with a total batch size of 64, using 2.7 million sentence pairs from Math StackExchange for 7 epochs.
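As a rough sanity check on the training budget, the figures above can be turned into an approximate step count. This is only a sketch: it assumes each sentence pair is one training example, that the total batch size of 64 is consumed once per optimizer step (no gradient accumulation), and that the last partial batch of each epoch is kept. None of these details are stated in the model card, and "2.7 million" is itself a rounded figure.

```python
import math

# Figures quoted in the model card; "2.7 million" is rounded, so the result is approximate.
NUM_SENTENCE_PAIRS = 2_700_000
TOTAL_BATCH_SIZE = 64
NUM_EPOCHS = 7


def optimizer_steps(num_examples: int, batch_size: int, epochs: int) -> int:
    """Number of steps when every epoch keeps its last partial batch.

    Assumption: one optimizer step per batch (no gradient accumulation).
    """
    return math.ceil(num_examples / batch_size) * epochs


steps = optimizer_steps(NUM_SENTENCE_PAIRS, TOTAL_BATCH_SIZE, NUM_EPOCHS)
print(f"about {steps:,} optimizer steps in total")
```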

Usage

Download and try it out

pip install pya0==0.3.2
wget https://vault.cs.uwaterloo.ca/s/gqstFZmWHCLGXe3/download -O ckpt.tar.gz
mkdir -p ckpt
tar xzf ckpt.tar.gz -C ckpt --strip-components=1
python test.py --test_file test.txt

Test file format

Modify the test examples in test.txt to play with it.

The test file is tab-separated. The first column lists additional positions you want to mask in the right-side sentence (useful for masking tokens in math markup); a zero means no additional mask positions.
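To make the format concrete, here is a minimal sketch of how one line of such a file could be parsed. It is not the parser used by test.py. The names MaskExample and parse_test_line are hypothetical, and the sketch assumes that several mask positions are written as comma-separated integers and that all remaining columns hold the sentences; the description above only fixes the tab separator and the meaning of zero.

```python
from typing import List, NamedTuple


class MaskExample(NamedTuple):
    extra_mask_positions: List[int]  # empty when the first column is "0"
    sentences: List[str]             # the remaining tab-separated columns


def parse_test_line(line: str) -> MaskExample:
    """Split one tab-separated test line into mask positions and sentences."""
    columns = line.rstrip("\n").split("\t")
    if len(columns) < 2:
        raise ValueError("expected a mask column followed by at least one sentence")
    # Assumption: multiple positions are comma-separated, e.g. "2,5".
    positions = [int(p) for p in columns[0].split(",") if p.strip()]
    # A single zero means "no additional mask positions".
    if positions == [0]:
        positions = []
    return MaskExample(positions, columns[1:])


example = parse_test_line("2,5\tLeft sentence.\tRight sentence with $x^2$.\n")
print(example.extra_mask_positions, example.sentences)
```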

Example output

Upload to huggingface

This repo is hosted on GitHub and only mirrored on Hugging Face.

To upload to Hugging Face, use the upload2hgf.sh script. Before running this script, be sure to check that:

  • checkpoints for the model and tokenizer have been created under the ./ckpt folder
  • model contains all the files needed: config.json and pytorch_model.bin
  • tokenizer contains all the files needed: added_tokens.json, special_tokens_map.json, tokenizer_config.json, vocab.txt and tokenizer.json
  • there is no tokenizer_file field in tokenizer_config.json (sometimes it points to a local path under ~/.cache)
  • git-lfs is installed
  • a git remote named hgf points to https://huggingface.co/castorini/azbert-base
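The checklist above can be automated with a short script. The following is a sketch, not part of the repository: the function names check_checkpoint and check_git are hypothetical, and it assumes the checkpoints live in the subfolders ./ckpt/model and ./ckpt/tokenizer, which the checklist suggests but does not state outright.

```python
import json
import shutil
import subprocess
from pathlib import Path
from typing import List

MODEL_FILES = ["config.json", "pytorch_model.bin"]
TOKENIZER_FILES = [
    "added_tokens.json", "special_tokens_map.json",
    "tokenizer_config.json", "vocab.txt", "tokenizer.json",
]


def check_checkpoint(ckpt_dir: str = "./ckpt") -> List[str]:
    """Return a list of problems found in the checkpoint folder (empty if none)."""
    root = Path(ckpt_dir)
    problems = []
    # Assumption: checkpoints are stored in <ckpt_dir>/model and <ckpt_dir>/tokenizer.
    for sub, names in (("model", MODEL_FILES), ("tokenizer", TOKENIZER_FILES)):
        for name in names:
            if not (root / sub / name).is_file():
                problems.append(f"missing {sub}/{name}")
    config_path = root / "tokenizer" / "tokenizer_config.json"
    if config_path.is_file():
        config = json.loads(config_path.read_text(encoding="utf-8"))
        if "tokenizer_file" in config:
            problems.append("tokenizer_config.json has a tokenizer_file field")
    return problems


def check_git(remote: str = "hgf",
              url: str = "https://huggingface.co/castorini/azbert-base") -> List[str]:
    """Check that git-lfs is available and that the remote points to the expected URL."""
    problems = []
    if shutil.which("git-lfs") is None:
        problems.append("git-lfs is not installed")
    if shutil.which("git") is None:
        problems.append("git is not installed")
        return problems
    result = subprocess.run(["git", "remote", "get-url", remote],
                            capture_output=True, text=True)
    if result.returncode != 0 or result.stdout.strip() != url:
        problems.append(f"git remote {remote} does not point to {url}")
    return problems


if __name__ == "__main__":
    for problem in check_checkpoint() + check_git():
        print("FAIL:", problem)
```

Running the script from the repository root prints one FAIL line per unmet item and nothing when every check passes.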


License

azbert-base is released under the MIT license: https://choosealicense.com/licenses/mit
