51la5/roberta-large-NER on huggingface.co

Introduction of roberta-large-NER

Model Details of roberta-large-NER

xlm-roberta-large-finetuned-conll03-english

Model Details
Uses
Bias, Risks, and Limitations
Training
Evaluation
Environmental Impact
Technical Specifications
Citation
Model Card Authors
How to Get Started with the Model

Model Details

Model Description

The XLM-RoBERTa model was proposed in Unsupervised Cross-lingual Representation Learning at Scale by Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer and Veselin Stoyanov. It is based on Facebook's RoBERTa model released in 2019. It is a large multi-lingual language model, trained on 2.5TB of filtered CommonCrawl data. This model is XLM-RoBERTa-large fine-tuned with the conll2003 dataset in English.

Developed by: See associated paper
Model type: Multi-lingual language model
Language(s): XLM-RoBERTa is a multilingual model trained on 100 different languages (see the GitHub Repo for the full list); this model is fine-tuned on an English dataset
License: More information needed
Related Models: RoBERTa, XLM
- Parent Model: XLM-RoBERTa-large
Resources for more information:
- GitHub Repo
- Associated Paper

Uses

Direct Use

The model is a language model that can be used for token classification, a natural language understanding task in which a label is assigned to some tokens in a text.

Downstream Use

Potential downstream use cases include Named Entity Recognition (NER) and Part-of-Speech (PoS) tagging. To learn more about token classification and other potential downstream use cases, see the Hugging Face token classification docs.

Out-of-Scope Use

The model should not be used to intentionally create hostile or alienating environments for people.

Bias, Risks, and Limitations

CONTENT WARNING: Readers should be made aware that language generated by this model may be disturbing or offensive to some and may propagate historical and current stereotypes.

Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al. (2021) and Bender et al. (2021)). In the context of tasks relevant to this model, Mishra et al. (2020) explore social biases in NER systems for English and find that there is systematic bias in existing NER systems in that they fail to identify named entities from different demographic groups (though this paper did not look at BERT). For example, using a sample sentence from Mishra et al. (2020):

>>> from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
>>> tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-large-finetuned-conll03-english")
>>> model = AutoModelForTokenClassification.from_pretrained("xlm-roberta-large-finetuned-conll03-english")
>>> classifier = pipeline("ner", model=model, tokenizer=tokenizer)
>>> classifier("Alya told Jasmine that Andrew could pay with cash..")
[{'end': 2,
  'entity': 'I-PER',
  'index': 1,
  'score': 0.9997861,
  'start': 0,
  'word': '▁Al'},
 {'end': 4,
  'entity': 'I-PER',
  'index': 2,
  'score': 0.9998591,
  'start': 2,
  'word': 'ya'},
 {'end': 16,
  'entity': 'I-PER',
  'index': 4,
  'score': 0.99995816,
  'start': 10,
  'word': '▁Jasmin'},
 {'end': 17,
  'entity': 'I-PER',
  'index': 5,
  'score': 0.9999584,
  'start': 16,
  'word': 'e'},
 {'end': 29,
  'entity': 'I-PER',
  'index': 7,
  'score': 0.99998057,
  'start': 23,
  'word': '▁Andrew'}]
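
To turn an output like the one above into a per-name check, one option is to test whether each name's character span is fully covered by PER-labelled tokens. The following is a minimal sketch, not part of the model card: the helper name `person_name_coverage` and the trimmed prediction dictionaries are illustrative assumptions, and only the first occurrence of each name is checked.

```python
def person_name_coverage(text, predictions, names):
    """Return {name: bool}: True if the name's whole span was tagged as PER.

    Hypothetical helper for illustration; `predictions` is a list of dicts
    with 'entity', 'start' and 'end' keys, as returned by the NER pipeline.
    """
    # Collect every character offset covered by a PER-labelled token.
    per_chars = set()
    for pred in predictions:
        if pred["entity"].endswith("PER"):
            per_chars.update(range(pred["start"], pred["end"]))

    report = {}
    for name in names:
        start = text.find(name)  # first occurrence only
        span = range(start, start + len(name))
        report[name] = start >= 0 and all(i in per_chars for i in span)
    return report


# Offsets and labels copied from the pipeline output above (scores omitted).
text = "Alya told Jasmine that Andrew could pay with cash.."
predictions = [
    {"entity": "I-PER", "start": 0, "end": 2},
    {"entity": "I-PER", "start": 2, "end": 4},
    {"entity": "I-PER", "start": 10, "end": 16},
    {"entity": "I-PER", "start": 16, "end": 17},
    {"entity": "I-PER", "start": 23, "end": 29},
]
report = person_name_coverage(text, predictions, ["Alya", "Jasmine", "Andrew"])
print(report)  # {'Alya': True, 'Jasmine': True, 'Andrew': True}
```

Running the same check over many sentences with names drawn from different demographic groups would give a rough recall figure per group, in the spirit of the analysis in Mishra et al. (2020).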

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model.

Training

See the following resources for training data and training procedure details:

XLM-RoBERTa-large model card
CoNLL-2003 data card
Associated paper

Evaluation

See the associated paper for evaluation details.

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: 500 32GB Nvidia V100 GPUs (from the associated paper)
Hours used: More information needed
Cloud Provider: More information needed
Compute Region: More information needed
Carbon Emitted: More information needed
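
Since the hours, provider, and region are not reported, no emissions figure can be computed for this model. As a rough illustration of the kind of estimate such a calculator produces, the sketch below multiplies energy use by a grid carbon intensity. It is a simplification, not the calculator's actual method; the function name and every input except the GPU count are placeholder assumptions.

```python
def estimate_emissions_kg(num_gpus, gpu_power_watts, hours, kg_co2_per_kwh):
    """Simplified estimate: energy used (kWh) times grid carbon intensity.

    Hypothetical helper for illustration. It ignores datacenter overhead
    (PUE) and carbon offsets, which a fuller estimate would account for.
    """
    energy_kwh = num_gpus * gpu_power_watts * hours / 1000
    return energy_kwh * kg_co2_per_kwh


# Only the GPU count (500) comes from the model card. The wattage, hours and
# carbon intensity below are placeholders, not measured or reported values.
example_kg = estimate_emissions_kg(
    num_gpus=500,
    gpu_power_watts=300,   # placeholder per-GPU power draw
    hours=1,               # placeholder; the card lists "More information needed"
    kg_co2_per_kwh=0.5,    # placeholder grid carbon intensity
)
print(example_kg)  # 75.0 (for the placeholder inputs only)
```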

Technical Specifications

See the associated paper for further details.

Citation

BibTeX:

@article{conneau2019unsupervised,
  title={Unsupervised Cross-lingual Representation Learning at Scale},
  author={Conneau, Alexis and Khandelwal, Kartikay and Goyal, Naman and Chaudhary, Vishrav and Wenzek, Guillaume and Guzm{\'a}n, Francisco and Grave, Edouard and Ott, Myle and Zettlemoyer, Luke and Stoyanov, Veselin},
  journal={arXiv preprint arXiv:1911.02116},
  year={2019}
}

APA:

Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., ... & Stoyanov, V. (2019). Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116.

Model Card Authors

This model card was written by the team at Hugging Face.

How to Get Started with the Model

Use the code below to get started with the model. You can use this model directly within a pipeline for NER.

>>> from transformers import AutoTokenizer, AutoModelForTokenClassification
>>> from transformers import pipeline
>>> tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-large-finetuned-conll03-english")
>>> model = AutoModelForTokenClassification.from_pretrained("xlm-roberta-large-finetuned-conll03-english")
>>> classifier = pipeline("ner", model=model, tokenizer=tokenizer)
>>> classifier("Hello I'm Omar and I live in Zürich.")

[{'end': 14,
  'entity': 'I-PER',
  'index': 5,
  'score': 0.9999175,
  'start': 10,
  'word': '▁Omar'},
 {'end': 35,
  'entity': 'I-LOC',
  'index': 10,
  'score': 0.9999906,
  'start': 29,
  'word': '▁Zürich'}]
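
The pipeline returns one prediction per subword piece, so a single name can be split across several entries (for example '▁Al' and 'ya' in the Bias, Risks, and Limitations section). The `transformers` pipeline can group pieces itself when called with `aggregation_strategy="simple"`. The sketch below shows the same idea in plain Python, with no model download needed. The helper name `merge_entity_pieces` is an illustrative assumption, and its rule (merge neighbours with the same label separated only by whitespace) is a simplification that would also join two distinct adjacent entities of the same type.

```python
def merge_entity_pieces(text, predictions):
    """Merge neighbouring same-label token predictions into entity spans.

    Hypothetical helper for illustration; expects dicts with 'entity',
    'score', 'start' and 'end' keys, as returned by the NER pipeline.
    """
    merged = []
    for pred in sorted(predictions, key=lambda p: p["start"]):
        label = pred["entity"].split("-")[-1]  # 'I-PER' -> 'PER'
        if merged:
            last = merged[-1]
            gap = text[last["end"]:pred["start"]]
            # Same label and nothing but whitespace in between: extend the span.
            if last["label"] == label and gap.strip() == "":
                last["end"] = pred["end"]
                last["score"] = min(last["score"], pred["score"])
                continue
        merged.append({"label": label, "start": pred["start"],
                       "end": pred["end"], "score": pred["score"]})
    for span in merged:
        span["word"] = text[span["start"]:span["end"]]
    return merged


# Token-level predictions copied from the bias example earlier in this card.
text = "Alya told Jasmine that Andrew could pay with cash.."
predictions = [
    {"entity": "I-PER", "score": 0.9997861, "start": 0, "end": 2},
    {"entity": "I-PER", "score": 0.9998591, "start": 2, "end": 4},
    {"entity": "I-PER", "score": 0.99995816, "start": 10, "end": 16},
    {"entity": "I-PER", "score": 0.9999584, "start": 16, "end": 17},
    {"entity": "I-PER", "score": 0.99998057, "start": 23, "end": 29},
]
spans = merge_entity_pieces(text, predictions)
print([s["word"] for s in spans])  # ['Alya', 'Jasmine', 'Andrew']
```

Each merged span keeps the lowest score of its pieces, a conservative choice; averaging the scores would be an equally reasonable alternative.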

Runs of 51la5 roberta-large-NER on huggingface.co

Total runs: 31.6K
24-hour runs: not reported
3-day runs: 721
7-day runs: 7.3K
30-day runs: -14.0K

More Information About roberta-large-NER huggingface.co Model

The 51la5 roberta-large-NER model is hosted on huggingface.co, where it can be tried online and called through an API from Node.js, Python, or plain HTTP. According to this page, huggingface.co offers both a free trial and paid use of the model.

roberta-large-NER huggingface.co Url

https://huggingface.co/51la5/roberta-large-NER

Other models from 51la5 on huggingface.co

- 51la5/distilbert-base-NER: Total runs: 160; Run Growth: 130; Growth Rate: 81.25%; Updated: October 17 2022
- 51la5/BART-QMSUM-Summary: Total runs: 160; Run Growth: -1; Growth Rate: -50.00%; Updated: November 23 2022
- 51la5/BART-large-summary: Total runs: 158; Run Growth: -4; Growth Rate: -200.00%; Updated: November 23 2022
- 51la5/T5-summary: Total runs: 158; Run Growth: -1; Growth Rate: -33.33%; Updated: November 23 2022
- 51la5/distilBART-summary: Total runs: 158; Run Growth: -4; Growth Rate: -200.00%; Updated: November 23 2022
- 51la5/bert-large-NER: Total runs: 128; Run Growth: 116; Growth Rate: 90.63%; Updated: October 17 2022
- 51la5/bert-base-NER: Total runs: 125; Run Growth: 112; Growth Rate: 89.60%; Updated: October 17 2022
- 51la5/electra-large-NER: Total runs: 110; Run Growth: 107; Growth Rate: 97.27%; Updated: October 17 2022
- 51la5/roberta-base-sentiment: Total runs: 106; Run Growth: 82; Growth Rate: 77.36%; Updated: October 17 2022
- 51la5/distilbert-base-sentiment: Total runs: 106; Run Growth: 89; Growth Rate: 83.96%; Updated: October 17 2022
- 51la5/bert-base-sentiment: Total runs: 106; Run Growth: 91; Growth Rate: 85.85%; Updated: October 17 2022
- 51la5/QMSUM-keyphrase-gen: Total runs: 6; Run Growth: -4; Growth Rate: -66.67%; Updated: July 22 2022
- 51la5/XSUM-keyphrase-gen: Total runs: 6; Run Growth: 0; Growth Rate: 0.00%; Updated: July 22 2022
- 51la5/LogRegression-sentiment: Total runs: 0; Run Growth: 0; Growth Rate: 0.00%; Updated: October 17 2022
- 51la5/MultinomialNB-sentiment: Total runs: 0; Run Growth: 0; Growth Rate: 0.00%; Updated: October 17 2022
