Llama-3.1-Tulu-3-8B-DPO huggingface.co api & allenai Llama-3.1-Tulu-3-8B-DPO github AI Model

Introduction of Llama-3.1-Tulu-3-8B-DPO

Model Details of Llama-3.1-Tulu-3-8B-DPO

Llama-3.1-Tulu-3-8B-DPO

Tülu3 is a leading instruction following model family, offering fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern post-training techniques. Tülu3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval.

Model description

Model type: A model trained on a mix of publicly available, synthetic and human-created datasets.
Language(s) (NLP): Primarily English
License: Llama 3.1 Community License Agreement
Finetuned from model: allenai/Llama-3.1-Tulu-3-8B-SFT

Model Sources

Training Repository: https://github.com/allenai/open-instruct
Eval Repository: https://github.com/allenai/olmes
Paper: https://allenai.org/papers/tulu-3-report.pdf (arXiv soon)
Demo: https://playground.allenai.org/

Model Family

Stage	Llama 3.1 8B	Llama 3.1 70B
Base Model	meta-llama/Llama-3.1-8B	meta-llama/Llama-3.1-70B
SFT	allenai/Llama-3.1-Tulu-3-8B-SFT	allenai/Llama-3.1-Tulu-3-70B-SFT
DPO	allenai/Llama-3.1-Tulu-3-8B-DPO	allenai/Llama-3.1-Tulu-3-70B-DPO
Final Models (RLVR)	allenai/Llama-3.1-Tulu-3-8B	allenai/Llama-3.1-Tulu-3-70B
Reward Model (RM)	allenai/Llama-3.1-Tulu-3-8B-RM	(Same as 8B)

Using the model

Loading with HuggingFace

To load the model with HuggingFace, use the following snippet:

from transformers import AutoModelForCausalLM

tulu_model = AutoModelForCausalLM.from_pretrained("allenai/Llama-3.1-Tulu-3-8B-DPO")

VLLM

As a Llama base model, the model can be easily served with:

vllm serve allenai/Llama-3.1-Tulu-3-8B-DPO

Note that given the long chat template of Llama, you may want to use --max_model_len=8192 .

Chat template

The chat template for our models is formatted as:

<|user|>\nHow are you doing?\n<|assistant|>\nI'm just a computer program, so I don't have feelings, but I'm functioning as expected. How can I assist you today?<|endoftext|>

Or with new lines expanded:

<|user|>
How are you doing?
<|assistant|>
I'm just a computer program, so I don't have feelings, but I'm functioning as expected. How can I assist you today?<|endoftext|>

It is embedded within the tokenizer as well, for tokenizer.apply_chat_template .

System prompt

In Ai2 demos, we use this system prompt by default:

You are Tulu 3, a helpful and harmless AI Assistant built by the Allen Institute for AI.

The model has not been trained with a specific system prompt in mind.

Bias, Risks, and Limitations

The Tülu3 models have limited safety training, but are not deployed automatically with in-the-loop filtering of responses like ChatGPT, so the model can produce problematic outputs (especially when prompted to do so). It is also unknown what the size and composition of the corpus was used to train the base Llama 3.1 models, however it is likely to have included a mix of Web data and technical sources like books and code. See the Falcon 180B model card for an example of this.

Performance

Benchmark (eval)	Tülu 3 SFT 8B	Tülu 3 DPO 8B	Tülu 3 8B	Llama 3.1 8B Instruct	Qwen 2.5 7B Instruct	Magpie 8B	Gemma 2 9B Instruct	Ministral 8B Instruct
Avg.	60.4	64.4	64.8	62.2	57.8	44.7	55.2	58.3
MMLU (0 shot, CoT)	65.9	68.7	68.2	71.2	76.6	62.0	74.6	68.5
PopQA (15 shot)	29.3	29.3	29.1	20.2	18.1	22.5	28.3	20.2
TruthfulQA (6 shot)	46.8	56.1	55.0	55.1	63.1	57.0	61.4	55.5
BigBenchHard (3 shot, CoT)	67.9	65.8	66.0	62.8	21.7	0.9	2.5	56.2
DROP (3 shot)	61.3	62.5	62.6	61.5	54.4	49.4	58.8	56.2
MATH (4 shot CoT, Flex)	31.5	42.0	43.7	42.5	14.8	5.1	29.8	40.0
GSM8K (8 shot, CoT)	76.2	84.3	87.6	83.4	83.8	61.2	79.7	80.0
HumanEval (pass@10)	86.2	83.9	83.9	86.3	93.1	75.4	71.7	91.0
HumanEval+ (pass@10)	81.4	78.6	79.2	82.9	89.7	69.1	67.0	88.5
IFEval (prompt loose)	72.8	81.1	82.4	80.6	74.7	38.8	69.9	56.4
AlpacaEval 2 (LC % win)	12.4	33.5	34.5	24.2	29.0	49.0	43.7	31.4
Safety (6 task avg.)	93.1	87.2	85.5	75.2	75.0	46.4	75.5	56.2

Benchmark (eval)	Tülu 3 70B SFT	Tülu 3 DPO 70B	Tülu 3 70B	Llama 3.1 70B Instruct	Qwen 2.5 72B Instruct	Hermes 3 Llama 3.1 70B	Nemotron Llama 3.1 70B
Avg.	72.6	75.9	76.0	73.4	71.5	68.3	65.5
MMLU (0 shot, CoT)	78.9	83.3	83.1	85.3	85.5	80.4	83.8
PopQA (15 shot)	48.6	46.3	46.5	46.4	30.6	48.1	36.4
TruthfulQA (6 shot)	55.7	67.9	67.6	66.8	69.9	66.5	62.6
BigBenchHard (3 shot, CoT)	82.7	81.8	82.0	73.8	67.2	82.1	0.7
DROP (3 shot)	77.2	74.1	74.3	77.0	34.2	73.2	68.8
MATH (4 shot CoT, Flex)	53.7	62.3	63.0	56.4	74.3	41.9	55.0
GSM8K (8 shot, CoT)	91.1	93.5	93.5	93.7	89.5	90.0	84.7
HumanEval (pass@10)	92.9	92.4	92.4	93.6	94.0	89.6	94.1
HumanEval+ (pass@10)	87.3	88.4	88.0	89.5	90.8	85.9	85.5
IFEval (prompt loose)	82.1	82.6	83.2	88.0	87.6	76.0	79.9
AlpacaEval 2 (LC % win)	26.3	49.6	49.8	33.4	47.7	28.4	66.1
Safety (6 task avg.)	94.4	89.0	88.3	76.5	87.0	57.9	69.0

Hyperparamters

DPO:

Learning Rate : 5 × 10⁻⁷ (8B), 2.0e-7 (70B)
Learning Rate Schedule : Linear
Batch Size (effective) : 32 (8B), 128 (70B)
Max Sequence Length : 2,048
Epochs : 1

License and use

All Llama 3.1 Tülu3 models are released under Meta's Llama 3.1 Community License Agreement . Llama 3.1 is licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. Tülu3 is intended for research and educational use. For more information, please see our Responsible Use Guidelines .

The models have been fine-tuned using a dataset mix with outputs generated from third party models and are subject to additional terms: Gemma Terms of Use and Qwen License Agreement (models were improved using Qwen 2.5).

Citation

If Tülu3 or any of the related materials were helpful to your work, please cite:

@article{lambert2024tulu3,
  title = {Tülu 3: Pushing Frontiers in Open Language Model Post-Training},
  author = {
    Nathan Lambert and 
    Jacob Morrison and 
    Valentina Pyatkin and 
    Shengyi Huang and 
    Hamish Ivison and 
    Faeze Brahman and 
    Lester James V. Miranda and 
    Alisa Liu and 
    Nouha Dziri and 
    Shane Lyu and 
    Yuling Gu and 
    Saumya Malik and 
    Victoria Graf and 
    Jena D. Hwang and 
    Jiangjiang Yang and
    Ronan Le Bras and
    Oyvind Tafjord and
    Chris Wilhelm and
    Luca Soldaini and 
    Noah A. Smith and 
    Yizhong Wang and 
    Pradeep Dasigi and 
    Hannaneh Hajishirzi
  },
  year = {2024},
  email = {tulu@allenai.org}
}

Runs of allenai Llama-3.1-Tulu-3-8B-DPO on huggingface.co

7.1K

Total runs

24-hour runs

-885

3-day runs

-11.6K

7-day runs

-19.8K

30-day runs

More Information About Llama-3.1-Tulu-3-8B-DPO huggingface.co Model

More Llama-3.1-Tulu-3-8B-DPO license Visit here:

https://choosealicense.com/licenses/llama3.1

Llama-3.1-Tulu-3-8B-DPO huggingface.co

Llama-3.1-Tulu-3-8B-DPO huggingface.co is an AI model on huggingface.co that provides Llama-3.1-Tulu-3-8B-DPO's model effect (), which can be used instantly with this allenai Llama-3.1-Tulu-3-8B-DPO model. huggingface.co supports a free trial of the Llama-3.1-Tulu-3-8B-DPO model, and also provides paid use of the Llama-3.1-Tulu-3-8B-DPO. Support call Llama-3.1-Tulu-3-8B-DPO model through api, including Node.js, Python, http.

Llama-3.1-Tulu-3-8B-DPO huggingface.co Url

https://huggingface.co/allenai/Llama-3.1-Tulu-3-8B-DPO

allenai Llama-3.1-Tulu-3-8B-DPO online free

Llama-3.1-Tulu-3-8B-DPO huggingface.co is an online trial and call api platform, which integrates Llama-3.1-Tulu-3-8B-DPO's modeling effects, including api services, and provides a free online trial of Llama-3.1-Tulu-3-8B-DPO, you can try Llama-3.1-Tulu-3-8B-DPO online for free by clicking the link below.

allenai Llama-3.1-Tulu-3-8B-DPO online free url in huggingface.co:

https://huggingface.co/allenai/Llama-3.1-Tulu-3-8B-DPO

Llama-3.1-Tulu-3-8B-DPO install

Llama-3.1-Tulu-3-8B-DPO is an open source model from GitHub that offers a free installation service, and any user can find Llama-3.1-Tulu-3-8B-DPO on GitHub to install. At the same time, huggingface.co provides the effect of Llama-3.1-Tulu-3-8B-DPO install, users can directly use Llama-3.1-Tulu-3-8B-DPO installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

Llama-3.1-Tulu-3-8B-DPO install url in huggingface.co:

https://huggingface.co/allenai/Llama-3.1-Tulu-3-8B-DPO

huggingface.co

allenai/longformer-base-4096

Total runs: 5.6M

Run Growth: 2.7M

Growth Rate: 47.46%

Updated: 2023年4月5日

huggingface.co

allenai/scibert_scivocab_uncased

Total runs: 804.0K

Run Growth: 118.1K

Growth Rate: 14.70%

Updated: 2022年10月3日

huggingface.co

allenai/specter2_aug2023refresh_base

Total runs: 173.9K

Run Growth: -113.8K

Growth Rate: -65.44%

Updated: 2024年5月14日

huggingface.co

allenai/ivila-row-layoutlm-finetuned-s2vl-v2

Total runs: 134.2K

Run Growth: 110.4K

Growth Rate: 82.06%

Updated: 2022年10月3日

huggingface.co

allenai/OLMo-1B-0724-hf

Total runs: 94.4K

Run Growth: -2.1K

Growth Rate: -2.24%

Updated: 2024年8月5日

huggingface.co

allenai/specter

Total runs: 91.7K

Run Growth: 78.6K

Growth Rate: 85.70%

Updated: 2023年10月18日

huggingface.co

allenai/OLMo-2-1124-7B

Total runs: 91.3K

Run Growth: 63.3K

Growth Rate: 69.57%

Updated: 2025年1月6日

huggingface.co

allenai/Molmo-7B-D-0924

Total runs: 77.3K

Run Growth: -517.1K

Growth Rate: -669.13%

Updated: 2024年10月10日

huggingface.co

allenai/led-base-16384

Total runs: 75.6K

Run Growth: 4.4K

Growth Rate: 5.77%

Updated: 2023年1月24日

huggingface.co

allenai/MolmoE-1B-0924

Total runs: 63.3K

Run Growth: 51.7K

Growth Rate: 81.63%

Updated: 2024年10月10日

huggingface.co

allenai/specter2_base

Total runs: 61.6K

Run Growth: -50.5K

Growth Rate: -81.96%

Updated: 2024年12月4日

huggingface.co

allenai/biomed_roberta_base

Total runs: 49.7K

Run Growth: -640

Growth Rate: -1.29%

Updated: 2022年10月3日

huggingface.co

allenai/tk-instruct-small-def-pos

Total runs: 41.2K

Run Growth: 41.2K

Growth Rate: 99.97%

Updated: 2023年1月24日

huggingface.co

allenai/olmOCR-7B-0225-preview

Total runs: 35.6K

Run Growth: 26.6K

Growth Rate: 98.51%

Updated: 2025年2月25日

huggingface.co

allenai/OLMo-7B-0724-Instruct-hf

Total runs: 31.2K

Run Growth: -19.4K

Growth Rate: -62.31%

Updated: 2024年9月24日

huggingface.co

allenai/OLMo-7B-0724-hf

Total runs: 27.7K

Run Growth: 24.5K

Growth Rate: 88.59%

Updated: 2024年7月16日

huggingface.co

allenai/OLMoE-1B-7B-0924

Total runs: 23.4K

Run Growth: 1.4K

Growth Rate: 5.61%

Updated: 2024年10月19日

huggingface.co

allenai/OLMo-1B-hf

Total runs: 23.0K

Run Growth: 7.7K

Growth Rate: 33.79%

Updated: 2024年8月14日

huggingface.co

allenai/Llama-3.1-Tulu-3-8B-SFT

Total runs: 22.6K

Run Growth: -54

Growth Rate: -0.24%

Updated: 2025年1月30日

huggingface.co

allenai/longformer-large-4096

Total runs: 21.2K

Run Growth: -18.4K

Growth Rate: -87.09%

Updated: 2022年10月3日

huggingface.co

allenai/unifiedqa-t5-small

Total runs: 17.9K

Run Growth: -22.8K

Growth Rate: -127.30%

Updated: 2023年1月24日

huggingface.co

allenai/OLMo-2-1124-7B-Instruct

Total runs: 16.3K

Run Growth: 5.4K

Growth Rate: 27.42%

Updated: 2025年1月6日

huggingface.co

allenai/OLMo-2-1124-13B

Total runs: 12.4K

Run Growth: 8.0K

Growth Rate: 69.22%

Updated: 2025年1月6日

huggingface.co

allenai/Molmo-7B-O-0924

Total runs: 11.5K

Run Growth: 4.9K

Growth Rate: 42.53%

Updated: 2024年11月15日

huggingface.co

allenai/longformer-large-4096-finetuned-triviaqa

Total runs: 11.3K

Run Growth: 977

Growth Rate: 8.66%

Updated: 2022年10月3日

huggingface.co

allenai/scibert_scivocab_cased

Total runs: 11.0K

Run Growth: 2.7K

Growth Rate: 25.33%

Updated: 2022年10月3日

huggingface.co

allenai/Llama-3.1-Tulu-3-8B

Total runs: 10.5K

Run Growth: 43

Growth Rate: 0.41%

Updated: 2025年2月13日

huggingface.co

allenai/OLMo-7B-hf

Total runs: 8.5K

Run Growth: 3.3K

Growth Rate: 36.78%

Updated: 2024年7月16日

huggingface.co

allenai/OLMo-2-1124-7B-SFT

Total runs: 7.6K

Run Growth: 7.9K

Growth Rate: 65.67%

Updated: 2025年1月6日

huggingface.co

allenai/OLMoE-1B-7B-0924-Instruct

Total runs: 6.9K

Run Growth: 880

Growth Rate: 12.71%

Updated: 2024年9月13日

huggingface.co

allenai/OLMo-2-1124-13B-Instruct

Total runs: 6.5K

Run Growth: -2.3K

Growth Rate: -34.31%

Updated: 2025年1月6日

huggingface.co

allenai/Molmo-72B-0924

Total runs: 6.4K

Run Growth: 3.3K

Growth Rate: 51.01%

Updated: 2024年10月10日

huggingface.co

allenai/wildguard

Total runs: 6.1K

Run Growth: -21.5K

Growth Rate: -354.06%

Updated: 2024年7月3日

huggingface.co

allenai/OLMo-7B

Total runs: 5.1K

Run Growth: -17.0K

Growth Rate: -321.48%

Updated: 2024年7月16日

huggingface.co

allenai/OLMo-7B-0724-SFT-hf

Total runs: 4.9K

Run Growth: 2.1K

Growth Rate: 45.30%

Updated: 2024年7月14日

huggingface.co

allenai/Llama-3.1-Tulu-3-70B

Total runs: 4.7K

Run Growth: 2.7K

Growth Rate: 56.97%

Updated: 2025年2月10日

huggingface.co

allenai/OLMoE-1B-7B-0125-Instruct

Total runs: 4.5K

Run Growth: 4.4K

Growth Rate: 97.96%

Updated: 2025年2月4日

huggingface.co

allenai/led-large-16384-arxiv

Total runs: 4.3K

Run Growth: 3.2K

Growth Rate: 74.66%

Updated: 2023年1月24日

huggingface.co

allenai/Llama-3.1-Tulu-3-8B-RM

Total runs: 4.1K

Run Growth: -888

Growth Rate: -21.90%

Updated: 2025年1月30日

huggingface.co

allenai/tulu-2-dpo-13b

Total runs: 4.0K

Run Growth: 1.9K

Growth Rate: 47.73%

Updated: 2024年5月17日

huggingface.co

allenai/tulu-2-dpo-7b

Total runs: 3.7K

Run Growth: 1.7K

Growth Rate: 44.61%

Updated: 2024年5月14日

huggingface.co

allenai/tulu-2-dpo-70b

Total runs: 3.7K

Run Growth: 1.6K

Growth Rate: 42.74%

Updated: 2024年1月31日

huggingface.co

allenai/tk-instruct-base-def-pos

Total runs: 3.3K

Run Growth: 3.1K

Growth Rate: 93.97%

Updated: 2023年1月24日

huggingface.co

allenai/specter2

Total runs: 2.5K

Run Growth: -163

Growth Rate: -6.49%

Updated: 2024年12月4日

huggingface.co

allenai/OLMo-7B-0424-hf

Total runs: 2.4K

Run Growth: -2.0K

Growth Rate: -77.45%

Updated: 2024年7月16日

huggingface.co

allenai/OLMo-2-1124-13B-Instruct-GGUF

Total runs: 2.3K

Run Growth: 1.6K

Growth Rate: 69.43%

Updated: 2025年1月6日

huggingface.co

allenai/OLMoE-1B-7B-0125-Instruct-GGUF

Total runs: 2.3K

Run Growth: 2.2K

Growth Rate: 98.27%

Updated: 2025年2月13日

huggingface.co

allenai/OLMo-2-1124-7B-DPO

Total runs: 2.1K

Run Growth: 2.2K

Growth Rate: 33.30%

Updated: 2025年1月6日

huggingface.co

allenai/open-instruct-human-mix-65b

Total runs: 2.1K

Run Growth: 578

Growth Rate: 34.24%

Updated: 2023年6月29日

huggingface.co

allenai/specter2_aug2023refresh

Total runs: 2.0K

Run Growth: -2.5K

Growth Rate: -123.46%

Updated: 2024年5月14日

huggingface.co

allenai/digital-socrates-13b

Total runs: 1.9K

Run Growth: 854

Growth Rate: 44.90%

Updated: 2024年9月2日

huggingface.co

allenai/Llama-3.1-Tulu-3-405B

Total runs: 1.9K

Run Growth: 1.9K

Growth Rate: 99.26%

Updated: 2025年2月10日

huggingface.co

allenai/digital-socrates-7b

Total runs: 1.9K

Run Growth: 862

Growth Rate: 45.37%

Updated: 2024年9月2日

huggingface.co

allenai/OLMo-2-1124-13B-GGUF

Total runs: 1.9K

Run Growth: 1.5K

Growth Rate: 82.01%

Updated: 2024年11月26日

huggingface.co

allenai/Llama-3.1-Tulu-3.1-8B

Total runs: 1.9K

Run Growth: 1.8K

Growth Rate: 97.27%

Updated: 2025年2月10日

huggingface.co

allenai/OLMo-7B-Instruct

Total runs: 1.8K

Run Growth: 162

Growth Rate: 9.02%

Updated: 2024年10月15日

huggingface.co

allenai/OLMo-1B

Total runs: 1.7K

Run Growth: -110

Growth Rate: -6.43%

Updated: 2024年7月16日

huggingface.co

allenai/OLMoE-1B-7B-0924-Instruct-GGUF

Total runs: 1.7K

Run Growth: -160

Growth Rate: -9.16%

Updated: 2024年9月14日

huggingface.co

allenai/olmOCR-7B-0225-preview-GGUF

Total runs: 1.7K

Run Growth: 943

Growth Rate: 72.21%

Updated: 2025年2月26日

huggingface.co

allenai/led-large-16384

Total runs: 1.6K

Run Growth: 461

Growth Rate: 28.09%

Updated: 2023年1月24日

huggingface.co

allenai/open-instruct-pythia-6.9b-tulu

Total runs: 1.6K

Run Growth: 885

Growth Rate: 47.40%

Updated: 2023年6月13日

huggingface.co

allenai/unifiedqa-t5-base

Total runs: 1.5K

Run Growth: 824

Growth Rate: 53.93%

Updated: 2023年1月24日

huggingface.co

allenai/t5-small-squad2-question-generation

Total runs: 1.4K

Run Growth: 841

Growth Rate: 57.17%

Updated: 2023年1月24日

huggingface.co

allenai/wmt19-de-en-6-6-big

Total runs: 1.0K

Run Growth: -34

Growth Rate: -3.30%

Updated: 2023年1月24日

huggingface.co

allenai/OLMo-7B-Instruct-hf

Total runs: 946

Run Growth: -3.2K

Growth Rate: -239.15%

Updated: 2024年10月26日

huggingface.co

allenai/unifiedqa-v2-t5-large-1363200

Total runs: 936

Run Growth: 513

Growth Rate: 54.81%

Updated: 2023年1月24日

huggingface.co

allenai/tulu-v2.5-ppo-13b-uf-mean-70b-uf-rm

Total runs: 920

Run Growth: 854

Growth Rate: 92.83%

Updated: 2024年6月14日

huggingface.co

allenai/tailor

Total runs: 895

Run Growth: 878

Growth Rate: 98.10%

Updated: 2023年1月24日

huggingface.co

allenai/truthfulqa-truth-judge-llama2-7B

Total runs: 873

Run Growth: -148

Growth Rate: -16.95%

Updated: 2024年3月7日

huggingface.co

allenai/OLMoE-1B-7B-0125-GGUF

Total runs: 842

Run Growth: 839

Growth Rate: 99.64%

Updated: 2025年1月22日

huggingface.co

allenai/OLMoE-1B-7B-0125

Total runs: 819

Run Growth: 795

Growth Rate: 97.07%

Updated: 2025年1月23日

huggingface.co

allenai/unifiedqa-v2-t5-3b-1363200

Total runs: 669

Run Growth: 392

Growth Rate: 58.59%

Updated: 2023年1月24日

huggingface.co

allenai/Llama-3.1-Tulu-3-70B-DPO

Total runs: 667

Run Growth: 237

Growth Rate: 35.53%

Updated: 2025年1月30日

huggingface.co

allenai/cs_roberta_base

Total runs: 655

Run Growth: 323

Growth Rate: 43.01%

Updated: 2022年10月3日

huggingface.co

allenai/OLMo-2-1124-7B-Instruct-GGUF

Total runs: 542

Run Growth: -250

Growth Rate: -46.13%

Updated: 2025年1月6日

huggingface.co

allenai/truthfulqa-info-judge-llama2-7B

Total runs: 522

Run Growth: -340

Growth Rate: -65.13%

Updated: 2024年3月7日

huggingface.co

allenai/PRIMERA

Total runs: 502

Run Growth: -100

Growth Rate: -21.23%

Updated: 2023年1月24日

huggingface.co

allenai/uio2-large

Total runs: 486

Run Growth: 256

Growth Rate: 52.67%

Updated: 2024年2月12日

huggingface.co

allenai/OLMo-7B-Twin-2T-hf

Total runs: 469

Run Growth: 134

Growth Rate: 29.00%

Updated: 2024年7月16日

huggingface.co

allenai/unifiedqa-v2-t5-base-1251000

Total runs: 406

Run Growth: 383

Growth Rate: 94.33%

Updated: 2023年1月24日

huggingface.co

allenai/scitulu-7b

Total runs: 404

Run Growth: 354

Growth Rate: 94.65%

Updated: 2024年6月13日

huggingface.co

allenai/hvila-block-layoutlm-finetuned-docbank

Total runs: 382

Run Growth: 333

Growth Rate: 87.17%

Updated: 2022年10月3日

huggingface.co

allenai/OLMo-2-1124-7B-GGUF

Total runs: 360

Run Growth: 13

Growth Rate: 4.30%

Updated: 2024年11月26日

huggingface.co

allenai/unifiedqa-t5-large

Total runs: 339

Run Growth: 26

Growth Rate: 7.67%

Updated: 2023年1月24日

huggingface.co

allenai/t5-small-next-word-generator-qoogle

Total runs: 329

Run Growth: 242

Growth Rate: 73.11%

Updated: 2023年1月24日

huggingface.co

allenai/Llama-3.1-Tulu-3-70B-SFT

Total runs: 326

Run Growth: -125

Growth Rate: -38.34%

Updated: 2025年1月30日

huggingface.co

allenai/aspire-contextualsentence-multim-compsci

Total runs: 317

Run Growth: -5.8K

Growth Rate: -1837.22%

Updated: 2023年10月17日

huggingface.co

allenai/tulu-2-7b

Total runs: 313

Run Growth: -437

Growth Rate: -139.62%

Updated: 2024年4月30日

huggingface.co

allenai/OLMoE-1B-7B-0924-GGUF

Total runs: 299

Run Growth: 36

Growth Rate: 12.08%

Updated: 2024年9月14日

huggingface.co

allenai/tulu-2-70b

Total runs: 297

Run Growth: 159

Growth Rate: 53.54%

Updated: 2024年4月19日

huggingface.co

allenai/PRIMERA-multinews

Total runs: 284

Run Growth: 190

Growth Rate: 64.41%

Updated: 2023年1月24日

huggingface.co

allenai/unifiedqa-t5-3b

Total runs: 262

Run Growth: 135

Growth Rate: 51.53%

Updated: 2023年1月24日

huggingface.co

allenai/led-base-16384-ms2

Total runs: 254

Run Growth: 176

Growth Rate: 69.29%

Updated: 2023年10月30日

huggingface.co

allenai/drug_combinations_lm_pubmedbert

Total runs: 218

Run Growth: 204

Growth Rate: 93.58%

Updated: 2022年10月20日

huggingface.co

allenai/OLMo-2-1124-7B-RM

Total runs: 210

Run Growth: 82

Growth Rate: 39.23%

Updated: 2025年1月6日

huggingface.co

allenai/open-instruct-stanford-alpaca-7b

Total runs: 194

Run Growth: 22

Growth Rate: 11.46%

Updated: 2023年6月20日

huggingface.co

allenai/OLMo-2-1124-13B-DPO

Total runs: 189

Run Growth: -216

Growth Rate: -108.00%

Updated: 2025年1月24日

huggingface.co

allenai/llama-3.1-tulu-2-8b-uf-mean-rm

Total runs: 179

Run Growth: -41

Growth Rate: -22.91%

Updated: 2024年8月15日

huggingface.co

allenai/bart-large-multi_lexsum-source-multitask

Total runs: 177

Run Growth: 162

Growth Rate: 91.53%

Updated: 2023年1月24日

huggingface.co

allenai/unifiedqa-v2-t5-base-1363200

Total runs: 176

Run Growth: -178

Growth Rate: -101.14%

Updated: 2023年1月24日

allenai / Llama-3.1-Tulu-3-8B-DPO

Introduction of Llama-3.1-Tulu-3-8B-DPO

Model Details of Llama-3.1-Tulu-3-8B-DPO

Llama-3.1-Tulu-3-8B-DPO

Model description

Model Sources

Model Family

Using the model

Loading with HuggingFace

VLLM

Chat template

System prompt

Bias, Risks, and Limitations

Performance

Hyperparamters

License and use

Citation

Runs of allenai Llama-3.1-Tulu-3-8B-DPO on huggingface.co

More Information About Llama-3.1-Tulu-3-8B-DPO huggingface.co Model

More Llama-3.1-Tulu-3-8B-DPO license Visit here:

Llama-3.1-Tulu-3-8B-DPO huggingface.co

Llama-3.1-Tulu-3-8B-DPO huggingface.co Url

allenai Llama-3.1-Tulu-3-8B-DPO online free

allenai Llama-3.1-Tulu-3-8B-DPO online free url in huggingface.co:

Llama-3.1-Tulu-3-8B-DPO install

Llama-3.1-Tulu-3-8B-DPO install url in huggingface.co:

Url of Llama-3.1-Tulu-3-8B-DPO

Llama-3.1-Tulu-3-8B-DPO huggingface.co Url

Provider of Llama-3.1-Tulu-3-8B-DPO huggingface.co

Other API from allenai