allenai / open-instruct-pythia-6.9b-tulu

huggingface.co
Total runs: 1.6K
24-hour runs: 267
7-day runs: -328
30-day runs: 885
Last updated: June 13, 2023
Task: text-generation

Model Details of open-instruct-pythia-6.9b-tulu

Pythia 6.9B Tulu

This model is a 6.9B Pythia model finetuned on a mixture of instruction datasets (FLAN V2, CoT, Dolly, Open Assistant 1, GPT4-Alpaca, Code-Alpaca, and ShareGPT).

This was trained as part of the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources". The codebase used to train and evaluate this model can be found at https://github.com/allenai/open-instruct.

This model is licensed under the AI model license given in LICENSE.txt, with the original model license at pythia_license.txt.

Usage

Simply download and use this model directly; unlike the other open-instruct models, it is not a weight diff, so no merging step is required.
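
For example, a minimal loading sketch with the Hugging Face transformers library (assuming a CUDA GPU with roughly 14 GB of memory; on CPU, drop the .to("cuda") call):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/open-instruct-pythia-6.9b-tulu"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# Full weights are downloaded directly; no diff merging step is required.
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
model = model.to("cuda")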

Input Format

The model is trained to use the following format (note the newlines):

<|user|>
Your message here!
<|assistant|>

For best results, format all inputs in this manner. Make sure to include a newline after <|assistant|>; omitting it can noticeably degrade generation quality.
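
Continuing from the loading sketch above, here is an example of building a prompt in this format and generating a reply (the generation settings are illustrative, not the paper's evaluation settings):

# Note the newline after <|assistant|>; it matters for generation quality.
prompt = "<|user|>\nWhat is the capital of France?\n<|assistant|>\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
reply = tokenizer.decode(output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(reply)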

Performance

Here is the performance of this model across the benchmarks explored in our paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources":

Benchmark                    Score
MMLU 0-shot                  34.1
MMLU 5-shot                  34.6
GSM Direct                    3.5
GSM CoT                      15.5
BBH Direct                   31.3
BBH CoT                      27.8
TydiQA Gold-Passage          33.4
TydiQA Closed-book            3.8
Codex-Eval Pass@1            14.3
Codex-Eval Pass@10           21.4
AlpacaFarm vs Davinci-003     9.2
Average                      19.8

If you use this model, please cite our work, the Pythia paper, and the original datasets:

@misc{wang2023far,
      title={How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources}, 
      author={Yizhong Wang and Hamish Ivison and Pradeep Dasigi and Jack Hessel and Tushar Khot and Khyathi Raghavi Chandu and David Wadden and Kelsey MacMillan and Noah A. Smith and Iz Beltagy and Hannaneh Hajishirzi},
      year={2023},
      eprint={2306.04751},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
@misc{biderman2023pythia,
      title={Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling}, 
      author={Stella Biderman and Hailey Schoelkopf and Quentin Anthony and Herbie Bradley and Kyle O'Brien and Eric Hallahan and Mohammad Aflah Khan and Shivanshu Purohit and USVSN Sai Prashanth and Edward Raff and Aviya Skowron and Lintang Sutawika and Oskar van der Wal},
      year={2023},
      eprint={2304.01373},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
@misc{dolly,
  author = {Databricks},
  title = {Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {Blog post},
  url = {https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm}
}
@article{longpre2023flan,
  title={The Flan Collection: Designing Data and Methods for Effective Instruction Tuning},
  author={Longpre, Shayne and Hou, Le and Vu, Tu and Webson, Albert and Chung, Hyung Won and Tay, Yi and Zhou, Denny and Le, Quoc V and Zoph, Barret and Wei, Jason and others},
  journal={arXiv preprint arXiv:2301.13688},
  year={2023}
}
@misc{köpf2023openassistant,
      title={OpenAssistant Conversations -- Democratizing Large Language Model Alignment}, 
      author={Andreas Köpf and Yannic Kilcher and Dimitri von Rütte and Sotiris Anagnostidis and Zhi-Rui Tam and Keith Stevens and Abdullah Barhoum and Nguyen Minh Duc and Oliver Stanley and Richárd Nagyfi and Shahul ES and Sameer Suri and David Glushkov and Arnav Dantuluri and Andrew Maguire and Christoph Schuhmann and Huu Nguyen and Alexander Mattick},
      year={2023},
      eprint={2304.07327},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
@article{peng2023instruction,
  title={Instruction Tuning with GPT-4},
  author={Peng, Baolin and Li, Chunyuan and He, Pengcheng and Galley, Michel and Gao, Jianfeng},
  journal={arXiv preprint arXiv:2304.03277},
  year={2023}
}
@misc{codealpaca,
  author = {Sahil Chaudhary},
  title = {Code Alpaca: An Instruction-following LLaMA model for code generation},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/sahil280114/codealpaca}},
}

Runs of allenai open-instruct-pythia-6.9b-tulu on huggingface.co

Total runs: 1.6K
24-hour runs: 267
3-day runs: 273
7-day runs: -328
30-day runs: 885

More Information About open-instruct-pythia-6.9b-tulu huggingface.co Model

open-instruct-pythia-6.9b-tulu huggingface.co

open-instruct-pythia-6.9b-tulu is an AI model hosted on huggingface.co that can be used instantly through the allenai open-instruct-pythia-6.9b-tulu model page. huggingface.co supports a free trial of the open-instruct-pythia-6.9b-tulu model and also offers paid usage. The model can additionally be called through an API from Node.js, Python, or plain HTTP.
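
As a sketch of the plain-HTTP route, the request below targets the standard Hugging Face Inference API endpoint; whether this particular model is currently deployed there is an assumption, and hf_xxx is a placeholder for your own access token:

import requests

API_URL = "https://api-inference.huggingface.co/models/allenai/open-instruct-pythia-6.9b-tulu"
headers = {"Authorization": "Bearer hf_xxx"}  # placeholder token; substitute your own

payload = {
    "inputs": "<|user|>\nWhat is the capital of France?\n<|assistant|>\n",
    "parameters": {"max_new_tokens": 64},
}
response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())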

open-instruct-pythia-6.9b-tulu huggingface.co URL

https://huggingface.co/allenai/open-instruct-pythia-6.9b-tulu

allenai open-instruct-pythia-6.9b-tulu online free

huggingface.co is an online platform for trying models and calling them through an API. It integrates open-instruct-pythia-6.9b-tulu, including API services, and provides a free online trial; you can try open-instruct-pythia-6.9b-tulu for free by clicking the link below.

allenai open-instruct-pythia-6.9b-tulu online free URL on huggingface.co:

https://huggingface.co/allenai/open-instruct-pythia-6.9b-tulu

open-instruct-pythia-6.9b-tulu install

open-instruct-pythia-6.9b-tulu is an open-source model whose code is available on GitHub, where any user can find it and install it for free. At the same time, huggingface.co hosts the model itself, so users can try and debug open-instruct-pythia-6.9b-tulu directly on huggingface.co. Free installation through the API is also supported.
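
For a local install of the weights, one option is the huggingface_hub client, which downloads the full model repository to a local cache (shown here with default settings; adjust cache location or revision as needed):

from huggingface_hub import snapshot_download

# Downloads all files of the model repo and returns the local directory path.
local_dir = snapshot_download("allenai/open-instruct-pythia-6.9b-tulu")
print(local_dir)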

open-instruct-pythia-6.9b-tulu install URL on huggingface.co:

https://huggingface.co/allenai/open-instruct-pythia-6.9b-tulu

URL of open-instruct-pythia-6.9b-tulu

https://huggingface.co/allenai/open-instruct-pythia-6.9b-tulu

Provider of open-instruct-pythia-6.9b-tulu huggingface.co

allenai (organization)

Other API from allenai

Other models from allenai, along with their run statistics, are listed on the organization's huggingface.co page: https://huggingface.co/allenai