chinese-llama-plus-13b-hf huggingface.co api & shibing624 chinese-llama-plus-13b-hf github AI Model

Introduction of chinese-llama-plus-13b-hf

Model Details of chinese-llama-plus-13b-hf

Chinese LLaMA Plus 13B Model

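Release of the Chinese LLaMA-Plus and Alpaca-Plus 13B models

The 13B versions of Chinese LLaMA-Plus and Alpaca-Plus are released, with the following improvements:

Compared with the base version, the training data was expanded further: LLaMA to 120G of text and Alpaca to 4.3M instruction examples, with an emphasis on added scientific-domain data covering physics, chemistry, biology, medicine, earth science, and more
Alpaca was trained with a larger rank and reaches a lower validation loss than the base version
Alpaca evaluation results: 13B scores 74.3, Plus-7B scores 78.2, and Plus-13B scores 80.8; see the performance evaluation for details
Multi-turn responses are noticeably longer than with the older models (the temperature can be increased moderately)
Knowledge Q&A, writing, translation, and similar tasks improve significantly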
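The note about raising the temperature can be illustrated with a small sketch of how temperature rescales next-token probabilities. This is a generic softmax-with-temperature illustration, not code from this model or from textgen; the function name and the logits are made up for the example.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Convert raw scores to probabilities, scaled by a sampling temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for three candidate tokens.
logits = [2.0, 1.0, 0.1]
p_default = softmax_with_temperature(logits, temperature=1.0)
p_hot = softmax_with_temperature(logits, temperature=1.5)
print([round(p, 3) for p in p_default])
print([round(p, 3) for p in p_hot])
```

A higher temperature flattens the distribution, so less likely tokens are sampled more often. That tends to make output more varied, at some cost in predictability.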

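This model is the decapoda-research/llama-13b-hf base model merged with the ziqingyang/chinese-llama-plus-lora-13b LoRA weights and converted to HuggingFace-format weights (.bin files). It can serve as the starting point for further instruction fine-tuning. Because LLaMA is a base model, calling it directly gives poor results.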

test case:

input_text	predict
为什么天空是蓝色的？	天空是蓝色的是因为大气中的气体分子散射了太阳光中的短波长蓝光，使得我们看到的天空呈现出蓝色。

release model weight

chinese-llama-plus-7b model weights: https://huggingface.co/minlik/chinese-llama-plus-7b-merged
chinese-alpaca-plus-7b model weights: https://huggingface.co/shibing624/chinese-alpaca-plus-7b-hf
chinese-llama-plus-13b model weights: https://huggingface.co/shibing624/chinese-llama-plus-13b-hf
chinese-alpaca-plus-13b model weights: https://huggingface.co/shibing624/chinese-alpaca-plus-13b-hf

Usage

This model is open-sourced as part of the textgen project, which supports LLaMA models. It can be called as follows:

Install package:

pip install -U textgen

from textgen import GptModel
model = GptModel("llama", "shibing624/chinese-llama-plus-13b-hf")
r = model.predict(["用一句话描述地球为什么是独一无二的。"])
print(r) # ['地球是独一无二的，因为它拥有独特的大气层、水循环、生物多样性以及其他自然资源，这些都使它成为一个独特的生命支持系统。']

Usage (HuggingFace Transformers)

Without textgen , you can use the model like this:

First, you pass your input through the transformer model, then you get the generated sentence.

Install package:

pip install sentencepiece
pip install "transformers>=4.28.0"

import torch
import transformers
from transformers import LlamaTokenizer, LlamaForCausalLM

def generate_prompt(text):
    return f"""Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{text}

### Response:"""


tokenizer = LlamaTokenizer.from_pretrained('shibing624/chinese-llama-plus-13b-hf')
model = LlamaForCausalLM.from_pretrained('shibing624/chinese-llama-plus-13b-hf').half().cuda()
model.eval()

text = '为什么天空是蓝色的？'
prompt = generate_prompt(text)
input_ids = tokenizer.encode(prompt, return_tensors='pt').to('cuda')


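# Note: temperature, top_k and top_p only take effect when sampling is enabled
# (do_sample=True in this call or in generation_config.json).
with torch.no_grad():
    output_ids = model.generate(
        input_ids=input_ids,
        max_new_tokens=128,
        temperature=1,
        top_k=40,
        top_p=0.9,
        repetition_penalty=1.15
    )  # the result is already on the model's device, so no extra .cuda() is needed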
output = tokenizer.decode(output_ids[0], skip_special_tokens=True)
print(output.replace(text, '').strip())

output:

为什么天空是蓝色的？
天空是蓝色的是因为大气中的气体分子散射了太阳光中的短波长蓝光，使得我们看到的天空呈现出蓝色。
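The decoded text contains the prompt as well as the answer, and removing only the question string leaves the instruction template in place. A minimal sketch of one alternative is to keep only what follows the `### Response:` marker from the prompt template. The helper name `extract_response` is hypothetical, and the `decoded` string is a hand-written stand-in for the result of `tokenizer.decode`.

```python
RESPONSE_MARKER = "### Response:"

def extract_response(decoded: str) -> str:
    """Return only the text after the last response marker, if present."""
    _, found, tail = decoded.rpartition(RESPONSE_MARKER)
    # rpartition returns an empty separator when the marker is missing.
    return tail.strip() if found else decoded.strip()

# Stand-in for a decoded generation (not real model output).
decoded = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n为什么天空是蓝色的？\n\n"
    "### Response:\n天空是蓝色的是因为..."
)
print(extract_response(decoded))
```

Splitting on the last marker keeps the helper safe if the instruction itself happens to contain the marker text.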

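Model source

The merged model weights are released so they can be used directly in one step, saving electricity and reducing carbon emissions.

The weights were merged manually following the multi-LoRA weight merging method (the one intended for Chinese-Alpaca-Plus). Specifically, the decapoda-research/llama-13b-hf base model was merged with the ziqingyang/chinese-llama-plus-lora-13b LoRA weights, and the result was converted to HuggingFace-format weights (.bin files).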
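As background on what merging LoRA weights into a base model means, here is a toy sketch of the standard LoRA update rule, W_merged = W + (alpha / r) * B * A, applied to a single small matrix. It is not the merge script used for this model; a real merge covers every adapted layer and other details, such as vocabulary changes, that are omitted here. The function names and all numbers are made up.

```python
def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]

def merge_lora(weight, lora_a, lora_b, alpha, rank):
    """Return weight + (alpha / rank) * (lora_b @ lora_a)."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(weight, delta)]

# Toy 2x2 layer with a rank-1 adapter; all values are illustrative.
weight = [[1.0, 0.0], [0.0, 1.0]]
lora_a = [[1.0, 2.0]]        # shape (rank, in_features)
lora_b = [[0.5], [0.25]]     # shape (out_features, rank)
merged = merge_lora(weight, lora_a, lora_b, alpha=2.0, rank=1)
print(merged)
```

After merging, the adapter matrices are no longer needed at inference time, which is why the released checkpoint can be loaded as an ordinary LLaMA model.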

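HuggingFace-format weights (.bin files) can be used for:

Training and inference with Transformers
Building a UI with text-generation-webui

PyTorch-format weights (.pth files) can be used for:

Quantization and deployment with the llama.cpp tools

PyTorch-format weights (.pth files): shibing624/chinese-alpaca-plus-13b-pth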

Model files:

chinese-alpaca-plus-13b-hf
|-- config.json
|-- generation_config.json
|-- LICENSE
|-- pytorch_model-00001-of-00003.bin
|-- pytorch_model-00002-of-00003.bin
|-- pytorch_model-00003-of-00003.bin
|-- pytorch_model.bin.index.json
|-- README.md
|-- special_tokens_map.json
|-- tokenizer_config.json
`-- tokenizer.model
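The three `.bin` shards are tied together by `pytorch_model.bin.index.json`, which in the Transformers sharded-checkpoint layout holds a `weight_map` from parameter names to shard files. The sketch below groups parameters by shard. The index content is a hand-written stand-in: its parameter names and `total_size` are illustrative, not copied from the real file, and `tensors_per_shard` is a hypothetical helper.

```python
import json
from collections import defaultdict

# Hand-written stand-in for pytorch_model.bin.index.json (illustrative values).
index_text = json.dumps({
    "metadata": {"total_size": 123},
    "weight_map": {
        "model.embed_tokens.weight": "pytorch_model-00001-of-00003.bin",
        "model.layers.0.self_attn.q_proj.weight": "pytorch_model-00001-of-00003.bin",
        "model.layers.39.mlp.down_proj.weight": "pytorch_model-00003-of-00003.bin",
        "lm_head.weight": "pytorch_model-00003-of-00003.bin",
    },
})

def tensors_per_shard(index_json: str) -> dict:
    """Group parameter names by the shard file that stores them."""
    weight_map = json.loads(index_json)["weight_map"]
    shards = defaultdict(list)
    for name, shard in weight_map.items():
        shards[shard].append(name)
    return dict(shards)

shards = tensors_per_shard(index_text)
for shard, names in sorted(shards.items()):
    print(shard, len(names))
```

With the real file, replace `index_text` with the contents read from disk. `from_pretrained` resolves shards through this index automatically, so this is only useful for inspection.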

Hardware requirement: 25 GB of GPU memory
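The 25 GB figure is roughly what one would expect for holding the weights in half precision, as the `.half()` call in the usage example does. The back-of-the-envelope sketch below assumes about 13 billion parameters, a round number that ignores the extended Chinese vocabulary. It counts the weights only, not activations or the attention cache.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to store the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30

ASSUMED_PARAMS = 13e9  # assumption: roughly 13 billion parameters

fp16_gib = weight_memory_gib(ASSUMED_PARAMS, 2)  # 2 bytes per fp16 value
fp32_gib = weight_memory_gib(ASSUMED_PARAMS, 4)  # 4 bytes per fp32 value
print(f"fp16 weights: {fp16_gib:.1f} GiB")
print(f"fp32 weights: {fp32_gib:.1f} GiB")
```

Under this assumption the fp16 weights alone come to a little over 24 GiB, so a GPU with exactly that much memory leaves little headroom for generation.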

Fine-tuning datasets

I have collected some public fine-tuning datasets:

500K Chinese ChatGPT-instruction Belle dataset: BelleGroup/train_0.5M_CN
1M Chinese ChatGPT-instruction Belle dataset: BelleGroup/train_1M_CN
50K English ChatGPT-instruction Alpaca dataset: 50k English Stanford Alpaca dataset
50K Chinese GPT-4-instruction Alpaca dataset: shibing624/alpaca-zh
690K Chinese-instruction Guanaco dataset (500K Belle + 190K Guanaco): Chinese-Vicuna/guanaco_belle_merge_v1.0

To train a LLaMA model, see https://github.com/shibing624/textgen
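For instruction fine-tuning, each dataset record has to be turned into one training string. The sketch below reuses the prompt template from the Transformers usage example and assumes Alpaca-style records with `instruction`, `input`, and `output` fields. Appending `input` to the instruction and separating the answer with a newline are simplifications for illustration, not necessarily the formatting textgen applies. The helper name and the record are hypothetical.

```python
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)

def build_training_text(record: dict) -> str:
    """Format one Alpaca-style record as prompt plus target answer."""
    instruction = record["instruction"]
    extra = record.get("input", "")
    if extra:
        # Simplification: fold the optional input into the instruction.
        instruction = f"{instruction}\n{extra}"
    prompt = PROMPT_TEMPLATE.format(instruction=instruction)
    return f"{prompt}\n{record['output']}"

# Hypothetical record, not taken from any of the datasets listed above.
record = {
    "instruction": "Translate the sentence into English.",
    "input": "今天天气很好。",
    "output": "The weather is nice today.",
}
text = build_training_text(record)
print(text)
```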

Citation

@software{textgen,
  author = {Xu Ming},
  title = {textgen: Implementation of language model finetune},
  year = {2023},
  url = {https://github.com/shibing624/textgen},
}

Reference

https://github.com/ymcui/Chinese-LLaMA-Alpaca

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	41.06
ARC (25-shot)	46.25
HellaSwag (10-shot)	71.88
MMLU (5-shot)	40.74
TruthfulQA (0-shot)	39.89
Winogrande (5-shot)	73.09
GSM8K (5-shot)	0.53
DROP (3-shot)	15.08
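As a quick consistency check, the Avg. row can be recomputed as the plain mean of the seven benchmark scores. The snippet copies the values from the table. Recomputing from these rounded scores can differ from the reported 41.06 in the last digit, presumably because the leaderboard averaged unrounded scores.

```python
# Scores copied from the table above.
scores = {
    "ARC (25-shot)": 46.25,
    "HellaSwag (10-shot)": 71.88,
    "MMLU (5-shot)": 40.74,
    "TruthfulQA (0-shot)": 39.89,
    "Winogrande (5-shot)": 73.09,
    "GSM8K (5-shot)": 0.53,
    "DROP (3-shot)": 15.08,
}

average = sum(scores.values()) / len(scores)
print(f"mean of {len(scores)} benchmarks: {average:.2f}")
```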

License information for chinese-llama-plus-13b-hf:

https://choosealicense.com/licenses/other
