saiga_llama3_8b huggingface.co api & IlyaGusev saiga_llama3_8b github AI Model

Introduction of saiga_llama3_8b

Model Details of saiga_llama3_8b

Saiga/Llama3 8B, Russian Llama-3-based chatbot

Based on Llama-3 8B Instruct .

Llama.cpp version: link

Colab: link

Prompt format

ОСТОРОЖНО! WARNING! LET OP!

I've changed the prompt format from ChatML to the original Llama-3 format in v4 . Don't forget to switch formats!

v4, v5, v6+ : LLama-3 prompt format:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

Ты — Сайга, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им.<|eot_id|><|start_header_id|>user<|end_header_id|>

Как дела?<|eot_id|><|start_header_id|>assistant<|end_header_id|>

Отлично, а у тебя?<|eot_id|><|start_header_id|>user<|end_header_id|>

Шикарно. Как пройти в библиотеку?<|eot_id|><|start_header_id|>assistant<|end_header_id|>

v2, v3 : ChatML prompt format:

<|im_start|>system
Ты — Сайга, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им.<|im_end|>
<|im_start|>user
Как дела?<|im_end|>
<|im_start|>assistant
Отлично, а у тебя?<|im_end|>
<|im_start|>user
Шикарно. Как пройти в библиотеку?<|im_end|>
<|im_start|>assistant

Code example

# Исключительно ознакомительный пример.
# НЕ НАДО ТАК ИНФЕРИТЬ МОДЕЛЬ В ПРОДЕ.
# См. https://github.com/vllm-project/vllm или https://github.com/huggingface/text-generation-inference

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, GenerationConfig

MODEL_NAME = "IlyaGusev/saiga_llama3_8b"
DEFAULT_SYSTEM_PROMPT = "Ты — Сайга, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им."

model = AutoModelForCausalLM.from_pretrained(
    MODEL_NAME,
    load_in_8bit=True,
    torch_dtype=torch.bfloat16,
    device_map="auto"
)
model.eval()

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
generation_config = GenerationConfig.from_pretrained(MODEL_NAME)
print(generation_config)

inputs = ["Почему трава зеленая?", "Сочини длинный рассказ, обязательно упоминая следующие объекты. Дано: Таня, мяч"]
for query in inputs:
    prompt = tokenizer.apply_chat_template([{
        "role": "system",
        "content": DEFAULT_SYSTEM_PROMPT
    }, {
        "role": "user",
        "content": query
    }], tokenize=False, add_generation_prompt=True)
    data = tokenizer(prompt, return_tensors="pt", add_special_tokens=False)
    data = {k: v.to(model.device) for k, v in data.items()}
    output_ids = model.generate(**data, generation_config=generation_config)[0]
    output_ids = output_ids[len(data["input_ids"][0]):]
    output = tokenizer.decode(output_ids, skip_special_tokens=True).strip()
    print(query)
    print(output)
    print()
    print("==============================")
    print()

Output examples

User: Почему трава зеленая? 
Saiga: Трава выглядит зеленым из-за наличия в ней хлорофилла – пигмента, который помогает растениям поглощать солнечную энергию для процесса фотосинтеза.
Хлорофилл поглощает световые лучи с длиной волны 400-500 нанометров (ближе к синему) и отражает лучи с длиной волны 600-700 нанометров (ближе к красному), что делает его видимым как зеленый цвет.
Таким образом, зеленый цвет является результатом того, как хлорофилл взаимодействует со светом, обеспечивая растения необходимой энергии для роста и развития.

User: Сочини длинный рассказ, обязательно упоминая следующие объекты. Дано: Таня, мяч
Saiga: В небольшой деревушке под открытым небом жила девочка по имени Таня. Она была очень активной и любящей играть в различные виды спорта. Однажды, когда она прогуливалась по полям, увидела на земле лежащий мяч. Это был старый, но все еще пригодный для игры мячик.
Таня сразу же решила взять его себе домой и показать своему другу Мише. Мишка тоже обожал футбол и всегда рад был бы провести время вместе со своими друзьями. Когда они вернулись домой, Таня показала ему найденный мячик и предложила сыграть в футбол.
Мишка был счастлив, что у них появился новый мяч, и они сразу же начали готовиться к матчу. Они построили две команды и начали игру. Играли они до вечера, а потом решили сделать перерыв и приготовить ужин.
После ужина они продолжили игру, теперь уже с большим энтузиазмом и страстью. В конце концов, победителем стала команда Тани, которая забила больше голов. Все закончилось веселым праздником и хорошими воспоминаниями о проведенном времени.
На следующий день Таня и Миша решили пригласить своих других друзей на игру. Они собрали всех в одном месте и провели ещё один захватывающий матч. Этот день также стал незабываемым благодаря новому мячу, который принес так много радости и улыбок.
Итак, этот маленький мяч, который Таня нашла на поле, стал не просто предметом для игр, но и символом дружбы и веселья между детьми. Он помог им создать незабываемые моменты и укрепил их отношения.

Versions

v7:

17e1bcc9c6d3e31852a95a168f7d3778f395cd2a
Other name: saiga_llama3_8b_abliterated_sft_m11_d7_kto_m7_d7
SFT dataset config: sft_d7.json
SFT model config: saiga_llama3_8b_sft_m11.json
KTO dataset config: pref_d7.json
KTO model config: saiga_llama3_8b_kto_m7.json
SFT wandb: link
KTO wandb: link

v6:

b662833f247ca04f1843b356e7ff3ee4aef8086a
Other name: saiga_llama3_8b_sft_m10_d1_kto_m2_d2
SFT dataset config: sft_d1.json
SFT model config: saiga_llama3_8b_sft_m10.json
KTO dataset config: pref_d2.json
KTO model config: saiga_llama3_8b_kto_m2.json
SFT wandb: link
KTO wandb: link

v5:

d947b00c56683cd4b2f7ce707edef89318027be4
KTO-tune over v4, dataset: lmsys_clean_ru_preferences
wandb link

v4:

1cc945d4ca2c7901cf989e7edaac52ab24f1a7dd
dataset: saiga_scored , scores >= 8, c66032920556c0f21bbbed05e7e04433ec954c3d
wandb link

v3:

c588356cd60bdee54d52c2dd5a2445acca8aa5c3
dataset: saiga_scored , scores >= 8, d51cf8060bdc90023da8cf1c3f113f9193d6569b
wandb link

v2:

ae61b4f9b34fac9856d361ea78c66284a00e4f0b
dataset code revision d0d123dd221e10bb2a3383bcb1c6e4efe1b4a28a
wandb link
5 datasets: ru_turbo_saiga, ru_sharegpt_cleaned, oasst1_ru_main_branch, gpt_roleplay_realm, ru_instruct_gpt4
Datasets merging script: create_short_chat_set.py

Evaluation

Dataset: https://github.com/IlyaGusev/rulm/blob/master/self_instruct/data/tasks.jsonl
Framework: https://github.com/tatsu-lab/alpaca_eval
Evaluator: alpaca_eval_cot_gpt4_turbo_fn

Pivot: chatgpt_3_5_turbo

model	length_controlled_winrate	win_rate	standard_error	avg_length
chatgpt_4_turbo	76.04	90.00	1.46	1270
chatgpt_3_5_turbo	50.00	50.00	0.00	536
saiga_llama3_8b, v6	49.33	68.31	2.26	1262
sfr-iter-dpo	49.11	74.94	2.13	1215
suzume	49.05	71.57	2.20	1325
saiga_llama3_8b, v7	48.95	69.40	2.25	1266
saiga_llama3_8b, v5	47.13	66.18	2.31	1194
saiga_llama3_8b, v4	43.64	65.90	2.31	1200
saiga_llama3_8b, v3	36.97	61.08	2.38	1162
saiga_llama3_8b, v2	33.07	48.19	2.45	1166
saiga_mistral_7b	23.38	35.99	2.34	949

Pivot: sfr

model	length_controlled_winrate	win_rate	standard_error	avg_length
sfr	50.00	50.00	0.00	1215
saiga_llama3_8b, v7	48.95	49.16	2.46	1266
saiga_llama3_8b, v6	46.91	47.23	2.45	1262
suzume_8b	43.69	48.19	2.46	1325

Runs of IlyaGusev saiga_llama3_8b on huggingface.co

14.1K

Total runs

419

24-hour runs

690

3-day runs

496

7-day runs

5.0K

30-day runs

More Information About saiga_llama3_8b huggingface.co Model

More saiga_llama3_8b license Visit here:

https://choosealicense.com/licenses/llama3

saiga_llama3_8b huggingface.co

saiga_llama3_8b huggingface.co is an AI model on huggingface.co that provides saiga_llama3_8b's model effect (), which can be used instantly with this IlyaGusev saiga_llama3_8b model. huggingface.co supports a free trial of the saiga_llama3_8b model, and also provides paid use of the saiga_llama3_8b. Support call saiga_llama3_8b model through api, including Node.js, Python, http.

saiga_llama3_8b huggingface.co Url

https://huggingface.co/IlyaGusev/saiga_llama3_8b

IlyaGusev saiga_llama3_8b online free

saiga_llama3_8b huggingface.co is an online trial and call api platform, which integrates saiga_llama3_8b's modeling effects, including api services, and provides a free online trial of saiga_llama3_8b, you can try saiga_llama3_8b online for free by clicking the link below.

IlyaGusev saiga_llama3_8b online free url in huggingface.co:

https://huggingface.co/IlyaGusev/saiga_llama3_8b

saiga_llama3_8b install

saiga_llama3_8b is an open source model from GitHub that offers a free installation service, and any user can find saiga_llama3_8b on GitHub to install. At the same time, huggingface.co provides the effect of saiga_llama3_8b install, users can directly use saiga_llama3_8b installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

saiga_llama3_8b install url in huggingface.co:

https://huggingface.co/IlyaGusev/saiga_llama3_8b

huggingface.co

IlyaGusev/saiga2_13b_gguf

Total runs: 13.9K

Run Growth: 2.5K

Growth Rate: 17.86%

Updated: September 26 2023

huggingface.co

IlyaGusev/mbart_ru_sum_gazeta

Total runs: 11.9K

Run Growth: 2.3K

Growth Rate: 19.22%

Updated: March 17 2023

huggingface.co

IlyaGusev/rut5_base_headline_gen_telegram

Total runs: 11.0K

Run Growth: 7.1K

Growth Rate: 64.33%

Updated: December 19 2021

huggingface.co

IlyaGusev/saiga_llama3_8b_gguf

Total runs: 6.7K

Run Growth: 3.0K

Growth Rate: 44.26%

Updated: July 04 2024

huggingface.co

IlyaGusev/saiga_mistral_7b_merged

Total runs: 6.6K

Run Growth: 5.8K

Growth Rate: 87.74%

Updated: May 10 2024

huggingface.co

IlyaGusev/saiga_mistral_7b_gguf

Total runs: 4.7K

Run Growth: -9.0K

Growth Rate: -193.14%

Updated: April 04 2024

huggingface.co

IlyaGusev/rubertconv_toxic_clf

Total runs: 3.9K

Run Growth: 2.4K

Growth Rate: 64.55%

Updated: July 13 2022

huggingface.co

IlyaGusev/rugpt_large_turbo_instructed

Total runs: 812

Run Growth: 225

Growth Rate: 28.20%

Updated: March 25 2023

huggingface.co

IlyaGusev/rugpt3medium_sum_gazeta

Total runs: 673

Run Growth: 118

Growth Rate: 17.53%

Updated: July 13 2022

huggingface.co

IlyaGusev/rut5_base_sum_gazeta

Total runs: 592

Run Growth: -184

Growth Rate: -31.08%

Updated: July 13 2022

huggingface.co

IlyaGusev/saiga2_70b_gguf

Total runs: 413

Run Growth: 264

Growth Rate: 63.92%

Updated: September 05 2023

huggingface.co

IlyaGusev/xlm_roberta_large_headline_cause_full

Total runs: 389

Run Growth: -17

Growth Rate: -4.40%

Updated: July 13 2022

huggingface.co

IlyaGusev/rubertconv_toxic_editor

Total runs: 366

Run Growth: 36

Growth Rate: 14.34%

Updated: July 13 2022

huggingface.co

IlyaGusev/saiga2_7b_gguf

Total runs: 268

Run Growth: -205

Growth Rate: -76.49%

Updated: September 17 2023

huggingface.co

IlyaGusev/rulm_gpt_neo_small

Total runs: 225

Run Growth: 212

Growth Rate: 94.22%

Updated: February 07 2023

huggingface.co

IlyaGusev/fred_t5_ru_turbo_alpaca

Total runs: 150

Run Growth: 102

Growth Rate: 63.35%

Updated: December 04 2023

huggingface.co

IlyaGusev/ru-word-stress-transformer

Total runs: 147

Run Growth: 123

Growth Rate: 84.25%

Updated: December 31 2022

huggingface.co

IlyaGusev/rubert_ext_sum_gazeta

Total runs: 110

Run Growth: 35

Growth Rate: 31.82%

Updated: July 13 2022

huggingface.co

IlyaGusev/llama_7b_ru_turbo_alpaca_lora_merged

Total runs: 66

Run Growth: 38

Growth Rate: 55.07%

Updated: April 03 2023

huggingface.co

IlyaGusev/rubert_telegram_headlines

Total runs: 40

Run Growth: -179

Growth Rate: -447.50%

Updated: July 13 2022

huggingface.co

IlyaGusev/rugpt_medium_turbo_instructed

Total runs: 29

Run Growth: 19

Growth Rate: 54.29%

Updated: March 25 2023

huggingface.co

IlyaGusev/rut5_large_turbo_instructed

Total runs: 24

Run Growth: -535

Growth Rate: -2229.17%

Updated: March 26 2023

huggingface.co

IlyaGusev/saiga_gemma_9b

Total runs: 23

Run Growth: 0

Growth Rate: 0.00%

Updated: February 24 2024

huggingface.co

IlyaGusev/xlm_roberta_large_headline_cause_simple

Total runs: 22

Run Growth: -7

Growth Rate: -36.84%

Updated: July 13 2022

huggingface.co

IlyaGusev/rurp_seraph_7b

Total runs: 12

Run Growth: 5

Growth Rate: 41.67%

Updated: April 18 2024

huggingface.co

IlyaGusev/ruadapt_ficbook_llama2_7b

Total runs: 11

Run Growth: 2

Growth Rate: 18.18%

Updated: December 26 2023

huggingface.co

IlyaGusev/sber_rut5_filler

Total runs: 8

Run Growth: -19

Growth Rate: -237.50%

Updated: July 13 2022

huggingface.co

IlyaGusev/rut5_tox

Total runs: 8

Run Growth: -1

Growth Rate: -14.29%

Updated: July 13 2022

huggingface.co

IlyaGusev/news_tg_rubert

Total runs: 5

Run Growth: 1

Growth Rate: 20.00%

Updated: June 17 2021

huggingface.co

IlyaGusev/roberta-base-informal-tagger

Total runs: 5

Run Growth: -3

Growth Rate: -60.00%

Updated: July 29 2022

huggingface.co

IlyaGusev/mt0_xxl_ru_turbo_alpaca_lora_merged

Total runs: 3

Run Growth: -3

Growth Rate: -100.00%

Updated: March 29 2023

huggingface.co

IlyaGusev/t5-base-filler-informal

Total runs: 2

Run Growth: -4

Growth Rate: -200.00%

Updated: July 29 2022

huggingface.co

IlyaGusev/gen_title_tg_bottleneck

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated: November 28 2020

huggingface.co

IlyaGusev/gen_title_tg_bottleneck_encoder

Total runs: 1

Run Growth: -26

Growth Rate: 0.00%

Updated: May 19 2021

huggingface.co

IlyaGusev/saiga_30b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: July 26 2023

huggingface.co

IlyaGusev/gigasaiga_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: September 07 2023

huggingface.co

IlyaGusev/saiga_mistral_7b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: February 13 2024

huggingface.co

IlyaGusev/saiga_13b_ggml

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: July 26 2023

huggingface.co

IlyaGusev/llama_7b_ru_turbo_alpaca_lora_llamacpp

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: April 02 2023

huggingface.co

IlyaGusev/llama_13b_ru_turbo_alpaca_lora_llamacpp

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: April 02 2023

huggingface.co

IlyaGusev/saiga_7b_ggml

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: July 26 2023

huggingface.co

IlyaGusev/saiga2_7b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: September 17 2023

huggingface.co

IlyaGusev/llama_13b_ru_turbo_alpaca_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: May 16 2023

huggingface.co

IlyaGusev/mt0_xxl_ru_turbo_alpaca_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: April 01 2023

huggingface.co

IlyaGusev/saiga2_13b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: August 07 2023

huggingface.co

IlyaGusev/llama_7b_ru_turbo_alpaca_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: May 16 2023

huggingface.co

IlyaGusev/rpr_7b

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: May 18 2023

huggingface.co

IlyaGusev/saiga_7b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: April 04 2024

huggingface.co

IlyaGusev/saiga2_70b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: October 10 2023

huggingface.co

IlyaGusev/saiga_30b_ggml

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: July 26 2023

huggingface.co

IlyaGusev/saiga_65b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: June 26 2023

huggingface.co

IlyaGusev/saiga_13b_lora

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: July 26 2023

IlyaGusev / saiga_llama3_8b

Introduction of saiga_llama3_8b

Model Details of saiga_llama3_8b

Saiga/Llama3 8B, Russian Llama-3-based chatbot

Prompt format

Code example

Output examples

Versions

Evaluation

Runs of IlyaGusev saiga_llama3_8b on huggingface.co

More Information About saiga_llama3_8b huggingface.co Model

More saiga_llama3_8b license Visit here:

saiga_llama3_8b huggingface.co

saiga_llama3_8b huggingface.co Url

IlyaGusev saiga_llama3_8b online free

IlyaGusev saiga_llama3_8b online free url in huggingface.co:

saiga_llama3_8b install

saiga_llama3_8b install url in huggingface.co:

Url of saiga_llama3_8b

saiga_llama3_8b huggingface.co Url

Provider of saiga_llama3_8b huggingface.co

Other API from IlyaGusev