deepseek-coder-33B-instruct-AWQ huggingface.co api & TheBloke deepseek-coder-33B-instruct-AWQ github AI Model

Introduction of deepseek-coder-33B-instruct-AWQ

Model Details of deepseek-coder-33B-instruct-AWQ

Chat & support: TheBloke's Discord server

Want to contribute? TheBloke's Patreon page

TheBloke's LLM work is generously supported by a grant from andreessen horowitz (a16z)

Deepseek Coder 33B Instruct - AWQ

Model creator: DeepSeek
Original model: Deepseek Coder 33B Instruct

Description

This repo contains AWQ model files for DeepSeek's Deepseek Coder 33B Instruct .

These files were quantised using hardware kindly provided by Massed Compute .

About AWQ

AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization. Compared to GPTQ, it offers faster Transformers-based inference with equivalent or better quality compared to the most commonly used GPTQ settings.

It is supported by:

Text Generation Webui - using Loader: AutoAWQ
vLLM - Llama and Mistral models only
Hugging Face Text Generation Inference (TGI)
AutoAWQ - for use from Python code

Repositories available

AWQ model(s) for GPU inference.
GPTQ models for GPU inference, with multiple quantisation parameter options.
2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference
DeepSeek's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions

Prompt template: DeepSeek

You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer.
### Instruction:
{prompt}
### Response:

Provided files, and AWQ parameters

For my first release of AWQ models, I am releasing 128g models only. I will consider adding 32g as well if there is interest, and once I have done perplexity and evaluation comparisons, but at this time 32g models are still not fully tested with AutoAWQ and vLLM.

Models are released as sharded safetensors files.

Branch	Bits	GS	AWQ Dataset	Seq Len	Size
main	4	128	Evol Instruct Code	16384	18.01 GB

How to easily download and use this model in text-generation-webui

Please make sure you're using the latest version of text-generation-webui .

It is strongly recommended to use the text-generation-webui one-click-installers unless you're sure you know how to make a manual install.

Click the Model tab .
Under Download custom model or LoRA , enter TheBloke/deepseek-coder-33B-instruct-AWQ .
Click Download .
The model will start downloading. Once it's finished it will say "Done".
In the top left, click the refresh icon next to Model .
In the Model dropdown, choose the model you just downloaded: deepseek-coder-33B-instruct-AWQ
Select Loader: AutoAWQ .
Click Load, and the model will load and is now ready for use.
If you want any custom settings, set them and then click Save settings for this model followed by Reload the Model in the top right.
Once you're ready, click the Text Generation tab and enter a prompt to get started!

Multi-user inference server: vLLM

Documentation on installing and using vLLM can be found here .

Please ensure you are using vLLM version 0.2 or later.
When using vLLM as a server, pass the --quantization awq parameter.

For example:

python3 python -m vllm.entrypoints.api_server --model TheBloke/deepseek-coder-33B-instruct-AWQ --quantization awq

When using vLLM from Python code, again set quantization=awq .

For example:

from vllm import LLM, SamplingParams

prompts = [
    "Tell me about AI",
    "Write a story about llamas",
    "What is 291 - 150?",
    "How much wood would a woodchuck chuck if a woodchuck could chuck wood?",
]
prompt_template=f'''You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer.
### Instruction:
{prompt}
### Response:
'''

prompts = [prompt_template.format(prompt=prompt) for prompt in prompts]

sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

llm = LLM(model="TheBloke/deepseek-coder-33B-instruct-AWQ", quantization="awq", dtype="auto")

outputs = llm.generate(prompts, sampling_params)

# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

Multi-user inference server: Hugging Face Text Generation Inference (TGI)

Use TGI version 1.1.0 or later. The official Docker container is: ghcr.io/huggingface/text-generation-inference:1.1.0

Example Docker parameters:

--model-id TheBloke/deepseek-coder-33B-instruct-AWQ --port 3000 --quantize awq --max-input-length 3696 --max-total-tokens 4096 --max-batch-prefill-tokens 4096

Example Python code for interfacing with TGI (requires huggingface-hub 0.17.0 or later):

pip3 install huggingface-hub

from huggingface_hub import InferenceClient

endpoint_url = "https://your-endpoint-url-here"

prompt = "Tell me about AI"
prompt_template=f'''You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer.
### Instruction:
{prompt}
### Response:
'''

client = InferenceClient(endpoint_url)
response = client.text_generation(prompt,
                                  max_new_tokens=128,
                                  do_sample=True,
                                  temperature=0.7,
                                  top_p=0.95,
                                  top_k=40,
                                  repetition_penalty=1.1)

print(f"Model output: ", response)

Inference from Python code using AutoAWQ

Install the AutoAWQ package

Requires: AutoAWQ 0.1.1 or later.

pip3 install autoawq

If you have problems installing AutoAWQ using the pre-built wheels, install it from source instead:

pip3 uninstall -y autoawq
git clone https://github.com/casper-hansen/AutoAWQ
cd AutoAWQ
pip3 install .

AutoAWQ example code

from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_name_or_path = "TheBloke/deepseek-coder-33B-instruct-AWQ"

# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name_or_path, trust_remote_code=False)
# Load model
model = AutoAWQForCausalLM.from_quantized(model_name_or_path, fuse_layers=True,
                                          trust_remote_code=False, safetensors=True)

prompt = "Tell me about AI"
prompt_template=f'''You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer.
### Instruction:
{prompt}
### Response:
'''

print("*** Running model.generate:")

token_input = tokenizer(
    prompt_template,
    return_tensors='pt'
).input_ids.cuda()

# Generate output
generation_output = model.generate(
    token_input,
    do_sample=True,
    temperature=0.7,
    top_p=0.95,
    top_k=40,
    max_new_tokens=512
)

# Get the tokens from the output, decode them, print them
token_output = generation_output[0]
text_output = tokenizer.decode(token_output)
print("LLM output: ", text_output)

"""
# Inference should be possible with transformers pipeline as well in future
# But currently this is not yet supported by AutoAWQ (correct as of September 25th 2023)
from transformers import pipeline

print("*** Pipeline:")
pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    max_new_tokens=512,
    do_sample=True,
    temperature=0.7,
    top_p=0.95,
    top_k=40,
    repetition_penalty=1.1
)

print(pipe(prompt_template)[0]['generated_text'])
"""

Compatibility

The files provided are tested to work with:

text-generation-webui using Loader: AutoAWQ .
vLLM version 0.2.0 and later.
Hugging Face Text Generation Inference (TGI) version 1.1.0 and later.
AutoAWQ version 0.1.1 and later.

Discord

For further support, and discussions on these models and AI in general, join us at:

TheBloke AI's Discord server

Thanks, and how to contribute

Thanks to the chirper.ai team!

Thanks to Clay from gpus.llm-utils.org !

I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.

If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.

Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.

Patreon: https://patreon.com/TheBlokeAI
Ko-Fi: https://ko-fi.com/TheBlokeAI

Special thanks to : Aemon Algiz.

Patreon special mentions : Brandon Frisco, LangChain4j, Spiking Neurons AB, transmissions 11, Joseph William Delisle, Nitin Borwankar, Willem Michiel, Michael Dempsey, vamX, Jeffrey Morgan, zynix, jjj, Omer Bin Jawed, Sean Connelly, jinyuan sun, Jeromy Smith, Shadi, Pawan Osman, Chadd, Elijah Stavena, Illia Dulskyi, Sebastain Graf, Stephen Murray, terasurfer, Edmond Seymore, Celu Ramasamy, Mandus, Alex, biorpg, Ajan Kanaga, Clay Pascal, Raven Klaugh, 阿明, K, ya boyyy, usrbinkat, Alicia Loh, John Villwock, ReadyPlayerEmma, Chris Smitley, Cap'n Zoog, fincy, GodLy, S_X, sidney chen, Cory Kujawski, OG, Mano Prime, AzureBlack, Pieter, Kalila, Spencer Kim, Tom X Nguyen, Stanislav Ovsiannikov, Michael Levine, Andrey, Trailburnt, Vadim, Enrico Ros, Talal Aujan, Brandon Phillips, Jack West, Eugene Pentland, Michael Davis, Will Dee, webtim, Jonathan Leane, Alps Aficionado, Rooh Singh, Tiffany J. Kim, theTransient, Luke @flexchar, Elle, Caitlyn Gatomon, Ari Malik, subjectnull, Johann-Peter Hartmann, Trenton Dambrowitz, Imad Khwaja, Asp the Wyvern, Emad Mostaque, Rainer Wilmers, Alexandros Triantafyllidis, Nicholas, Pedro Madruga, SuperWojo, Harry Royden McLaughlin, James Bentley, Olakabola, David Ziegler, Ai Maven, Jeff Scroggin, Nikolai Manek, Deo Leter, Matthew Berman, Fen Risland, Ken Nordquist, Manuel Alberto Morcote, Luke Pendergrass, TL, Fred von Graf, Randy H, Dan Guido, NimbleBox.ai, Vitor Caleffi, Gabriel Tamborski, knownsqashed, Lone Striker, Erik Bjäreholt, John Detwiler, Leonard Tan, Iucharbius

Thank you to all my generous patrons and donaters!

And thank you again to a16z for their generous grant.

Original model card: DeepSeek's Deepseek Coder 33B Instruct

[🏠Homepage] | [🤖 Chat with DeepSeek Coder] | [Discord] | [Wechat(微信)]

1. Introduction of Deepseek Coder

Deepseek Coder is composed of a series of code language models, each trained from scratch on 2T tokens, with a composition of 87% code and 13% natural language in both English and Chinese. We provide various sizes of the code model, ranging from 1B to 33B versions. Each model is pre-trained on project-level code corpus by employing a window size of 16K and a extra fill-in-the-blank task, to support project-level code completion and infilling. For coding capabilities, Deepseek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.

Massive Training Data : Trained from scratch on 2T tokens, including 87% code and 13% linguistic data in both English and Chinese languages.
Highly Flexible & Scalable : Offered in model sizes of 1.3B, 5.7B, 6.7B, and 33B, enabling users to choose the setup most suitable for their requirements.
Superior Model Performance : State-of-the-art performance among publicly available code models on HumanEval, MultiPL-E, MBPP, DS-1000, and APPS benchmarks.
Advanced Code Completion Capabilities : A window size of 16K and a fill-in-the-blank task, supporting project-level code completion and infilling tasks.

2. Model Summary

deepseek-coder-33b-instruct is a 33B parameter model initialized from deepseek-coder-33b-base and fine-tuned on 2B tokens of instruction data.

Home Page: DeepSeek
Repository: deepseek-ai/deepseek-coder
Chat With DeepSeek Coder: DeepSeek-Coder

3. How to Use

Here give some examples of how to use our model.

Chat Model Inference

from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-coder-33b-instruct", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("deepseek-ai/deepseek-coder-33b-instruct", trust_remote_code=True).cuda()
messages=[
    { 'role': 'user', 'content': "write a quick sort algorithm in python."}
]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
# 32021 is the id of <|EOT|> token
outputs = model.generate(inputs, max_new_tokens=512, do_sample=False, top_k=50, top_p=0.95, num_return_sequences=1, eos_token_id=32021)
print(tokenizer.decode(outputs[0][len(inputs[0]):], skip_special_tokens=True))

4. License

This code repository is licensed under the MIT License. The use of DeepSeek Coder models is subject to the Model License. DeepSeek Coder supports commercial use.

See the LICENSE-MODEL for more details.

5. Contact

If you have any questions, please raise an issue or contact us at agi_code@deepseek.com .

Runs of TheBloke deepseek-coder-33B-instruct-AWQ on huggingface.co

16.4K

Total runs

24-hour runs

142

3-day runs

281

7-day runs

14.8K

30-day runs

More Information About deepseek-coder-33B-instruct-AWQ huggingface.co Model

More deepseek-coder-33B-instruct-AWQ license Visit here:

https://choosealicense.com/licenses/deepseek

deepseek-coder-33B-instruct-AWQ huggingface.co

deepseek-coder-33B-instruct-AWQ huggingface.co is an AI model on huggingface.co that provides deepseek-coder-33B-instruct-AWQ's model effect (), which can be used instantly with this TheBloke deepseek-coder-33B-instruct-AWQ model. huggingface.co supports a free trial of the deepseek-coder-33B-instruct-AWQ model, and also provides paid use of the deepseek-coder-33B-instruct-AWQ. Support call deepseek-coder-33B-instruct-AWQ model through api, including Node.js, Python, http.

deepseek-coder-33B-instruct-AWQ huggingface.co Url

https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-AWQ

TheBloke deepseek-coder-33B-instruct-AWQ online free

deepseek-coder-33B-instruct-AWQ huggingface.co is an online trial and call api platform, which integrates deepseek-coder-33B-instruct-AWQ's modeling effects, including api services, and provides a free online trial of deepseek-coder-33B-instruct-AWQ, you can try deepseek-coder-33B-instruct-AWQ online for free by clicking the link below.

TheBloke deepseek-coder-33B-instruct-AWQ online free url in huggingface.co:

https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-AWQ

deepseek-coder-33B-instruct-AWQ install

deepseek-coder-33B-instruct-AWQ is an open source model from GitHub that offers a free installation service, and any user can find deepseek-coder-33B-instruct-AWQ on GitHub to install. At the same time, huggingface.co provides the effect of deepseek-coder-33B-instruct-AWQ install, users can directly use deepseek-coder-33B-instruct-AWQ installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

deepseek-coder-33B-instruct-AWQ install url in huggingface.co:

https://huggingface.co/TheBloke/deepseek-coder-33B-instruct-AWQ

huggingface.co

TheBloke/phi-2-GGUF

Total runs: 3.7M

Run Growth: 3.5M

Growth Rate: 95.73%

Updated: Décembre 18 2023

huggingface.co

TheBloke/Mistral-7B-Instruct-v0.2-GPTQ

Total runs: 491.4K

Run Growth: 0

Growth Rate: 0.00%

Updated: Décembre 11 2023

huggingface.co

TheBloke/Mistral-7B-Instruct-v0.1-GGUF

Total runs: 266.5K

Run Growth: 223.8K

Growth Rate: 85.90%

Updated: Décembre 09 2023

huggingface.co

TheBloke/deepseek-coder-6.7B-instruct-GGUF

Total runs: 88.6K

Run Growth: 79.6K

Growth Rate: 90.55%

Updated: Novembre 05 2023

huggingface.co

TheBloke/Mistral-7B-Instruct-v0.2-GGUF

Total runs: 87.8K

Run Growth: -3.7K

Growth Rate: -4.20%

Updated: Décembre 11 2023

huggingface.co

TheBloke/Llama-2-7B-Chat-GGUF

Total runs: 72.8K

Run Growth: 3.2K

Growth Rate: 4.45%

Updated: Octobre 14 2023

huggingface.co

TheBloke/deepseek-coder-33B-instruct-GGUF

Total runs: 61.7K

Run Growth: 55.0K

Growth Rate: 89.82%

Updated: Novembre 05 2023

huggingface.co

TheBloke/TinyLlama-1.1B-Chat-v1.0-GPTQ

Total runs: 37.6K

Run Growth: 0

Growth Rate: 0.00%

Updated: Décembre 31 2023

huggingface.co

TheBloke/deepseek-coder-1.3b-instruct-GGUF

Total runs: 36.0K

Run Growth: 22.9K

Growth Rate: 63.82%

Updated: Novembre 05 2023

huggingface.co

TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF

Total runs: 35.1K

Run Growth: 11.6K

Growth Rate: 33.08%

Updated: Décembre 31 2023

huggingface.co

TheBloke/Llama-2-7B-GGUF

Total runs: 34.8K

Run Growth: 23.7K

Growth Rate: 71.05%

Updated: Octobre 24 2023

huggingface.co

TheBloke/deepseek-llm-67b-chat-GGUF

Total runs: 34.7K

Run Growth: 31.5K

Growth Rate: 90.83%

Updated: Novembre 29 2023

huggingface.co

TheBloke/Mistral-7B-Instruct-v0.2-AWQ

Total runs: 32.3K

Run Growth: -46.3K

Growth Rate: -144.31%

Updated: Décembre 11 2023

huggingface.co

TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF

Total runs: 30.8K

Run Growth: -818

Growth Rate: -2.68%

Updated: Décembre 14 2023

huggingface.co

TheBloke/deepseek-llm-7B-chat-GGUF

Total runs: 28.8K

Run Growth: 25.0K

Growth Rate: 87.54%

Updated: Novembre 29 2023

huggingface.co

TheBloke/CausalLM-14B-GGUF

Total runs: 28.6K

Run Growth: 12.5K

Growth Rate: 42.19%

Updated: Octobre 23 2023

huggingface.co

TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ

Total runs: 27.4K

Run Growth: -71.5K

Growth Rate: -261.22%

Updated: Décembre 14 2023

huggingface.co

TheBloke/Platypus2-70B-Instruct-AWQ

Total runs: 27.0K

Run Growth: -9.0K

Growth Rate: -31.35%

Updated: Novembre 09 2023

huggingface.co

TheBloke/deepsex-34b-GGUF

Total runs: 24.1K

Run Growth: 22.9K

Growth Rate: 96.15%

Updated: Décembre 07 2023

huggingface.co

TheBloke/MythoMax-L2-13B-GGUF

Total runs: 22.8K

Run Growth: 13.4K

Growth Rate: 58.81%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Llama-2-7B-GPTQ

Total runs: 22.7K

Run Growth: -156.9K

Growth Rate: -702.04%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Mistral-7B-OpenOrca-GPTQ

Total runs: 20.0K

Run Growth: 12.9K

Growth Rate: 63.87%

Updated: Octobre 16 2023

huggingface.co

TheBloke/Llama-2-13B-chat-GPTQ

Total runs: 17.5K

Run Growth: -11.1K

Growth Rate: -63.45%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Llama-2-7B-Chat-GPTQ

Total runs: 17.2K

Run Growth: 5.7K

Growth Rate: 34.07%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Mistral-7B-OpenOrca-AWQ

Total runs: 16.9K

Run Growth: 13.2K

Growth Rate: 77.88%

Updated: Novembre 09 2023

huggingface.co

TheBloke/zephyr-7B-beta-GGUF

Total runs: 16.6K

Run Growth: -9.0K

Growth Rate: -52.73%

Updated: Octobre 27 2023

huggingface.co

TheBloke/Wizard-Vicuna-13B-Uncensored-GGUF

Total runs: 16.0K

Run Growth: 7.7K

Growth Rate: 48.28%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Mistral-7B-v0.1-GGUF

Total runs: 14.0K

Run Growth: 7.2K

Growth Rate: 53.30%

Updated: Septembre 28 2023

huggingface.co

TheBloke/Rogue-Rose-103b-v0.2-AWQ

Total runs: 13.7K

Run Growth: 6.0K

Growth Rate: 49.00%

Updated: Décembre 16 2023

huggingface.co

TheBloke/Mistral-7B-Instruct-v0.1-AWQ

Total runs: 12.8K

Run Growth: 7.7K

Growth Rate: 60.19%

Updated: Novembre 09 2023

huggingface.co

TheBloke/SOLAR-10.7B-Instruct-v1.0-uncensored-GGUF

Total runs: 12.7K

Run Growth: 1.7K

Growth Rate: 13.03%

Updated: Décembre 19 2023

huggingface.co

TheBloke/Noromaid-13B-v0.3-GGUF

Total runs: 11.6K

Run Growth: -7.5K

Growth Rate: -60.88%

Updated: Janvier 07 2024

huggingface.co

TheBloke/Luna-AI-Llama2-Uncensored-GGUF

Total runs: 11.3K

Run Growth: 2.5K

Growth Rate: 21.79%

Updated: Septembre 27 2023

huggingface.co

TheBloke/OpenHermes-2.5-Mistral-7B-GGUF

Total runs: 11.1K

Run Growth: 5.7K

Growth Rate: 51.37%

Updated: Novembre 02 2023

huggingface.co

TheBloke/dolphin-2.7-mixtral-8x7b-GGUF

Total runs: 10.8K

Run Growth: 1.5K

Growth Rate: 14.00%

Updated: Janvier 01 2024

huggingface.co

TheBloke/Llama-2-13B-chat-GGUF

Total runs: 9.0K

Run Growth: 0

Growth Rate: 0.00%

Updated: Septembre 27 2023

huggingface.co

TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF

Total runs: 8.9K

Run Growth: 1.7K

Growth Rate: 19.77%

Updated: Janvier 31 2024

huggingface.co

TheBloke/openchat_3.5-AWQ

Total runs: 8.8K

Run Growth: 8.6K

Growth Rate: 97.47%

Updated: Novembre 09 2023

huggingface.co

TheBloke/Emerhyst-20B-GGUF

Total runs: 8.7K

Run Growth: 7.9K

Growth Rate: 90.39%

Updated: Septembre 28 2023

huggingface.co

TheBloke/CodeLlama-7B-GGUF

Total runs: 8.7K

Run Growth: 2.4K

Growth Rate: 28.68%

Updated: Septembre 27 2023

huggingface.co

TheBloke/LlamaGuard-7B-AWQ

Total runs: 8.6K

Run Growth: 3.4K

Growth Rate: 45.96%

Updated: Décembre 11 2023

huggingface.co

TheBloke/Llama-2-7B-fp16

Total runs: 8.4K

Run Growth: 1.4K

Growth Rate: 23.53%

Updated: Août 27 2023

huggingface.co

TheBloke/dolphin-2.5-mixtral-8x7b-GGUF

Total runs: 8.3K

Run Growth: 1.9K

Growth Rate: 25.52%

Updated: Décembre 14 2023

huggingface.co

TheBloke/TinyLlama-1.1B-Chat-v0.3-AWQ

Total runs: 8.2K

Run Growth: 2.5K

Growth Rate: 31.78%

Updated: Novembre 09 2023

huggingface.co

TheBloke/CausalLM-7B-GGUF

Total runs: 7.7K

Run Growth: 2.8K

Growth Rate: 36.88%

Updated: Octobre 23 2023

huggingface.co

TheBloke/TinyLlama-1.1B-Chat-v0.3-GGUF

Total runs: 7.7K

Run Growth: 23

Growth Rate: 0.31%

Updated: Octobre 03 2023

huggingface.co

TheBloke/rocket-3B-GGUF

Total runs: 7.7K

Run Growth: 3.3K

Growth Rate: 42.01%

Updated: Novembre 23 2023

huggingface.co

TheBloke/Llama-2-7B-Chat-AWQ

Total runs: 7.5K

Run Growth: 3.7K

Growth Rate: 50.44%

Updated: Novembre 09 2023

huggingface.co

TheBloke/Mistral-7B-OpenOrca-GGUF

Total runs: 7.4K

Run Growth: 3.6K

Growth Rate: 50.51%

Updated: Octobre 02 2023

huggingface.co

TheBloke/MythoMax-L2-Kimiko-v2-13B-GGUF

Total runs: 6.9K

Run Growth: 1.6K

Growth Rate: 22.59%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Llama-2-13B-fp16

Total runs: 6.9K

Run Growth: -231

Growth Rate: -3.58%

Updated: Juillet 20 2023

huggingface.co

TheBloke/Llama-2-70B-Chat-AWQ

Total runs: 6.8K

Run Growth: 3.4K

Growth Rate: 49.27%

Updated: Novembre 09 2023

huggingface.co

TheBloke/deepseek-coder-6.7B-base-AWQ

Total runs: 6.8K

Run Growth: -3.1K

Growth Rate: -45.67%

Updated: Novembre 09 2023

huggingface.co

TheBloke/Mixtral-8x7B-v0.1-GGUF

Total runs: 6.7K

Run Growth: 3.1K

Growth Rate: 45.94%

Updated: Décembre 14 2023

huggingface.co

TheBloke/Llama-2-13B-chat-AWQ

Total runs: 6.7K

Run Growth: 2.4K

Growth Rate: 35.89%

Updated: Novembre 09 2023

huggingface.co

TheBloke/Utopia-13B-GGUF

Total runs: 6.6K

Run Growth: 6.3K

Growth Rate: 94.69%

Updated: Novembre 03 2023

huggingface.co

TheBloke/deepseek-coder-6.7B-base-GGUF

Total runs: 6.6K

Run Growth: 4.0K

Growth Rate: 60.80%

Updated: Novembre 05 2023

huggingface.co

TheBloke/Llama-2-70B-Chat-GGUF

Total runs: 6.6K

Run Growth: 0

Growth Rate: 0.00%

Updated: Novembre 21 2023

huggingface.co

TheBloke/CodeLlama-13B-GGUF

Total runs: 6.5K

Run Growth: 2.8K

Growth Rate: 42.91%

Updated: Septembre 27 2023

huggingface.co

TheBloke/deepseek-llm-7B-base-GGUF

Total runs: 6.5K

Run Growth: 6.3K

Growth Rate: 97.13%

Updated: Novembre 29 2023

huggingface.co

TheBloke/Llama-2-70B-Chat-GPTQ

Total runs: 6.5K

Run Growth: 211

Growth Rate: 4.12%

Updated: Septembre 27 2023

huggingface.co

TheBloke/WizardLM-1.0-Uncensored-Llama2-13B-GGUF

Total runs: 6.4K

Run Growth: 184

Growth Rate: 2.89%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Wizard-Vicuna-30B-Uncensored-GGUF

Total runs: 6.3K

Run Growth: -288

Growth Rate: -4.55%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Silicon-Maid-7B-GGUF

Total runs: 6.1K

Run Growth: 4.4K

Growth Rate: 70.76%

Updated: Décembre 27 2023

huggingface.co

TheBloke/dolphin-2.7-mixtral-8x7b-AWQ

Total runs: 6.1K

Run Growth: -4.4K

Growth Rate: -72.87%

Updated: Janvier 01 2024

huggingface.co

TheBloke/CodeLlama-7B-Instruct-GGUF

Total runs: 6.1K

Run Growth: 548

Growth Rate: 9.09%

Updated: Septembre 27 2023

huggingface.co

TheBloke/airoboros-mistral2.2-7B-GGUF

Total runs: 6.1K

Run Growth: 3.1K

Growth Rate: 52.77%

Updated: Octobre 03 2023

huggingface.co

TheBloke/CodeLlama-7B-Python-GGUF

Total runs: 6.0K

Run Growth: 1.1K

Growth Rate: 18.49%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Open_Gpt4_8x7B_v0.2-GGUF

Total runs: 5.8K

Run Growth: 0

Growth Rate: 0.00%

Updated: Janvier 12 2024

huggingface.co

TheBloke/meditron-7B-AWQ

Total runs: 5.8K

Run Growth: -77.6K

Growth Rate: -2322.53%

Updated: Novembre 30 2023

huggingface.co

TheBloke/Rose-20B-GGUF

Total runs: 5.6K

Run Growth: 5.1K

Growth Rate: 93.19%

Updated: Novembre 24 2023

huggingface.co

TheBloke/CodeLlama-13B-Instruct-GGUF

Total runs: 5.5K

Run Growth: 951

Growth Rate: 17.26%

Updated: Septembre 27 2023

huggingface.co

TheBloke/dolphin-2.2.1-mistral-7B-GGUF

Total runs: 5.3K

Run Growth: 2.1K

Growth Rate: 39.17%

Updated: Octobre 30 2023

huggingface.co

TheBloke/Open_Gpt4_8x7B-GGUF

Total runs: 5.0K

Run Growth: 53

Growth Rate: 1.06%

Updated: Janvier 05 2024

huggingface.co

TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF

Total runs: 5.0K

Run Growth: -521

Growth Rate: -10.25%

Updated: Septembre 27 2023

huggingface.co

TheBloke/zephyr-7B-beta-AWQ

Total runs: 5.0K

Run Growth: 1.4K

Growth Rate: 29.08%

Updated: Novembre 09 2023

huggingface.co

TheBloke/OpenHermes-2.5-Mistral-7B-AWQ

Total runs: 4.9K

Run Growth: 1.9K

Growth Rate: 39.79%

Updated: Novembre 09 2023

huggingface.co

TheBloke/llama2_70b_chat_uncensored-GGUF

Total runs: 4.9K

Run Growth: -967

Growth Rate: -19.41%

Updated: Septembre 27 2023

huggingface.co

TheBloke/dolphin-2.6-mistral-7B-GGUF

Total runs: 4.9K

Run Growth: 2.1K

Growth Rate: 43.40%

Updated: Décembre 28 2023

huggingface.co

TheBloke/TinyLlama-1.1B-Chat-v0.3-GPTQ

Total runs: 4.7K

Run Growth: 0

Growth Rate: 0.00%

Updated: Octobre 03 2023

huggingface.co

TheBloke/claude2-alpaca-13B-GGUF

Total runs: 4.7K

Run Growth: 2.8K

Growth Rate: 60.03%

Updated: Novembre 10 2023

huggingface.co

TheBloke/Nous-Hermes-2-Mixtral-8x7B-DPO-GPTQ

Total runs: 4.6K

Run Growth: 2.2K

Growth Rate: 89.73%

Updated: Janvier 16 2024

huggingface.co

TheBloke/wizardLM-7B-HF

Total runs: 4.5K

Run Growth: 3.3K

Growth Rate: 71.37%

Updated: Juin 05 2023

huggingface.co

TheBloke/deepseek-coder-1.3b-base-GGUF

Total runs: 4.4K

Run Growth: 3.9K

Growth Rate: 89.89%

Updated: Novembre 05 2023

huggingface.co

TheBloke/Mistral-7B-Instruct-v0.1-GPTQ

Total runs: 4.4K

Run Growth: 1.3K

Growth Rate: 30.90%

Updated: Septembre 29 2023

huggingface.co

TheBloke/em_german_mistral_v01-GGUF

Total runs: 4.3K

Run Growth: 167

Growth Rate: 4.82%

Updated: Octobre 10 2023

huggingface.co

TheBloke/koala-13B-HF

Total runs: 4.2K

Run Growth: 2.4K

Growth Rate: 58.91%

Updated: Juin 05 2023

huggingface.co

TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ

Total runs: 4.2K

Run Growth: 240

Growth Rate: 5.81%

Updated: Septembre 27 2023

huggingface.co

TheBloke/dolphin-2.1-mistral-7B-GGUF

Total runs: 4.1K

Run Growth: 1.4K

Growth Rate: 35.26%

Updated: Octobre 22 2023

huggingface.co

TheBloke/NeuralBeagle14-7B-GPTQ

Total runs: 4.1K

Run Growth: -7.2K

Growth Rate: -116.41%

Updated: Janvier 17 2024

huggingface.co

TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ

Total runs: 4.0K

Run Growth: 275

Growth Rate: 6.99%

Updated: Septembre 27 2023

huggingface.co

TheBloke/Mistral-7B-Claude-Chat-GGUF

Total runs: 4.0K

Run Growth: 1.7K

Growth Rate: 42.49%

Updated: Octobre 28 2023

huggingface.co

TheBloke/koala-7B-HF

Total runs: 4.0K

Run Growth: 3.1K

Growth Rate: 80.33%

Updated: Juin 05 2023

huggingface.co

TheBloke/WhiteRabbitNeo-13B-GGUF

Total runs: 3.8K

Run Growth: 2.5K

Growth Rate: 65.53%

Updated: Décembre 21 2023

huggingface.co

TheBloke/zephyr-7B-beta-GPTQ

Total runs: 3.8K

Run Growth: 1.4K

Growth Rate: 37.37%

Updated: Octobre 27 2023

huggingface.co

TheBloke/Yarn-Mistral-7B-128k-GGUF

Total runs: 3.6K

Run Growth: 1.1K

Growth Rate: 32.56%

Updated: Novembre 02 2023

huggingface.co

TheBloke/Phind-CodeLlama-34B-v2-GGUF

Total runs: 3.6K

Run Growth: 739

Growth Rate: 20.50%

Updated: Septembre 27 2023

huggingface.co

TheBloke/WizardCoder-Python-7B-V1.0-GGUF

Total runs: 3.6K

Run Growth: 2.0K

Growth Rate: 58.21%

Updated: Septembre 27 2023

huggingface.co

TheBloke/CodeLlama-34B-GGUF

Total runs: 3.5K

Run Growth: -332

Growth Rate: -9.96%

Updated: Septembre 27 2023

huggingface.co

TheBloke/WizardLM-13B-V1.2-GGUF

Total runs: 3.4K

Run Growth: 612

Growth Rate: 17.80%

Updated: Septembre 27 2023

TheBloke / deepseek-coder-33B-instruct-AWQ

Introduction of deepseek-coder-33B-instruct-AWQ

Model Details of deepseek-coder-33B-instruct-AWQ

Deepseek Coder 33B Instruct - AWQ

Description

About AWQ

Repositories available

Prompt template: DeepSeek

Provided files, and AWQ parameters

How to easily download and use this model in text-generation-webui

Multi-user inference server: vLLM

Multi-user inference server: Hugging Face Text Generation Inference (TGI)

Inference from Python code using AutoAWQ

Install the AutoAWQ package

AutoAWQ example code

Compatibility

Discord

Thanks, and how to contribute

Original model card: DeepSeek's Deepseek Coder 33B Instruct

1. Introduction of Deepseek Coder

2. Model Summary

3. How to Use

Chat Model Inference

4. License

5. Contact

Runs of TheBloke deepseek-coder-33B-instruct-AWQ on huggingface.co

More Information About deepseek-coder-33B-instruct-AWQ huggingface.co Model

More deepseek-coder-33B-instruct-AWQ license Visit here:

deepseek-coder-33B-instruct-AWQ huggingface.co

deepseek-coder-33B-instruct-AWQ huggingface.co Url

TheBloke deepseek-coder-33B-instruct-AWQ online free

TheBloke deepseek-coder-33B-instruct-AWQ online free url in huggingface.co:

deepseek-coder-33B-instruct-AWQ install

deepseek-coder-33B-instruct-AWQ install url in huggingface.co:

Url of deepseek-coder-33B-instruct-AWQ

deepseek-coder-33B-instruct-AWQ huggingface.co Url

Provider of deepseek-coder-33B-instruct-AWQ huggingface.co

Other API from TheBloke