KoMiniLM-68M huggingface.co api & BM-K KoMiniLM-68M github AI Model

Introduction of KoMiniLM-68M

Model Details of KoMiniLM-68M

KoMiniLM

🐣 Korean mini language model

Overview

Current language models usually consist of hundreds of millions of parameters which brings challenges for fine-tuning and online serving in real-life applications due to latency and capacity constraints. In this project, we release a light weight korean language model to address the aforementioned shortcomings of existing language models.

Quick tour

from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("BM-K/KoMiniLM-68M") # 68M model
model = AutoModel.from_pretrained("BM-K/KoMiniLM-68M")

inputs = tokenizer("안녕 세상아!", return_tensors="pt")
outputs = model(**inputs)

Update history

** Updates on 2022.06.20 **

Release KoMiniLM-bert-68M

** Updates on 2022.05.24 **

Release KoMiniLM-bert-23M

Pre-training

Teacher Model : KLUE-BERT(base)

Object

Self-Attention Distribution and Self-Attention Value-Relation [Wang et al., 2020] were distilled from each discrete layer of the teacher model to the student model. Wang et al. distilled in the last layer of the transformer, but that was not the case in this project.

Data sets

Data	News comments	News article
size	10G	10G

Config

KoMiniLM-68M

{
  "architectures": [
    "BertForPreTraining"
  ],
  "attention_probs_dropout_prob": 0.1,
  "classifier_dropout": null,
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "initializer_range": 0.02,
  "intermediate_size": 3072,
  "layer_norm_eps": 1e-12,
  "max_position_embeddings": 512,
  "model_type": "bert",
  "num_attention_heads": 12,
  "num_hidden_layers": 6,
  "output_attentions": true,
  "pad_token_id": 0,
  "position_embedding_type": "absolute",
  "return_dict": false,
  "torch_dtype": "float32",
  "transformers_version": "4.13.0",
  "type_vocab_size": 2,
  "use_cache": true,
  "vocab_size": 32000
}

Performance on subtasks

The results of our fine-tuning experiments are an average of 3 runs for each task.

cd KoMiniLM-Finetune
bash scripts/run_all_kominilm.sh

	#Param	Average	NSMC (Acc)	Naver NER (F1)	PAWS (Acc)	KorNLI (Acc)	KorSTS (Spearman)	Question Pair (Acc)	KorQuaD (Dev) (EM/F1)
KoBERT(KLUE)	110M	86.84	90.20±0.07	87.11±0.05	81.36±0.21	81.06±0.33	82.47±0.14	95.03±0.44	84.43±0.18 / 93.05±0.04
KcBERT	108M	78.94	89.60±0.10	84.34±0.13	67.02±0.42	74.17±0.52	76.57±0.51	93.97±0.27	60.87±0.27 / 85.01±0.14
KoBERT(SKT)	92M	79.73	89.28±0.42	87.54±0.04	80.93±0.91	78.18±0.45	75.98±2.81	94.37±0.31	51.94±0.60 / 79.69±0.66
DistilKoBERT	28M	74.73	88.39±0.08	84.22±0.01	61.74±0.45	70.22±0.14	72.11±0.27	92.65±0.16	52.52±0.48 / 76.00±0.71

KoMiniLM ^†	68M	85.90	89.84±0.02	85.98±0.09	80.78±0.30	79.28±0.17	81.00±0.07	94.89±0.37	83.27±0.08 / 92.08±0.06
KoMiniLM ^†	23M	84.79	89.67±0.03	84.79±0.09	78.67±0.45	78.10±0.07	78.90±0.11	94.81±0.12	82.11±0.42 / 91.21±0.29

NSMC (Naver Sentiment Movie Corpus)
Naver NER (NER task on Naver NLP Challenge 2018)
PAWS (Korean Paraphrase Adversaries from Word Scrambling)
KorNLI/KorSTS (Korean Natural Language Understanding)
Question Pair (Paired Question)
KorQuAD (The Korean Question Answering Dataset)

User Contributed Examples

Runs of BM-K KoMiniLM-68M on huggingface.co

119

Total runs

-2

24-hour runs

-1

3-day runs

103

7-day runs

105

30-day runs

More Information About KoMiniLM-68M huggingface.co Model

KoMiniLM-68M huggingface.co

KoMiniLM-68M huggingface.co is an AI model on huggingface.co that provides KoMiniLM-68M's model effect (), which can be used instantly with this BM-K KoMiniLM-68M model. huggingface.co supports a free trial of the KoMiniLM-68M model, and also provides paid use of the KoMiniLM-68M. Support call KoMiniLM-68M model through api, including Node.js, Python, http.

KoMiniLM-68M huggingface.co Url

https://huggingface.co/BM-K/KoMiniLM-68M

BM-K KoMiniLM-68M online free

KoMiniLM-68M huggingface.co is an online trial and call api platform, which integrates KoMiniLM-68M's modeling effects, including api services, and provides a free online trial of KoMiniLM-68M, you can try KoMiniLM-68M online for free by clicking the link below.

BM-K KoMiniLM-68M online free url in huggingface.co:

https://huggingface.co/BM-K/KoMiniLM-68M

KoMiniLM-68M install

KoMiniLM-68M is an open source model from GitHub that offers a free installation service, and any user can find KoMiniLM-68M on GitHub to install. At the same time, huggingface.co provides the effect of KoMiniLM-68M install, users can directly use KoMiniLM-68M installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

KoMiniLM-68M install url in huggingface.co:

https://huggingface.co/BM-K/KoMiniLM-68M

huggingface.co

BM-K/KoSimCSE-roberta-multitask

Total runs: 351.1K

Run Growth: 330.4K

Growth Rate: 94.09%

Updated: March 24 2023

huggingface.co

BM-K/KoDiffCSE-RoBERTa

Total runs: 27.1K

Run Growth: 3.0K

Growth Rate: 10.70%

Updated: August 30 2023

huggingface.co

BM-K/KoSimCSE-roberta

Total runs: 6.0K

Run Growth: -347

Growth Rate: -5.88%

Updated: March 24 2023

huggingface.co

BM-K/mistral-7b-it-v1.7.0

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: November 20 2023

huggingface.co

BM-K/yi-ko-6b-it-v1.0.0

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: December 05 2023

huggingface.co

BM-K/stupid_model

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: January 02 2024

huggingface.co

BM-K/mistral-ko-7b-it-v2.0.1

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: December 26 2023

huggingface.co

BM-K/mistral-7b-it-v1.7.1

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: November 20 2023

huggingface.co

BM-K/llama-2-ko-7b-it-v1.0.0

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: November 15 2023

huggingface.co

BM-K/KoSimCSE-bert

Total runs: 1.0K

Run Growth: -45

Growth Rate: -4.42%

Updated: August 30 2023

huggingface.co

BM-K/KoChatBART

Total runs: 487

Run Growth: 289

Growth Rate: 74.29%

Updated: April 26 2023

huggingface.co

BM-K/KoSimCSE-bert-multitask

Total runs: 218

Run Growth: 21

Growth Rate: 9.42%

Updated: April 26 2023

huggingface.co

BM-K/KoSimCSE-Unsup-RoBERTa

Total runs: 159

Run Growth: 154

Growth Rate: 96.25%

Updated: February 19 2025

huggingface.co

BM-K/KoSimCSE-Unsup-BERT

Total runs: 112

Run Growth: 107

Growth Rate: 95.54%

Updated: February 19 2025

huggingface.co

BM-K/KoMiniLM

Total runs: 32

Run Growth: -99

Growth Rate: -300.00%

Updated: August 30 2023

huggingface.co

BM-K/EXAONE-3.0-7.8B-Daily-Conversation-Summary

Total runs: 27

Run Growth: 16

Growth Rate: 59.26%

Updated: September 04 2024

huggingface.co

BM-K/NewsKoT5-small

Total runs: 12

Run Growth: -93

Growth Rate: -775.00%

Updated: August 30 2023

huggingface.co

BM-K/llama-2-ko-13b-qlora-it-v1.0.0

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: November 15 2023

BM-K / KoMiniLM-68M

Introduction of KoMiniLM-68M

Model Details of KoMiniLM-68M

KoMiniLM

Overview

Quick tour

Update history

Pre-training

Object

Data sets

Config

Performance on subtasks

User Contributed Examples

Reference

Runs of BM-K KoMiniLM-68M on huggingface.co

More Information About KoMiniLM-68M huggingface.co Model

KoMiniLM-68M huggingface.co

KoMiniLM-68M huggingface.co Url

BM-K KoMiniLM-68M online free

BM-K KoMiniLM-68M online free url in huggingface.co:

KoMiniLM-68M install

KoMiniLM-68M install url in huggingface.co:

Url of KoMiniLM-68M

KoMiniLM-68M huggingface.co Url

Provider of KoMiniLM-68M huggingface.co

Other API from BM-K