KoMiniLM huggingface.co api & BM-K KoMiniLM github AI Model

Introduction of KoMiniLM

Model Details of KoMiniLM

KoMiniLM

🐣 Korean mini language model

Overview

Current language models usually consist of hundreds of millions of parameters which brings challenges for fine-tuning and online serving in real-life applications due to latency and capacity constraints. In this project, we release a light weight korean language model to address the aforementioned shortcomings of existing language models.

Quick tour

from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("BM-K/KoMiniLM") # 23M model
model = AutoModel.from_pretrained("BM-K/KoMiniLM")

inputs = tokenizer("안녕 세상아!", return_tensors="pt")
outputs = model(**inputs)

Update history

** Updates on 2022.06.20 **

Release KoMiniLM-bert-68M

** Updates on 2022.05.24 **

Release KoMiniLM-bert-23M

Pre-training

Teacher Model : KLUE-BERT(base)

Object

Self-Attention Distribution and Self-Attention Value-Relation [Wang et al., 2020] were distilled from each discrete layer of the teacher model to the student model. Wang et al. distilled in the last layer of the transformer, but that was not the case in this project.

Data sets

Data	News comments	News article
size	10G	10G

Config

KoMiniLM-23M

{
  "architectures": [
    "BertForPreTraining"
  ],
  "attention_probs_dropout_prob": 0.1,
  "classifier_dropout": null,
  "hidden_act": "gelu",
  "hidden_dropout_prob": 0.1,
  "hidden_size": 384,
  "initializer_range": 0.02,
  "intermediate_size": 1536,
  "layer_norm_eps": 1e-12,
  "max_position_embeddings": 512,
  "model_type": "bert",
  "num_attention_heads": 12,
  "num_hidden_layers": 6,
  "output_attentions": true,
  "pad_token_id": 0,
  "position_embedding_type": "absolute",
  "return_dict": false,
  "torch_dtype": "float32",
  "transformers_version": "4.13.0",
  "type_vocab_size": 2,
  "use_cache": true,
  "vocab_size": 32000
}

Performance on subtasks

The results of our fine-tuning experiments are an average of 3 runs for each task.

cd KoMiniLM-Finetune
bash scripts/run_all_kominilm.sh

	#Param	Average	NSMC (Acc)	Naver NER (F1)	PAWS (Acc)	KorNLI (Acc)	KorSTS (Spearman)	Question Pair (Acc)	KorQuaD (Dev) (EM/F1)
KoBERT(KLUE)	110M	86.84	90.20±0.07	87.11±0.05	81.36±0.21	81.06±0.33	82.47±0.14	95.03±0.44	84.43±0.18 / 93.05±0.04
KcBERT	108M	78.94	89.60±0.10	84.34±0.13	67.02±0.42	74.17±0.52	76.57±0.51	93.97±0.27	60.87±0.27 / 85.01±0.14
KoBERT(SKT)	92M	79.73	89.28±0.42	87.54±0.04	80.93±0.91	78.18±0.45	75.98±2.81	94.37±0.31	51.94±0.60 / 79.69±0.66
DistilKoBERT	28M	74.73	88.39±0.08	84.22±0.01	61.74±0.45	70.22±0.14	72.11±0.27	92.65±0.16	52.52±0.48 / 76.00±0.71

KoMiniLM ^†	68M	85.90	89.84±0.02	85.98±0.09	80.78±0.30	79.28±0.17	81.00±0.07	94.89±0.37	83.27±0.08 / 92.08±0.06
KoMiniLM ^†	23M	84.79	89.67±0.03	84.79±0.09	78.67±0.45	78.10±0.07	78.90±0.11	94.81±0.12	82.11±0.42 / 91.21±0.29

NSMC (Naver Sentiment Movie Corpus)
Naver NER (NER task on Naver NLP Challenge 2018)
PAWS (Korean Paraphrase Adversaries from Word Scrambling)
KorNLI/KorSTS (Korean Natural Language Understanding)
Question Pair (Paired Question)
KorQuAD (The Korean Question Answering Dataset)

User Contributed Examples

Runs of BM-K KoMiniLM on huggingface.co

Total runs

-102

24-hour runs

-105

3-day runs

-106

7-day runs

-99

30-day runs

More Information About KoMiniLM huggingface.co Model

KoMiniLM huggingface.co

KoMiniLM huggingface.co is an AI model on huggingface.co that provides KoMiniLM's model effect (), which can be used instantly with this BM-K KoMiniLM model. huggingface.co supports a free trial of the KoMiniLM model, and also provides paid use of the KoMiniLM. Support call KoMiniLM model through api, including Node.js, Python, http.

KoMiniLM huggingface.co Url

https://huggingface.co/BM-K/KoMiniLM

BM-K KoMiniLM online free

KoMiniLM huggingface.co is an online trial and call api platform, which integrates KoMiniLM's modeling effects, including api services, and provides a free online trial of KoMiniLM, you can try KoMiniLM online for free by clicking the link below.

BM-K KoMiniLM online free url in huggingface.co:

https://huggingface.co/BM-K/KoMiniLM

KoMiniLM install

KoMiniLM is an open source model from GitHub that offers a free installation service, and any user can find KoMiniLM on GitHub to install. At the same time, huggingface.co provides the effect of KoMiniLM install, users can directly use KoMiniLM installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

KoMiniLM install url in huggingface.co:

https://huggingface.co/BM-K/KoMiniLM

huggingface.co

BM-K/KoSimCSE-roberta-multitask

Total runs: 351.1K

Run Growth: 330.4K

Growth Rate: 94.09%

Updated: March 24 2023

huggingface.co

BM-K/KoDiffCSE-RoBERTa

Total runs: 27.1K

Run Growth: 3.0K

Growth Rate: 10.70%

Updated: August 30 2023

huggingface.co

BM-K/KoSimCSE-roberta

Total runs: 6.0K

Run Growth: -347

Growth Rate: -5.88%

Updated: March 24 2023

huggingface.co

BM-K/mistral-7b-it-v1.7.0

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: November 20 2023

huggingface.co

BM-K/yi-ko-6b-it-v1.0.0

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: December 05 2023

huggingface.co

BM-K/stupid_model

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: January 02 2024

huggingface.co

BM-K/mistral-ko-7b-it-v2.0.1

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: December 26 2023

huggingface.co

BM-K/mistral-7b-it-v1.7.1

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: November 20 2023

huggingface.co

BM-K/llama-2-ko-7b-it-v1.0.0

Total runs: 2.3K

Run Growth: 0

Growth Rate: 0.00%

Updated: November 15 2023

huggingface.co

BM-K/KoSimCSE-bert

Total runs: 1.0K

Run Growth: -45

Growth Rate: -4.42%

Updated: August 30 2023

huggingface.co

BM-K/KoChatBART

Total runs: 487

Run Growth: 289

Growth Rate: 74.29%

Updated: April 26 2023

huggingface.co

BM-K/KoSimCSE-bert-multitask

Total runs: 218

Run Growth: 21

Growth Rate: 9.42%

Updated: April 26 2023

huggingface.co

BM-K/KoSimCSE-Unsup-RoBERTa

Total runs: 159

Run Growth: 154

Growth Rate: 96.25%

Updated: February 19 2025

huggingface.co

BM-K/KoMiniLM-68M

Total runs: 119

Run Growth: 105

Growth Rate: 88.24%

Updated: March 24 2023

huggingface.co

BM-K/KoSimCSE-Unsup-BERT

Total runs: 112

Run Growth: 107

Growth Rate: 95.54%

Updated: February 19 2025

huggingface.co

BM-K/EXAONE-3.0-7.8B-Daily-Conversation-Summary

Total runs: 27

Run Growth: 16

Growth Rate: 59.26%

Updated: September 04 2024

huggingface.co

BM-K/NewsKoT5-small

Total runs: 12

Run Growth: -93

Growth Rate: -775.00%

Updated: August 30 2023

huggingface.co

BM-K/llama-2-ko-13b-qlora-it-v1.0.0

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated: November 15 2023

BM-K / KoMiniLM

Introduction of KoMiniLM

Model Details of KoMiniLM

KoMiniLM

Overview

Quick tour

Update history

Pre-training

Object

Data sets

Config

Performance on subtasks

User Contributed Examples

Reference

Runs of BM-K KoMiniLM on huggingface.co

More Information About KoMiniLM huggingface.co Model

KoMiniLM huggingface.co

KoMiniLM huggingface.co Url

BM-K KoMiniLM online free

BM-K KoMiniLM online free url in huggingface.co:

KoMiniLM install

KoMiniLM install url in huggingface.co:

Url of KoMiniLM

KoMiniLM huggingface.co Url

Provider of KoMiniLM huggingface.co

Other API from BM-K