BM-K / KoDiffCSE-RoBERTa

Last updated: August 30, 2023
Pipeline tag: feature-extraction


KoDiffCSE

Difference-based Contrastive Learning for Korean Sentence Embeddings

Quick tour
import torch
from transformers import AutoModel, AutoTokenizer

def cal_score(a, b):
    # Cosine similarity between two embeddings, scaled to a 0-100 score.
    if len(a.shape) == 1: a = a.unsqueeze(0)
    if len(b.shape) == 1: b = b.unsqueeze(0)

    a_norm = a / a.norm(dim=1)[:, None]
    b_norm = b / b.norm(dim=1)[:, None]
    return torch.mm(a_norm, b_norm.transpose(0, 1)) * 100

model = AutoModel.from_pretrained('BM-K/KoDiffCSE-RoBERTa')
tokenizer = AutoTokenizer.from_pretrained('BM-K/KoDiffCSE-RoBERTa')

sentences = ['치타가 들판을 가로 질러 먹이를 쫓는다.',
             '치타 한 마리가 먹이 뒤에서 달리고 있다.',
             '원숭이 한 마리가 드럼을 연주한다.']

inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
embeddings, _ = model(**inputs, return_dict=False)  # last_hidden_state; embeddings[i][0] is the [CLS] vector of sentence i

score01 = cal_score(embeddings[0][0], embeddings[1][0])  # 84.56
# '치타가 들판을 가로 질러 먹이를 쫓는다.' @ '치타 한 마리가 먹이 뒤에서 달리고 있다.'
score02 = cal_score(embeddings[0][0], embeddings[2][0])  # 48.06
# '치타가 들판을 가로 질러 먹이를 쫓는다.' @ '원숭이 한 마리가 드럼을 연주한다.'
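
The same pattern extends to simple semantic search over a small corpus. A minimal sketch using the same checkpoint; the query and corpus sentences are the illustrative examples from above, and cosine similarity is computed directly with PyTorch rather than via cal_score:

import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained('BM-K/KoDiffCSE-RoBERTa')
tokenizer = AutoTokenizer.from_pretrained('BM-K/KoDiffCSE-RoBERTa')
model.eval()

query = '치타가 들판을 가로 질러 먹이를 쫓는다.'
corpus = ['치타 한 마리가 먹이 뒤에서 달리고 있다.',
          '원숭이 한 마리가 드럼을 연주한다.']

with torch.no_grad():
    inputs = tokenizer([query] + corpus, padding=True, truncation=True, return_tensors="pt")
    hidden, _ = model(**inputs, return_dict=False)

cls = hidden[:, 0]                                     # [CLS] embedding per sentence
scores = F.cosine_similarity(cls[:1], cls[1:]) * 100   # query vs. each corpus sentence
best = scores.argmax().item()
print(corpus[best], round(scores[best].item(), 2))     # expected to pick the cheetah sentence
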
Setups

Python, PyTorch

Encoder Models

Baseline encoders used for Korean sentence embedding - KLUE-PLMs

Model             | Embedding size | Hidden size | # Layers | # Heads
KLUE-BERT-base    | 768            | 768         | 12       | 12
KLUE-RoBERTa-base | 768            | 768         | 12       | 12
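
These dimensions can be checked directly from the published KLUE checkpoints with transformers; a small sketch (Hub identifiers klue/bert-base and klue/roberta-base are assumed; the latter appears in the training command below):

from transformers import AutoConfig

# Print hidden size, layer count, and head count for each KLUE baseline.
for name in ['klue/bert-base', 'klue/roberta-base']:
    cfg = AutoConfig.from_pretrained(name)
    print(name, cfg.hidden_size, cfg.num_hidden_layers, cfg.num_attention_heads)
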

Warning
Large pre-trained models need a lot of GPU memory to train

Datasets

The data files must be placed in the folder given by --path_to_data.
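
Based on the file names in the training and evaluation commands below, the folder is expected to contain roughly the following (the descriptions are assumptions; the STS splits presumably follow the KorSTS data cited in the references):

Dataset/
├── wiki_corpus_examples.txt   # unsupervised training corpus (plain sentences)
├── valid_sts.tsv              # STS validation pairs with gold similarity scores
└── test_sts.tsv               # STS test pairs used for evaluation
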

Training - unsupervised
python main.py \
    --model klue/roberta-base \
    --generator_name klue/roberta-small \
    --multi_gpu True \
    --train True \
    --test False \
    --max_len 64 \
    --batch_size 256 \
    --epochs 1 \
    --eval_steps 125 \
    --lr 0.00005 \
    --masking_ratio 0.15 \
    --lambda_weight 0.005 \
    --warmup_ratio 0.05 \
    --temperature 0.05 \
    --path_to_data Dataset/ \
    --train_data wiki_corpus_examples.txt \
    --valid_data valid_sts.tsv \
    --ckpt best_checkpoint.pt
Alternatively:
bash run_diff.sh
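
For orientation, --temperature, --lambda_weight, and --masking_ratio map onto the DiffCSE objective: a SimCSE-style contrastive loss over in-batch pairs (scaled by the temperature) plus a replaced-token-detection (RTD) loss on tokens the small generator has corrupted (the masking ratio controls how many), weighted by lambda. A minimal PyTorch sketch of how the two terms combine; the names are illustrative, not the repository's actual code:

import torch
import torch.nn.functional as F

def diffcse_objective(z1, z2, rtd_logits, rtd_labels,
                      temperature=0.05, lambda_weight=0.005):
    # z1, z2: (batch, hidden) embeddings of the same sentences from two dropout passes
    # rtd_logits, rtd_labels: per-token "was this token replaced?" predictions and 0/1 targets
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    sim = z1 @ z2.T / temperature                       # in-batch cosine similarity matrix
    targets = torch.arange(z1.size(0), device=z1.device)
    contrastive = F.cross_entropy(sim, targets)         # SimCSE-style contrastive term
    rtd = F.binary_cross_entropy_with_logits(rtd_logits, rtd_labels.float())
    return contrastive + lambda_weight * rtd            # combined DiffCSE loss
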

Note
Using RoBERTa as the encoder is preferable for training, because DiffCSE needs a separate small generator (here klue/roberta-small) and no such small-sized counterpart exists for the KoBERT model.

Evaluation
python main.py \
    --model klue/roberta-base \
    --generator klue/roberta-small \
    --train False \
    --test True \
    --max_len 64 \
    --batch_size 256 \
    --path_to_data Dataset/ \
    --test_data test_sts.tsv \
    --path_to_saved_model output/best_checkpoint.pt
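
The scores in the table below are Pearson/Spearman correlations (×100) between gold STS labels and model similarity under each distance measure. A minimal sketch of the cosine columns, assuming embeddings for the two sides of each test pair have already been computed (scipy is used for the correlations):

import numpy as np
from scipy import stats

def cosine_sts_metrics(emb_a, emb_b, gold):
    # emb_a, emb_b: (n, hidden) arrays for the first/second sentence of each test pair
    # gold: (n,) human similarity labels from test_sts.tsv
    a = emb_a / np.linalg.norm(emb_a, axis=1, keepdims=True)
    b = emb_b / np.linalg.norm(emb_b, axis=1, keepdims=True)
    cos = (a * b).sum(axis=1)
    pearson = stats.pearsonr(cos, gold)[0] * 100
    spearman = stats.spearmanr(cos, gold)[0] * 100
    return pearson, spearman
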
Performance - unsupervised
Model             | Average | Cosine Pearson | Cosine Spearman | Euclidean Pearson | Euclidean Spearman | Manhattan Pearson | Manhattan Spearman | Dot Pearson | Dot Spearman
KoSRoBERTa-base   | N/A     | N/A            | 48.96           | N/A               | N/A                | N/A               | N/A                | N/A         | N/A
KoSRoBERTa-large  | N/A     | N/A            | 51.35           | N/A               | N/A                | N/A               | N/A                | N/A         | N/A
KoSimCSE-BERT     | 74.08   | 74.92          | 73.98           | 74.15             | 74.22              | 74.07             | 74.07              | 74.15       | 73.14
KoSimCSE-RoBERTa  | 75.27   | 75.93          | 75.00           | 75.28             | 75.01              | 75.17             | 74.83              | 75.95       | 75.01
KoDiffCSE-RoBERTa | 77.17   | 77.73          | 76.96           | 77.21             | 76.89              | 77.11             | 76.81              | 77.74       | 76.97
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


References
@inproceedings{chuang2022diffcse,
  title={{DiffCSE}: Difference-based Contrastive Learning for Sentence Embeddings},
  author={Chuang, Yung-Sung and Dangovski, Rumen and Luo, Hongyin and Zhang, Yang and Chang, Shiyu and Soljacic, Marin and Li, Shang-Wen and Yih, Wen-tau and Kim, Yoon and Glass, James},
  booktitle={Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)},
  year={2022}
}
@misc{park2021klue,
  title={KLUE: Korean Language Understanding Evaluation},
  author={Sungjoon Park and Jihyung Moon and Sungdong Kim and Won Ik Cho and Jiyoon Han and Jangwon Park and Chisung Song and Junseong Kim and Yongsook Song and Taehwan Oh and Joohong Lee and Juhyun Oh and Sungwon Lyu and Younghoon Jeong and Inkwon Lee and Sangwoo Seo and Dongjun Lee and Hyunwoo Kim and Myeonghwa Lee and Seongbo Jang and Seungwon Do and Sunkyoung Kim and Kyungtae Lim and Jongwon Lee and Kyumin Park and Jamin Shin and Seonghyun Kim and Lucy Park and Alice Oh and Jungwoo Ha and Kyunghyun Cho},
  year={2021},
  eprint={2105.09680},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
@article{ham2020kornli,
  title={KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding},
  author={Ham, Jiyeon and Choe, Yo Joong and Park, Kyubyong and Choi, Ilji and Soh, Hyungjoon},
  journal={arXiv preprint arXiv:2004.03289},
  year={2020}
}
