From 73d6a2f9019960c327f19689c1d9a6c0fba31d86 Mon Sep 17 00:00:00 2001
From: Junyi_Li
Date: Fri, 24 Apr 2020 16:12:42 -0400
Subject: [PATCH] [model_cards] xlnet_chinese_large & roberta_chinese_large

---
 .../clue/albert_chinese_small/README.md       |  4 +++
 .../clue/albert_chinese_tiny/README.md        |  4 +++
 .../roberta_chinese_3L312_clue_tiny/README.md |  4 +++
 .../clue/roberta_chinese_base/README.md       |  4 +++
 .../clue/roberta_chinese_large/README.md      | 35 +++++++++++++++++++
 .../clue/xlnet_chinese_large/README.md        | 33 +++++++++++++++++
 6 files changed, 84 insertions(+)
 create mode 100644 model_cards/clue/roberta_chinese_large/README.md
 create mode 100644 model_cards/clue/xlnet_chinese_large/README.md

diff --git a/model_cards/clue/albert_chinese_small/README.md b/model_cards/clue/albert_chinese_small/README.md
index 55e35e74ee..00c748dc14 100644
--- a/model_cards/clue/albert_chinese_small/README.md
+++ b/model_cards/clue/albert_chinese_small/README.md
@@ -1,3 +1,7 @@
+---
+language: chinese
+---
+
 ## albert_chinese_small
 
 ### Overview
diff --git a/model_cards/clue/albert_chinese_tiny/README.md b/model_cards/clue/albert_chinese_tiny/README.md
index 71437e2475..088a216153 100644
--- a/model_cards/clue/albert_chinese_tiny/README.md
+++ b/model_cards/clue/albert_chinese_tiny/README.md
@@ -1,3 +1,7 @@
+---
+language: chinese
+---
+
 ## albert_chinese_tiny
 
 ### Overview
diff --git a/model_cards/clue/roberta_chinese_3L312_clue_tiny/README.md b/model_cards/clue/roberta_chinese_3L312_clue_tiny/README.md
index dca65cff1a..fac9f2f467 100644
--- a/model_cards/clue/roberta_chinese_3L312_clue_tiny/README.md
+++ b/model_cards/clue/roberta_chinese_3L312_clue_tiny/README.md
@@ -1,3 +1,7 @@
+---
+language: chinese
+---
+
 # Introduction
 
 This model was trained on TPU and the details are as follows:
diff --git a/model_cards/clue/roberta_chinese_base/README.md b/model_cards/clue/roberta_chinese_base/README.md
index b0fcb124c1..0889484687 100644
--- a/model_cards/clue/roberta_chinese_base/README.md
+++ b/model_cards/clue/roberta_chinese_base/README.md
@@ -1,3 +1,7 @@
+---
+language: chinese
+---
+
 ## roberta_chinese_base
 
 ### Overview
diff --git a/model_cards/clue/roberta_chinese_large/README.md b/model_cards/clue/roberta_chinese_large/README.md
new file mode 100644
index 0000000000..c983469512
--- /dev/null
+++ b/model_cards/clue/roberta_chinese_large/README.md
@@ -0,0 +1,35 @@
+---
+language: chinese
+---
+
+## roberta_chinese_large
+
+### Overview
+
+**Language model:** roberta-large
+**Model size:** 1.2G
+**Language:** Chinese
+**Training data:** [CLUECorpusSmall](https://github.com/CLUEbenchmark/CLUECorpus2020)
+**Eval data:** [CLUE dataset](https://github.com/CLUEbenchmark/CLUE)
+
+### Results
+
+For results on downstream tasks like text classification, please refer to [this repository](https://github.com/CLUEbenchmark/CLUE).
+
+### Usage
+
+**NOTE:** You have to call **BertTokenizer** instead of RobertaTokenizer !!!
+
+```
+import torch
+from transformers import BertTokenizer, BertModel
+tokenizer = BertTokenizer.from_pretrained("clue/roberta_chinese_large")
+roberta = BertModel.from_pretrained("clue/roberta_chinese_large")
+```
+
+### About CLUE benchmark
+
+Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard.
+
+Github: https://github.com/CLUEbenchmark
+Website: https://www.cluebenchmarks.com/
diff --git a/model_cards/clue/xlnet_chinese_large/README.md b/model_cards/clue/xlnet_chinese_large/README.md
new file mode 100644
index 0000000000..e958b90eee
--- /dev/null
+++ b/model_cards/clue/xlnet_chinese_large/README.md
@@ -0,0 +1,33 @@
+---
+language: chinese
+---
+
+## xlnet_chinese_large
+
+### Overview
+
+**Language model:** xlnet-large
+**Model size:** 1.3G
+**Language:** Chinese
+**Training data:** [CLUECorpusSmall](https://github.com/CLUEbenchmark/CLUECorpus2020)
+**Eval data:** [CLUE dataset](https://github.com/CLUEbenchmark/CLUE)
+
+### Results
+
+For results on downstream tasks like text classification, please refer to [this repository](https://github.com/CLUEbenchmark/CLUE).
+
+### Usage
+
+```
+import torch
+from transformers import XLNetTokenizer,XLNetModel
+tokenizer = XLNetTokenizer.from_pretrained("clue/xlnet_chinese_large")
+xlnet = XLNetModel.from_pretrained("clue/xlnet_chinese_large")
+```
+
+### About CLUE benchmark
+
+Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard.
+
+Github: https://github.com/CLUEbenchmark
+Website: https://www.cluebenchmarks.com/
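
Reviewer note: every README touched by this patch gains the same four-line header (`---`, `language: chinese`, `---`, blank line), which is a front-matter metadata block ahead of the Markdown body. The sketch below shows one way a script could separate that block from the card text. It is only an illustration: `split_front_matter` is a hypothetical helper, not part of `transformers`, and it assumes flat `key: value` lines rather than full YAML.

```python
from typing import Dict, Tuple


def split_front_matter(text: str) -> Tuple[Dict[str, str], str]:
    """Split a model card into (metadata, body).

    Assumption: the front matter is a block of flat `key: value` lines
    between two `---` lines at the very top of the file. Nested YAML
    is not handled by this sketch.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text  # no front matter: return the card unchanged
    try:
        # Index of the closing `---`, searching from the second line on.
        end = next(
            i for i, line in enumerate(lines[1:], start=1)
            if line.strip() == "---"
        )
    except StopIteration:
        return {}, text  # unterminated block: treat everything as body
    metadata: Dict[str, str] = {}
    for line in lines[1:end]:
        if ":" in line:
            key, value = line.split(":", 1)
            metadata[key.strip()] = value.strip()
    # Drop the blank line(s) that separate the header from the body.
    body = "\n".join(lines[end + 1:]).lstrip("\n")
    return metadata, body


# The first lines of the xlnet_chinese_large card as added by this patch.
card = """---
language: chinese
---

## xlnet_chinese_large

### Overview
"""

metadata, body = split_front_matter(card)
print(metadata)
print(body.splitlines()[0])
```

Cards without the header, or with an unterminated header, come back unchanged with empty metadata, so the helper can be run over a mix of old and updated READMEs.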