From 73028c5df0c28ca179fbe565482a9c2143787f61 Mon Sep 17 00:00:00 2001
From: Julien Chaumond
Date: Fri, 14 Feb 2020 15:16:33 -0500
Subject: [PATCH] [model_cards] EsperBERTo

---
 .../julien-c/EsperBERTo-small-pos/README.md   | 40 +++++++++++++
 .../julien-c/EsperBERTo-small/README.md       | 59 +++++++++++++++++++
 2 files changed, 99 insertions(+)
 create mode 100644 model_cards/julien-c/EsperBERTo-small-pos/README.md
 create mode 100644 model_cards/julien-c/EsperBERTo-small/README.md

diff --git a/model_cards/julien-c/EsperBERTo-small-pos/README.md b/model_cards/julien-c/EsperBERTo-small-pos/README.md
new file mode 100644
index 0000000000..700ae9a4c3
--- /dev/null
+++ b/model_cards/julien-c/EsperBERTo-small-pos/README.md
@@ -0,0 +1,40 @@
+---
+language: esperanto
+thumbnail: https://huggingface.co/blog/assets/EsperBERTo-thumbnail-v2.png
+---
+
+# EsperBERTo: RoBERTa-like Language model trained on Esperanto
+
+**Companion model to blog post https://huggingface.co/blog/how-to-train** 🔥
+
+## Training Details
+
+- current checkpoint: 566000
+- machine name: `galinette`
+
+
+![](https://huggingface.co/blog/assets/EsperBERTo-thumbnail-v2.png)
+
+## Example pipeline
+
+```python
+from transformers import TokenClassificationPipeline, pipeline
+
+
+MODEL_PATH = "./models/EsperBERTo-small-pos/"
+
+nlp = pipeline(
+    "ner",
+    model=MODEL_PATH,
+    tokenizer=MODEL_PATH,
+)
+# or instantiate a TokenClassificationPipeline directly.
+
+nlp("Mi estas viro kej estas tago varma.")
+
+# {'entity': 'PRON', 'score': 0.9979867339134216, 'word': ' Mi'}
+# {'entity': 'VERB', 'score': 0.9683094620704651, 'word': ' estas'}
+# {'entity': 'VERB', 'score': 0.9797462821006775, 'word': ' estas'}
+# {'entity': 'NOUN', 'score': 0.8509314060211182, 'word': ' tago'}
+# {'entity': 'ADJ', 'score': 0.9996201395988464, 'word': ' varma'}
+```
\ No newline at end of file
diff --git a/model_cards/julien-c/EsperBERTo-small/README.md b/model_cards/julien-c/EsperBERTo-small/README.md
new file mode 100644
index 0000000000..52e0df3a03
--- /dev/null
+++ b/model_cards/julien-c/EsperBERTo-small/README.md
@@ -0,0 +1,59 @@
+---
+language: esperanto
+thumbnail: https://huggingface.co/blog/assets/EsperBERTo-thumbnail-v2.png
+---
+
+# EsperBERTo: RoBERTa-like Language model trained on Esperanto
+
+**Companion model to blog post https://huggingface.co/blog/how-to-train** 🔥
+
+## Training Details
+
+- current checkpoint: 566000
+- machine name: `galinette`
+
+
+![](https://huggingface.co/blog/assets/EsperBERTo-thumbnail-v2.png)
+
+## Example pipeline
+
+```python
+from transformers import pipeline
+
+fill_mask = pipeline(
+    "fill-mask",
+    model="julien-c/EsperBERTo-small",
+    tokenizer="julien-c/EsperBERTo-small"
+)
+
+fill_mask("Jen la komenco de bela <mask>.")
+
+# This is the beginning of a beautiful <mask>.
+# =>
+
+# {
+#     'score':0.06502299010753632
+#     'sequence':' Jen la komenco de bela vivo.'
+#     'token':1099
+# }
+# {
+#     'score':0.0421181358397007
+#     'sequence':' Jen la komenco de bela vespero.'
+#     'token':5100
+# }
+# {
+#     'score':0.024884626269340515
+#     'sequence':' Jen la komenco de bela laboro.'
+#     'token':1570
+# }
+# {
+#     'score':0.02324388362467289
+#     'sequence':' Jen la komenco de bela tago.'
+#     'token':1688
+# }
+# {
+#     'score':0.020378097891807556
+#     'sequence':' Jen la komenco de bela festo.'
+#     'token':4580
+# }
+```
\ No newline at end of file