update with #s of sentences/tokens (#6546)

2020-08-17 21:48:05 +01:00
parent 63144701ed
commit fb7330b30e
1 changed file with 2 additions and 0 deletions
--- a/model_cards/jimregan/BERTreach/README.md
+++ b/model_cards/jimregan/BERTreach/README.md
@@ -15,6 +15,8 @@ tags:
 * Newscrawl 300k portion of the [Leipzig Corpora](https://wortschatz.uni-leipzig.de/en/download/irish)
 * Private news corpus crawled with [Corpus Crawler](https://github.com/google/corpuscrawler)
 (2125804 sentences, 47419062 tokens, as reckoned by wc)
 ```
 from transformers import pipeline
 fill_mask = pipeline("fill-mask", model="jimregan/BERTreach", tokenizer="jimregan/BERTreach")