Create README.md (#5847)

2020-07-17 20:03:53 +02:00
parent 1bca4fbd39
commit 9750e1300c
1 changed files with 50 additions and 0 deletions
--- a/model_cards/jannesg/takalane_afr_roberta/README.md
+++ b/model_cards/jannesg/takalane_afr_roberta/README.md
@@ -0,0 +1,50 @@
 ---
 language: 
 - af
 thumbnail: https://pbs.twimg.com/media/EVjR6BsWoAAFaq5.jpg
 tags:
 - af
 - fill-mask
 - pytorch
 - roberta
 - lm-head
 - masked-lm
 license: MIT
 ---
 # Takalani Sesame - Salie - Afrikaans 🇿🇦
 <img src="https://pbs.twimg.com/media/EVjR6BsWoAAFaq5.jpg" width="600"/> 
 ## Model description
 Takalani Sesame (named after the South African version of Sesame Street) is a project that aims to promote the use of South African languages in NLP, and in particular look at techniques for low-resource languages to equalise performance with larger languages around the world.
 ## Intended uses & limitations
 #### How to use
 ```python
 from transformers import AutoTokenizer, AutoModelWithLMHead
 tokenizer = AutoTokenizer.from_pretrained("jannesg/takalane_afr_roberta")
 model = AutoModelWithLMHead.from_pretrained("jannesg/takalane_afr_roberta")
 ```
 #### Limitations and bias
 Updates will be added continously to improve performance. 
 ## Training data
 Data collected from [https://wortschatz.uni-leipzig.de/en](https://wortschatz.uni-leipzig.de/en) <br/>
 **Sentences:** 2.8M
 ## Training procedure
 No preprocessing. Standard Huggingface hyperparameters. 
 ## Author
 Jannes Germishuys [website](http://jannesgg.github.io)