From 9750e1300ccc48e7fd445a17cd6164cc25f3c183 Mon Sep 17 00:00:00 2001
From: Jannes <36601086+jannesgg@users.noreply.github.com>
Date: Fri, 17 Jul 2020 20:03:53 +0200
Subject: [PATCH] Create README.md (#5847)
---
.../jannesg/takalane_afr_roberta/README.md | 50 +++++++++++++++++++
1 file changed, 50 insertions(+)
create mode 100644 model_cards/jannesg/takalane_afr_roberta/README.md
diff --git a/model_cards/jannesg/takalane_afr_roberta/README.md b/model_cards/jannesg/takalane_afr_roberta/README.md
new file mode 100644
index 0000000000..d43471c4f7
--- /dev/null
+++ b/model_cards/jannesg/takalane_afr_roberta/README.md
@@ -0,0 +1,50 @@
+---
+language:
+- af
+thumbnail: https://pbs.twimg.com/media/EVjR6BsWoAAFaq5.jpg
+tags:
+- af
+- fill-mask
+- pytorch
+- roberta
+- lm-head
+- masked-lm
+license: MIT
+---
+
+# Takalani Sesame - Salie - Afrikaans πΏπ¦
+
+
+
+## Model description
+
+Takalani Sesame (named after the South African version of Sesame Street) is a project that aims to promote the use of South African languages in NLP, and in particular look at techniques for low-resource languages to equalise performance with larger languages around the world.
+
+## Intended uses & limitations
+
+#### How to use
+
+```python
+from transformers import AutoTokenizer, AutoModelWithLMHead
+
+tokenizer = AutoTokenizer.from_pretrained("jannesg/takalane_afr_roberta")
+
+model = AutoModelWithLMHead.from_pretrained("jannesg/takalane_afr_roberta")
+```
+
+#### Limitations and bias
+
+Updates will be added continously to improve performance.
+
+## Training data
+
+Data collected from [https://wortschatz.uni-leipzig.de/en](https://wortschatz.uni-leipzig.de/en)
+**Sentences:** 2.8M
+
+## Training procedure
+
+No preprocessing. Standard Huggingface hyperparameters.
+
+## Author
+
+Jannes Germishuys [website](http://jannesgg.github.io)