From 7da051f1354bad13aac50101aab9bad057d3a073 Mon Sep 17 00:00:00 2001
From: Suraj Parmar
Date: Sat, 2 May 2020 20:45:39 +0530
Subject: [PATCH] model card for surajp/albert-base-sanskrit (#4114)

* Create README.md

* Update model_cards/surajp/albert-base-sanskrit/README.md

Co-authored-by: Julien Chaumond
---
 .../surajp/albert-base-sanskrit/README.md     | 36 +++++++++++++++++++
 1 file changed, 36 insertions(+)
 create mode 100644 model_cards/surajp/albert-base-sanskrit/README.md

diff --git a/model_cards/surajp/albert-base-sanskrit/README.md b/model_cards/surajp/albert-base-sanskrit/README.md
new file mode 100644
index 0000000000..b8094e7ac0
--- /dev/null
+++ b/model_cards/surajp/albert-base-sanskrit/README.md
@@ -0,0 +1,36 @@
+---
+language: sanskrit
+---
+
+
+# ALBERT-base-Sanskrit
+
+
+Explanation notebook (Colab): [SanskritALBERT.ipynb](https://colab.research.google.com/github/parmarsuraj99/suraj-parmar/blob/master/_notebooks/2020-05-02-SanskritALBERT.ipynb)
+
+Model size: **46 MB**
+
+Example of usage:
+
+```
+tokenizer = AutoTokenizer.from_pretrained("surajp/albert-base-sanskrit")
+model = AutoModel.from_pretrained("surajp/albert-base-sanskrit")
+
+enc = tokenizer.encode("ॐ सर्वे भवन्तु सुखिनः सर्वे सन्तु निरामयाः । सर्वे भद्राणि पश्यन्तु मा कश्चिद्दुःखभाग्भवेत् । ॐ शान्तिः शान्तिः शान्तिः ॥")
+print(tokenizer.decode(enc))
+
+ps = model(torch.tensor(enc).unsqueeze(0))  # add a batch dimension: (1, seq_len)
+print(ps[0].shape)
+```
+```
+'''
+Output:
+--------
+[CLS] ॐ सर्वे भवन्तु सुखिनः सर्वे सन्तु निरामयाः । सर्वे भद्राणि पश्यन्तु मा कश्चिद्दुःखभाग्भवेत् । ॐ शान्तिः शान्तिः शान्तिः ॥[SEP]
+torch.Size([1, 28, 768])
+```
+
+
+> Created by [Suraj Parmar/@parmarsuraj99](https://twitter.com/parmarsuraj99)
+
+> Made with ♥ in India
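
For readers who want to run the README example end to end, the following is a minimal, self-contained sketch that includes the imports the snippet relies on. It assumes `torch` and `transformers` are installed and that the checkpoint can be downloaded from the model hub. The helper names `add_batch_dim` and `embed` are hypothetical and are not part of the model card. ALBERT expects input ids shaped `(batch, sequence_length)`, so the sketch wraps the encoded sentence as a batch of one.

```python
def add_batch_dim(token_ids):
    """Wrap one encoded sentence as a batch of size 1: N ids -> [[id_1, ..., id_N]]."""
    return [list(token_ids)]


def embed(text, model_name="surajp/albert-base-sanskrit"):
    """Return the last hidden states for `text` (hypothetical helper, a sketch only)."""
    # Imported here so that add_batch_dim can be used without these packages installed.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()  # disable dropout for inference

    token_ids = tokenizer.encode(text)  # adds [CLS] and [SEP]
    input_ids = torch.tensor(add_batch_dim(token_ids))  # shape: (1, seq_len)

    with torch.no_grad():  # no gradients are needed for feature extraction
        outputs = model(input_ids)

    return outputs[0]  # last hidden states, shape: (1, seq_len, hidden_size)
```

Calling `embed("ॐ शान्तिः शान्तिः शान्तिः ॥")` should return a tensor whose first dimension is 1 (the batch), whose second dimension is the number of tokens in the encoded sentence, and whose last dimension is the hidden size (768 in the README output).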