Add cards for all Geotrend models (#8617)
* docs(bert-base-15lang-cased): add model card
* add cards for all Geotrend models
* [model cards] fix language tag for all Geotrend models
48
model_cards/Geotrend/bert-base-15lang-cased/README.md
Normal file
@@ -0,0 +1,48 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-15lang-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

The measurements below have been computed on a [Google Cloud n1-standard-1 machine (1 vCPU, 3.75 GB)](https://cloud.google.com/compute/docs/machine-types\#n1_machine_type):
| Model                           | Num parameters | Size   | Memory  | Loading time |
| ------------------------------- | -------------- | ------ | ------- | ------------ |
| bert-base-multilingual-cased    | 178 million    | 714 MB | 1400 MB | 4.2 sec      |
| Geotrend/bert-base-15lang-cased | 141 million    | 564 MB | 1098 MB | 3.1 sec      |

Handled languages: en, fr, es, de, zh, ar, ru, vi, el, bg, th, tr, hi, ur and sw.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-15lang-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-15lang-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
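The measurement table in the bert-base-15lang-cased card can be turned into relative savings with a little arithmetic. The sketch below only hard-codes the figures from that table (nothing is re-measured), and the names `BASELINE`, `REDUCED` and `relative_saving` are illustrative choices, not part of the commit.

```python
# Figures copied from the measurement table in the model card
# (Google Cloud n1-standard-1 machine); nothing here is re-measured.
BASELINE = {"params_millions": 178, "size_mb": 714, "memory_mb": 1400, "load_sec": 4.2}
REDUCED = {"params_millions": 141, "size_mb": 564, "memory_mb": 1098, "load_sec": 3.1}


def relative_saving(before: float, after: float) -> float:
    """Return the reduction from `before` to `after` as a percentage."""
    return 100.0 * (before - after) / before


# Percentage saved on each metric, rounded to one decimal place.
savings = {key: round(relative_saving(BASELINE[key], REDUCED[key]), 1) for key in BASELINE}

for key, pct in savings.items():
    print(f"{key}: {pct}% lower than bert-base-multilingual-cased")
```

On these figures, the 15-language model is roughly a fifth smaller in parameters, disk size and memory, and loads about a quarter faster.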
41
model_cards/Geotrend/bert-base-ar-cased/README.md
Normal file
@@ -0,0 +1,41 @@
---
language: ar

datasets: wikipedia

license: apache-2.0
---

# bert-base-ar-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.


For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-ar-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-ar-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
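The claim that a smaller model gives the same representations as the original can be illustrated with a toy example. The sketch below assumes, as the paper title suggests, that the size reduction comes from keeping only part of the vocabulary together with the matching embedding rows. The vocabulary, the vectors and the function `trim_vocabulary` are made up for illustration and are not the authors' code.

```python
from typing import Dict, List, Tuple

# Toy "full" vocabulary and embedding table (hypothetical values).
FULL_VOCAB: Dict[str, int] = {"[UNK]": 0, "hello": 1, "bonjour": 2, "marhaban": 3, "hallo": 4}
FULL_EMBEDDINGS: List[List[float]] = [
    [0.0, 0.0],
    [0.1, 0.2],
    [0.3, 0.4],
    [0.5, 0.6],
    [0.7, 0.8],
]


def trim_vocabulary(
    vocab: Dict[str, int],
    embeddings: List[List[float]],
    keep: List[str],
) -> Tuple[Dict[str, int], List[List[float]]]:
    """Keep only the tokens in `keep`, re-indexing them from 0.

    Each surviving token keeps its original embedding row unchanged,
    so only the row indices differ from the full table.
    """
    new_vocab = {token: new_id for new_id, token in enumerate(keep)}
    new_embeddings = [embeddings[vocab[token]] for token in keep]
    return new_vocab, new_embeddings


small_vocab, small_embeddings = trim_vocabulary(
    FULL_VOCAB, FULL_EMBEDDINGS, keep=["[UNK]", "marhaban"]
)

# The kept token gets a new id but still maps to the same vector.
full_vector = FULL_EMBEDDINGS[FULL_VOCAB["marhaban"]]
small_vector = small_embeddings[small_vocab["marhaban"]]
print(small_vocab["marhaban"], small_vector == full_vector)
```

Under that assumption, the layers above the embedding table are untouched, so a token that survives the trimming is processed exactly as it would be in the full model.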
40
model_cards/Geotrend/bert-base-bg-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: bg

datasets: wikipedia

license: apache-2.0
---

# bert-base-bg-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-bg-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-bg-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-de-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: de

datasets: wikipedia

license: apache-2.0
---

# bert-base-de-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-de-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-de-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-el-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: el

datasets: wikipedia

license: apache-2.0
---

# bert-base-el-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-el-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-el-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-ar-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-ar-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-ar-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-ar-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
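The cards in this commit follow a visible naming pattern (`Geotrend/bert-base-<languages>-cased`), and each "How to use" snippet stops once the model is loaded. The sketch below shows one possible way to go from language codes to token representations. `geotrend_model_id` and `encode` are hypothetical helpers, not part of `transformers`; the loading step assumes `torch` and a `transformers` release (v4 or later) whose model outputs expose `last_hidden_state`, plus network access to download the model.

```python
from typing import Sequence


def geotrend_model_id(languages: Sequence[str]) -> str:
    """Build a Hub id such as 'Geotrend/bert-base-en-ar-cased'.

    Follows the naming pattern seen in the file paths of this commit;
    it does not check that the model actually exists.
    """
    if not languages:
        raise ValueError("at least one language code is required")
    return f"Geotrend/bert-base-{'-'.join(languages)}-cased"


def encode(text: str, languages: Sequence[str]):
    """Return the last hidden states for `text` (downloads the model)."""
    # Imported lazily so the id helper works without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    model_id = geotrend_model_id(languages)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    model.eval()  # inference mode: disables dropout

    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # Shape: (batch_size, sequence_length, hidden_size)
    return outputs.last_hidden_state


print(geotrend_model_id(["en", "ar"]))
```

Calling `encode("Hello", ["en", "ar"])` would then load the English-Arabic model from this card and return one vector per token.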
40
model_cards/Geotrend/bert-base-en-bg-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-bg-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-bg-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-bg-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: en

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-de-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-de-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-de-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-de-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-el-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-el-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-el-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-el-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-es-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-es-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-es-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-es-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-fr-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-fr-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-fr-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-fr-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-hi-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-hi-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-hi-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-hi-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-ru-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-ru-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-ru-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-ru-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-sw-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-sw-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-sw-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-sw-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-th-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-th-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-th-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-th-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-tr-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-tr-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-tr-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-tr-cased")

```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr for any question, feedback or request.
40
model_cards/Geotrend/bert-base-en-ur-cased/README.md
Normal file
40
model_cards/Geotrend/bert-base-en-ur-cased/README.md
Normal file
@@ -0,0 +1,40 @@
|
|||||||
|
---
|
||||||
|
language: multilingual
|
||||||
|
|
||||||
|
datasets: wikipedia
|
||||||
|
|
||||||
|
license: apache-2.0
|
||||||
|
---
|
||||||
|
|
||||||
|
# bert-base-en-ur-cased
|
||||||
|
|
||||||
|
We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.
|
||||||
|
|
||||||
|
Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions give exactly the same representations produced by the original model which preserves the original accuracy.
|
||||||
|
|
||||||
|
For more information please visit our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-ur-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-ur-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-en-vi-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-vi-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-vi-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-vi-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-en-zh-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: multilingual

datasets: wikipedia

license: apache-2.0
---

# bert-base-en-zh-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-en-zh-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-en-zh-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-es-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: es

datasets: wikipedia

license: apache-2.0
---

# bert-base-es-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-es-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-es-cased")
```
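
Since each version handles only a chosen set of languages, it can help to see what a reduced vocabulary means for tokenization. The sketch below is a toy re-implementation of the greedy longest-match-first WordPiece split used by BERT tokenizers, run against a made-up Spanish-only vocabulary. It assumes, as in standard BERT tokenization, that a word with no matching pieces falls back to `[UNK]`. The function name and vocabulary are hypothetical, and the real tokenizer shipped with the model should be used in practice.

```python
from typing import List, Set


def wordpiece(word: str, vocab: Set[str], unk: str = "[UNK]") -> List[str]:
    """Greedy longest-match-first WordPiece split of a single word."""
    pieces: List[str] = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Try the longest remaining substring first, then shrink it.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # continuation-piece prefix
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            return [unk]  # no piece fits: the whole word is unknown
        pieces.append(match)
        start = end
    return pieces


# Made-up Spanish-only vocabulary, for illustration only.
toy_vocab = {"hola", "mun", "##do", "bien", "##venido"}

print(wordpiece("mundo", toy_vocab))       # ['mun', '##do']
print(wordpiece("bienvenido", toy_vocab))  # ['bien', '##venido']
print(wordpiece("привет", toy_vocab))      # ['[UNK]']
```

Words covered by the retained vocabulary split into known pieces, while text in a script the vocabulary does not cover collapses to `[UNK]` in this toy setting.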

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-fr-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: fr

datasets: wikipedia

license: apache-2.0
---

# bert-base-fr-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-fr-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-fr-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-hi-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: hi

datasets: wikipedia

license: apache-2.0
---

# bert-base-hi-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-hi-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-hi-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-ru-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: ru

datasets: wikipedia

license: apache-2.0
---

# bert-base-ru-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-ru-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-ru-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-sw-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: sw

datasets: wikipedia

license: apache-2.0
---

# bert-base-sw-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-sw-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-sw-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-th-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: th

datasets: wikipedia

license: apache-2.0
---

# bert-base-th-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-th-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-th-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-tr-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: tr

datasets: wikipedia

license: apache-2.0
---

# bert-base-tr-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-tr-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-tr-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-ur-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: ur

datasets: wikipedia

license: apache-2.0
---

# bert-base-ur-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-ur-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-ur-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-vi-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: vi

datasets: wikipedia

license: apache-2.0
---

# bert-base-vi-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-vi-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-vi-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.
40
model_cards/Geotrend/bert-base-zh-cased/README.md
Normal file
@@ -0,0 +1,40 @@
---
language: zh

datasets: wikipedia

license: apache-2.0
---

# bert-base-zh-cased

We are sharing smaller versions of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) that handle a custom number of languages.

Unlike [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased), our versions produce exactly the same representations as the original model, which preserves the original accuracy.

For more information, please see our paper: [Load What You Need: Smaller Versions of Multilingual BERT](https://www.aclweb.org/anthology/2020.sustainlp-1.16.pdf).

## How to use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Geotrend/bert-base-zh-cased")
model = AutoModel.from_pretrained("Geotrend/bert-base-zh-cased")
```

### How to cite

```bibtex
@inproceedings{smallermbert,
  title={Load What You Need: Smaller Versions of Multilingual BERT},
  author={Abdaoui, Amine and Pradel, Camille and Sigel, Grégoire},
  booktitle={SustaiNLP / EMNLP},
  year={2020}
}
```

## Contact

Please contact amine@geotrend.fr with any questions, feedback, or requests.