Thomas Wolf
59941c5d1f
Merge pull request #2189 from stefan-it/xlmr
...
Add support for XLM-RoBERTa
2019-12-20 13:26:38 +01:00
Thomas Wolf
ff36e6d8d7
Merge pull request #2231 from huggingface/requests_user_agent
...
[http] customizable requests user-agent
2019-12-20 10:28:10 +01:00
Julien Chaumond
15d897ff4a
[http] customizable requests user-agent
2019-12-19 18:29:22 -05:00
Julien Chaumond
f25e9b6f77
[hf_bucket_url] support for cloudfront urls
2019-12-19 18:28:17 -05:00
Julien Chaumond
a5a06a851e
[doc] Param name consistency
2019-12-19 16:24:20 -05:00
Aidan Kierans
1718fb9e74
Minor/basic text fixes ( #2229 )
...
* Small clarification
Matches line 431 to line 435 for additional clarity and consistency.
* Fixed minor typo
The letter "s" was previously omitted from the word "docstrings".
2019-12-19 16:23:18 -05:00
Julien Chaumond
9a399ead25
Revert incorrect #1778
2019-12-19 15:45:48 -05:00
Stefan Schweter
3376adc051
configuration/modeling/tokenization: add various fine-tuned XLM-RoBERTa models for English, German, Spanish and Dutch (CoNLL datasets)
2019-12-19 21:30:23 +01:00
Lysandre
33adab2b91
Fix albert example
2019-12-19 12:40:43 -05:00
Lysandre
a1f1dce0ae
Correct max position for SQUAD and TFDS
2019-12-19 12:25:55 -05:00
Francesco
62c1fc3c1e
Removed duplicate XLMConfig, XLMForQuestionAnswering and XLMTokenizer from import statement of run_squad.py script
2019-12-19 09:50:56 -05:00
Ejar
284572efc0
Updated typo on the link
...
Updated documentation due to typo
2019-12-19 09:36:43 -05:00
patrickvonplaten
ed6ba93912
corrected typo in example for t5 model input argument
2019-12-19 09:34:55 -05:00
Stefan Schweter
a26ce4dee1
examples: add XLM-RoBERTa to glue script
2019-12-19 02:23:01 +01:00
Stefan Schweter
fe9aab1055
tokenization: use S3 location for XLM-RoBERTa model
2019-12-18 23:47:48 +01:00
Stefan Schweter
5c5f67a256
modeling: use S3 location for XLM-RoBERTa model
2019-12-18 23:47:00 +01:00
Stefan Schweter
db90e12114
configuration: use S3 location for XLM-RoBERTa model
2019-12-18 23:46:33 +01:00
Stefan Schweter
f09d999641
docs: fix numbering 😅
2019-12-18 19:49:33 +01:00
Stefan Schweter
dd7a958fd6
docs: add XLM-RoBERTa to pretrained model list (incl. all parameters)
2019-12-18 19:45:46 +01:00
Stefan Schweter
d35405b7a3
docs: add XLM-RoBERTa to index page
2019-12-18 19:45:10 +01:00
Stefan Schweter
3e89fca543
readme: add XLM-RoBERTa to model architecture list
2019-12-18 19:44:23 +01:00
Stefan Schweter
128cfdee9b
tokenization add XLM-RoBERTa base model
2019-12-18 19:28:16 +01:00
Stefan Schweter
e778dd854d
modeling: add XLM-RoBERTa base model
2019-12-18 19:27:34 +01:00
Stefan Schweter
64a971a915
auto: add XLM-RoBERTa to auto tokenization
2019-12-18 18:24:32 +01:00
Stefan Schweter
036831e279
auto: add XLM-RoBERTa to audo modeling
2019-12-18 18:23:42 +01:00
Stefan Schweter
41a13a6375
auto: add XLMRoBERTa to auto configuration
2019-12-18 18:20:27 +01:00
Lysandre
8efc6dd544
fix #2214
2019-12-18 10:47:59 -05:00
Stefan Schweter
01b68be34f
converter: remove XLM-RoBERTa specific script (can be done with the script for RoBERTa now)
2019-12-18 12:24:46 +01:00
Stefan Schweter
ca31abc6d6
tokenization: *align* fairseq and spm vocab to fix some tokenization errors
2019-12-18 11:36:54 +01:00
Stefan Schweter
cce3089b65
Merge remote-tracking branch 'upstream/master' into xlmr
2019-12-18 11:05:16 +01:00
Julien Chaumond
94c99db34c
[FinBERT] fix incorrect url
2019-12-17 20:35:25 -05:00
Julien Chaumond
7ffa817390
[s3] mv files and update links
2019-12-17 20:35:25 -05:00
Antti Virtanen
c5f35e61db
Uploaded files to AWS.
2019-12-17 20:35:25 -05:00
Antti Virtanen
abc43ffbff
Add pretrained model documentation for FinBERT.
2019-12-17 20:35:25 -05:00
Antti Virtanen
8ac840ff87
Adding Finnish BERT.
2019-12-17 20:35:25 -05:00
Julien Chaumond
a0d386455b
Fix outdated tokenizer doc
2019-12-17 20:07:39 -05:00
Julien Chaumond
ea636440d1
[roberta.conversion] Do not hardcode vocab size
...
and support for fairseq 0.9+
2019-12-17 18:12:22 -05:00
Arman Cohan
a4df2e0113
update roberta conversion
...
- update to fix conversion for the updated fairseq model
- create save directory if not exist
2019-12-17 18:12:22 -05:00
Lysandre
5e289f69bc
regex 2019.12.17 install fails with Python 2
2019-12-17 15:54:05 -05:00
Lysandre
2cff4bd8f3
Fix segmentation fault
2019-12-17 15:54:05 -05:00
thomwolf
2f1c745cde
update conversion script
2019-12-17 11:47:54 +01:00
thomwolf
83bc5235cf
Merge branch 'master' into pr/2189
2019-12-17 11:47:32 +01:00
Stefan Schweter
f349826a57
model: fix cls and sep token for XLM-RoBERTa documentation
2019-12-17 10:36:04 +01:00
Thomas Wolf
f061606277
Merge pull request #2164 from huggingface/cleanup-configs
...
[SMALL BREAKING CHANGE] Cleaning up configuration classes - Adding Model Cards
2019-12-17 09:10:16 +01:00
Julien Chaumond
3f5ccb183e
[doc] Clarify uploads
...
cf 855ff0e91d (commitcomment-36452545)
2019-12-16 18:20:29 -05:00
thomwolf
3cb51299c3
Fix #2109
2019-12-16 16:58:44 -05:00
Lysandre
18a879f475
fix #2180
2019-12-16 16:44:29 -05:00
Lysandre
d803409215
Fix run squad evaluate during training
2019-12-16 16:31:38 -05:00
Julien Chaumond
855ff0e91d
[doc] Model upload and sharing
...
ping @lysandrejik @thomwolf
Is this clear enough? Anything we should add?
2019-12-16 12:42:22 -05:00
Stefan Schweter
d064009b72
converter: fix vocab size
2019-12-16 17:23:25 +01:00