Commit Graph

227 Commits

Author SHA1 Message Date
Lysandre
b21402fc86 Python 2 tests + licence 2019-11-26 13:08:12 -05:00
Lysandre
c14a22272f ALBERT passes all tests 2019-11-26 13:08:12 -05:00
Lysandre
870320a24e Early tests 2019-11-26 13:08:12 -05:00
Lysandre
25a31953e8 Output Attentions + output hidden states 2019-11-26 13:08:12 -05:00
Lysandre
ce9eade29c Initializer range using BertPreTrainedModel 2019-11-26 13:08:12 -05:00
Lysandre
5680a11063 Activation function managed from the config file 2019-11-26 13:08:12 -05:00
Lysandre
1e5b31c388 Several fixes and improvements 2019-11-26 13:08:12 -05:00
Lysandre
ee20201d33 Tokenization tests + fixes + init 2019-11-26 13:08:12 -05:00
Lysandre
e3ea5d1d8d Docstrings 2019-11-26 13:08:12 -05:00
Lysandre
fedac786d4 Tokenization + small fixes 2019-11-26 13:08:12 -05:00
Lysandre
67b422662c Documentation + improved AlbertForMaskedLM 2019-11-26 13:08:12 -05:00
Lysandre
1b92564330 Reorganize and cleanup 2019-11-26 13:08:12 -05:00
Lysandre
12290c0d5c Handles multi layer and multi groups 2019-11-26 13:08:12 -05:00
Lysandre
139affaa8d Albert layer/layer groups 2019-11-26 13:08:12 -05:00
Lysandre
91ccbae788 Accepts multiple sizes 2019-11-26 13:08:12 -05:00
Lysandre
c0c2088333 ALBERT model 2019-11-26 13:08:12 -05:00
v_sboliu
8e5d84fcc1 Fixed typo 2019-11-26 09:01:32 -05:00
Bilal Khan
07bf43074f Fix GPT2 docstring 2019-11-25 11:32:00 -05:00
Evpok Padding
fa963ecc59 if→elif 2019-11-25 10:21:03 -05:00
Evpok Padding
c8eb8157b8 fix docstrings 2019-11-25 10:21:03 -05:00
Evpok Padding
99f750d64e add Camembert models to modeling_auto 2019-11-25 10:21:03 -05:00
Nikolay Korolev
041a901f32 Fix typo in documentation. toto -> to 2019-11-23 10:55:16 -05:00
Stefan Schweter
0b3d45eb64 camembert: add implementation for save_vocabulary method 2019-11-18 15:49:44 +01:00
Stefan Schweter
33753d9139 module: import CamembertForTokenClassification 2019-11-18 14:14:54 +01:00
Stefan Schweter
d32ce2c8df camembert: add wrapper for CamembertForTokenClassification 2019-11-18 14:14:19 +01:00
Julien Chaumond
0477b307c7 [camembert] tokenizer: use additional_special_tokens 2019-11-16 00:11:07 -05:00
Julien Chaumond
f9abf73e31 [camembert] realign w/ recent changes 2019-11-16 00:11:07 -05:00
Julien Chaumond
26858f27cb [camembert] Upload to s3 + rename script 2019-11-16 00:11:07 -05:00
Louis MARTIN
035fea5315 Add CamemBERT to auto files and docs 2019-11-16 00:11:07 -05:00
Louis MARTIN
694d4fcbb6 Add CamemBERT classes to __init__.py 2019-11-16 00:11:07 -05:00
Louis MARTIN
fb6c70a91d Update tokenization_camembert.py with urls 2019-11-16 00:11:07 -05:00
Louis MARTIN
e44b939e71 Add configuration_camembert.py and modeling_camembert.py 2019-11-16 00:11:07 -05:00
Louis MARTIN
14b3aa3b3c Add tokenization_camembert.py 2019-11-16 00:11:07 -05:00
Thomas Wolf
74ce8de7d8 Merge pull request #1792 from stefan-it/distilbert-for-token-classification
DistilBERT for token classification
2019-11-14 22:47:53 +01:00
Thomas Wolf
5b322a36db Merge pull request #1811 from huggingface/special-tokens
Fix special tokens addition in decoder #1807
2019-11-14 22:17:24 +01:00
Thomas Wolf
1a237d7f42 Merge pull request #1831 from iedmrc/gpt2-tokenization-sum-func-replacement
sum() is replaced by itertools.chain.from_iterable()
2019-11-14 22:11:54 +01:00
Thomas Wolf
df99f8c5a1 Merge pull request #1832 from huggingface/memory-leak-schedulers
replace LambdaLR scheduler wrappers by function
2019-11-14 22:10:31 +01:00
Lysandre
a67e747889 Reorganized max_len warning 2019-11-14 10:30:22 -05:00
Rémi Louf
022525b003 replace LambdaLR scheduler wrappers by function
Custom schedulers are currently initiated by wrapping Pytorch's LambdaLR
class and passing a method of the wrapping class to the __init__
function of LambdaLR. This approach is not appropriate for several
reasons:

1. one does not need to define a class when it only defines a
__init__() method;
2. instantiating the parent class by passing a method of the child class
creates a cyclical reference which leads to memory leaks. See issues #1742 and #1134.

In this commit we replace the wrapper classes with functions that
instantiate `LambdaLR` with a custom learning rate function. We use a
closure to specify the parameter of the latter. We also do a bit of
renaming within the function to explicit the behaviour and removed
docstrings that were subsequently not necessary.
2019-11-14 15:39:08 +01:00
İbrahim Ethem Demirci
7627dde1f8 sum() is the leanest method to flatten a string list, so it's been replaced by itertools.chain.from_iterable() 2019-11-14 17:06:15 +03:00
Lysandre
74d0bcb6ff Fix special tokens addition in decoder 2019-11-12 15:27:57 -05:00
Julien Chaumond
155c782a2c [inputs_embeds] All TF models + tests 2019-11-12 11:29:21 -05:00
Julien Chaumond
2aef2f0bbc [common attributes] Fix previous commit for transfo-xl 2019-11-12 11:29:21 -05:00
Julien Chaumond
2f17464266 [common attributes] Slightly sharper test coverage 2019-11-12 11:29:21 -05:00
Julien Chaumond
9d2398fd99 Ooopsie 2019-11-12 11:29:21 -05:00
Julien Chaumond
70d97ddd60 [TF models] Common attributes as per #1721 2019-11-12 11:29:21 -05:00
Julien Chaumond
dd6b2e05e1 whitespace 2019-11-12 11:29:21 -05:00
Lysandre
d409aca326 Clarify the use of past in GPT2 and CTRL 2019-11-12 10:59:37 -05:00
Stefan Schweter
94e55253ae tests: add test case for DistilBertForTokenClassification implementation 2019-11-11 16:20:15 +01:00
Stefan Schweter
1806eabf59 module: add DistilBertForTokenClassification import 2019-11-11 16:18:48 +01:00