Stefan Schweter
d32ce2c8df
camembert: add wrapper for CamembertForTokenClassification
2019-11-18 14:14:19 +01:00
Julien Chaumond
0477b307c7
[camembert] tokenizer: use additional_special_tokens
2019-11-16 00:11:07 -05:00
Julien Chaumond
f9abf73e31
[camembert] realign w/ recent changes
2019-11-16 00:11:07 -05:00
Julien Chaumond
26858f27cb
[camembert] Upload to s3 + rename script
2019-11-16 00:11:07 -05:00
Louis MARTIN
035fea5315
Add CamemBERT to auto files and docs
2019-11-16 00:11:07 -05:00
Louis MARTIN
694d4fcbb6
Add CamemBERT classes to __init__.py
2019-11-16 00:11:07 -05:00
Louis MARTIN
fb6c70a91d
Update tokenization_camembert.py with urls
2019-11-16 00:11:07 -05:00
Louis MARTIN
e44b939e71
Add configuration_camembert.py and modeling_camembert.py
2019-11-16 00:11:07 -05:00
Louis MARTIN
14b3aa3b3c
Add tokenization_camembert.py
2019-11-16 00:11:07 -05:00
Thomas Wolf
74ce8de7d8
Merge pull request #1792 from stefan-it/distilbert-for-token-classification
...
DistilBERT for token classification
2019-11-14 22:47:53 +01:00
Thomas Wolf
5b322a36db
Merge pull request #1811 from huggingface/special-tokens
...
Fix special tokens addition in decoder #1807
2019-11-14 22:17:24 +01:00
Thomas Wolf
1a237d7f42
Merge pull request #1831 from iedmrc/gpt2-tokenization-sum-func-replacement
...
sum() is replaced by itertools.chain.from_iterable()
2019-11-14 22:11:54 +01:00
Thomas Wolf
df99f8c5a1
Merge pull request #1832 from huggingface/memory-leak-schedulers
...
replace LambdaLR scheduler wrappers by function
2019-11-14 22:10:31 +01:00
Lysandre
a67e747889
Reorganized max_len warning
2019-11-14 10:30:22 -05:00
Rémi Louf
022525b003
replace LambdaLR scheduler wrappers by function
...
Custom schedulers are currently initiated by wrapping Pytorch's LambdaLR
class and passing a method of the wrapping class to the __init__
function of LambdaLR. This approach is not appropriate for several
reasons:
1. one does not need to define a class when it only defines a
__init__() method;
2. instantiating the parent class by passing a method of the child class
creates a cyclical reference which leads to memory leaks. See issues #1742 and #1134 .
In this commit we replace the wrapper classes with functions that
instantiate `LambdaLR` with a custom learning rate function. We use a
closure to specify the parameter of the latter. We also do a bit of
renaming within the function to explicit the behaviour and removed
docstrings that were subsequently not necessary.
2019-11-14 15:39:08 +01:00
İbrahim Ethem Demirci
7627dde1f8
sum() is the leanest method to flatten a string list, so it's been replaced by itertools.chain.from_iterable()
2019-11-14 17:06:15 +03:00
Lysandre
74d0bcb6ff
Fix special tokens addition in decoder
2019-11-12 15:27:57 -05:00
Julien Chaumond
155c782a2c
[inputs_embeds] All TF models + tests
2019-11-12 11:29:21 -05:00
Julien Chaumond
2aef2f0bbc
[common attributes] Fix previous commit for transfo-xl
2019-11-12 11:29:21 -05:00
Julien Chaumond
2f17464266
[common attributes] Slightly sharper test coverage
2019-11-12 11:29:21 -05:00
Julien Chaumond
9d2398fd99
Ooopsie
2019-11-12 11:29:21 -05:00
Julien Chaumond
70d97ddd60
[TF models] Common attributes as per #1721
2019-11-12 11:29:21 -05:00
Julien Chaumond
dd6b2e05e1
whitespace
2019-11-12 11:29:21 -05:00
Lysandre
d409aca326
Clarify the use of past in GPT2 and CTRL
2019-11-12 10:59:37 -05:00
Stefan Schweter
94e55253ae
tests: add test case for DistilBertForTokenClassification implementation
2019-11-11 16:20:15 +01:00
Stefan Schweter
1806eabf59
module: add DistilBertForTokenClassification import
2019-11-11 16:18:48 +01:00
Stefan Schweter
1c7253cc5f
modeling: add DistilBertForTokenClassification implementation
2019-11-11 16:18:16 +01:00
Lysandre
b5d330d118
Fix #1784
2019-11-11 10:15:14 -05:00
Julien Chaumond
1c542df7e5
Add RoBERTa-based GPT-2 Output Detector from OpenAI
...
converted from https://github.com/openai/gpt-2-output-dataset/tree/master/detector
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr >
Co-Authored-By: Jong Wook Kim <jongwook@nyu.edu >
Co-Authored-By: Jeff Wu <wuthefwasthat@gmail.com >
2019-11-06 16:26:31 -05:00
Julien Chaumond
2f3a421018
Fix other PyTorch models
2019-11-06 14:03:47 -05:00
Julien Chaumond
d5319793c4
Fix BERT
2019-11-06 14:03:47 -05:00
Julien Chaumond
27e015bd54
[tests] Flag to test on cuda
2019-11-06 14:03:47 -05:00
Julien Chaumond
13d9135fa5
[tests] get rid of warning
...
cf. https://docs.pytest.org/en/latest/example/simple.html
2019-11-06 14:03:47 -05:00
Julien Chaumond
30968d70af
misc doc
2019-11-05 19:06:12 -05:00
Dom Hudson
de890ae67d
Updating docblocks in optimizers.py
2019-11-05 17:31:29 -05:00
Lysandre
d7d36181fd
GPT-2 XL
2019-11-05 13:31:58 -05:00
Julien Chaumond
7daacf00df
Merge pull request #1695 from huggingface/models_inputs_embeds
...
model forwards can take an inputs_embeds param
2019-11-05 09:55:28 -05:00
thomwolf
f1e4db2aa8
Fix #1686
2019-11-05 09:38:00 +01:00
Julien Chaumond
00337e9687
[inputs_embeds] All PyTorch models
2019-11-05 00:39:18 +00:00
Julien Chaumond
9eddf44b7a
docstring + check
2019-11-04 17:19:15 +00:00
Julien Chaumond
8e11de0e86
model forwards can take an inputs_embeds param
2019-11-04 16:56:26 +00:00
thomwolf
b340a910ed
fix tests - flagged as slow all the tests downloading from AWS
2019-11-04 16:03:36 +01:00
thomwolf
f02805da6f
fix tests
2019-11-04 15:42:23 +01:00
thomwolf
1724cee8c4
switch from properties to methods
2019-11-04 15:34:10 +01:00
thomwolf
9b45d0f878
Add common properties input_embeddings and output_embeddings
2019-11-04 12:28:56 +01:00
cregouby
ac29353abe
Fix https://github.com/huggingface/transformers/issues/1673
2019-10-31 10:04:40 +01:00
Thomas Wolf
22838f19fd
Merge pull request #1668 from tlkh/fix-tf-xlm
...
Fixed training for TF XLM
2019-10-30 17:08:00 +01:00
Thomas Wolf
04c69db399
Merge pull request #1628 from huggingface/tfglue
...
run_tf_glue works with all tasks
2019-10-30 17:04:03 +01:00
Thomas Wolf
3df4367244
Merge pull request #1601 from huggingface/clean-roberta
...
Clean roberta model & all tokenizers now add special tokens by default (breaking change)
2019-10-30 17:00:40 +01:00
Thomas Wolf
36174696cc
Merge branch 'master' into clean-roberta
2019-10-30 16:51:06 +01:00