HuggingFace_transformer

Author	SHA1	Message	Date
Lysandre	b21402fc86	Python 2 tests + licence	2019-11-26 13:08:12 -05:00
Lysandre	c14a22272f	ALBERT passes all tests	2019-11-26 13:08:12 -05:00
Lysandre	870320a24e	Early tests	2019-11-26 13:08:12 -05:00
Lysandre	25a31953e8	Output Attentions + output hidden states	2019-11-26 13:08:12 -05:00
Lysandre	ce9eade29c	Initializer range using BertPreTrainedModel	2019-11-26 13:08:12 -05:00
Lysandre	5680a11063	Activation function managed from the config file	2019-11-26 13:08:12 -05:00
Lysandre	1e5b31c388	Several fixes and improvements	2019-11-26 13:08:12 -05:00
Lysandre	ee20201d33	Tokenization tests + fixes + init	2019-11-26 13:08:12 -05:00
Lysandre	e3ea5d1d8d	Docstrings	2019-11-26 13:08:12 -05:00
Lysandre	fedac786d4	Tokenization + small fixes	2019-11-26 13:08:12 -05:00
Lysandre	67b422662c	Documentation + improved AlbertForMaskedLM	2019-11-26 13:08:12 -05:00
Lysandre	1b92564330	Reorganize and cleanup	2019-11-26 13:08:12 -05:00
Lysandre	12290c0d5c	Handles multi layer and multi groups	2019-11-26 13:08:12 -05:00
Lysandre	139affaa8d	Albert layer/layer groups	2019-11-26 13:08:12 -05:00
Lysandre	91ccbae788	Accepts multiple sizes	2019-11-26 13:08:12 -05:00
Lysandre	c0c2088333	ALBERT model	2019-11-26 13:08:12 -05:00
v_sboliu	8e5d84fcc1	Fixed typo	2019-11-26 09:01:32 -05:00
Bilal Khan	07bf43074f	Fix GPT2 docstring	2019-11-25 11:32:00 -05:00
Evpok Padding	fa963ecc59	if→elif	2019-11-25 10:21:03 -05:00
Evpok Padding	c8eb8157b8	fix docstrings	2019-11-25 10:21:03 -05:00
Evpok Padding	99f750d64e	add Camembert models to modeling_auto	2019-11-25 10:21:03 -05:00
Nikolay Korolev	041a901f32	Fix typo in documentation. toto -> to	2019-11-23 10:55:16 -05:00
Stefan Schweter	0b3d45eb64	camembert: add implementation for save_vocabulary method	2019-11-18 15:49:44 +01:00
Stefan Schweter	33753d9139	module: import CamembertForTokenClassification	2019-11-18 14:14:54 +01:00
Stefan Schweter	d32ce2c8df	camembert: add wrapper for CamembertForTokenClassification	2019-11-18 14:14:19 +01:00
Julien Chaumond	0477b307c7	[camembert] tokenizer: use additional_special_tokens	2019-11-16 00:11:07 -05:00
Julien Chaumond	f9abf73e31	[camembert] realign w/ recent changes	2019-11-16 00:11:07 -05:00
Julien Chaumond	26858f27cb	[camembert] Upload to s3 + rename script	2019-11-16 00:11:07 -05:00
Louis MARTIN	035fea5315	Add CamemBERT to auto files and docs	2019-11-16 00:11:07 -05:00
Louis MARTIN	694d4fcbb6	Add CamemBERT classes to __init__.py	2019-11-16 00:11:07 -05:00
Louis MARTIN	fb6c70a91d	Update tokenization_camembert.py with urls	2019-11-16 00:11:07 -05:00
Louis MARTIN	e44b939e71	Add configuration_camembert.py and modeling_camembert.py	2019-11-16 00:11:07 -05:00
Louis MARTIN	14b3aa3b3c	Add tokenization_camembert.py	2019-11-16 00:11:07 -05:00
Thomas Wolf	74ce8de7d8	Merge pull request #1792 from stefan-it/distilbert-for-token-classification DistilBERT for token classification	2019-11-14 22:47:53 +01:00
Thomas Wolf	5b322a36db	Merge pull request #1811 from huggingface/special-tokens Fix special tokens addition in decoder #1807	2019-11-14 22:17:24 +01:00
Thomas Wolf	1a237d7f42	Merge pull request #1831 from iedmrc/gpt2-tokenization-sum-func-replacement sum() is replaced by itertools.chain.from_iterable()	2019-11-14 22:11:54 +01:00
Thomas Wolf	df99f8c5a1	Merge pull request #1832 from huggingface/memory-leak-schedulers replace LambdaLR scheduler wrappers by function	2019-11-14 22:10:31 +01:00
Lysandre	a67e747889	Reorganized max_len warning	2019-11-14 10:30:22 -05:00
Rémi Louf	022525b003	replace LambdaLR scheduler wrappers by function Custom schedulers are currently initiated by wrapping Pytorch's LambdaLR class and passing a method of the wrapping class to the __init__ function of LambdaLR. This approach is not appropriate for several reasons: 1. one does not need to define a class when it only defines a __init__() method; 2. instantiating the parent class by passing a method of the child class creates a cyclical reference which leads to memory leaks. See issues #1742 and #1134. In this commit we replace the wrapper classes with functions that instantiate `LambdaLR` with a custom learning rate function. We use a closure to specify the parameter of the latter. We also do a bit of renaming within the function to explicit the behaviour and removed docstrings that were subsequently not necessary.	2019-11-14 15:39:08 +01:00
İbrahim Ethem Demirci	7627dde1f8	sum() is the leanest method to flatten a string list, so it's been replaced by itertools.chain.from_iterable()	2019-11-14 17:06:15 +03:00
Lysandre	74d0bcb6ff	Fix special tokens addition in decoder	2019-11-12 15:27:57 -05:00
Julien Chaumond	155c782a2c	[inputs_embeds] All TF models + tests	2019-11-12 11:29:21 -05:00
Julien Chaumond	2aef2f0bbc	[common attributes] Fix previous commit for transfo-xl	2019-11-12 11:29:21 -05:00
Julien Chaumond	2f17464266	[common attributes] Slightly sharper test coverage	2019-11-12 11:29:21 -05:00
Julien Chaumond	9d2398fd99	Ooopsie	2019-11-12 11:29:21 -05:00
Julien Chaumond	70d97ddd60	[TF models] Common attributes as per #1721	2019-11-12 11:29:21 -05:00
Julien Chaumond	dd6b2e05e1	whitespace	2019-11-12 11:29:21 -05:00
Lysandre	d409aca326	Clarify the use of past in GPT2 and CTRL	2019-11-12 10:59:37 -05:00
Stefan Schweter	94e55253ae	tests: add test case for DistilBertForTokenClassification implementation	2019-11-11 16:20:15 +01:00
Stefan Schweter	1806eabf59	module: add DistilBertForTokenClassification import	2019-11-11 16:18:48 +01:00

1 2 3 4 5

227 Commits