HuggingFace_transformer

Author	SHA1	Message	Date
Iz Beltagy	ef03ae874f	[Longformer] more models + model cards (#4628 ) * adding freeze roberta models * model cards * lint	2020-05-28 11:11:05 +02:00
Patrick von Platen	96f57c9ccb	[Benchmark] Memory benchmark utils (#4198 ) * improve memory benchmarking * correct typo * fix current memory * check torch memory allocated * better pytorch function * add total cached gpu memory * add total gpu required * improve torch gpu usage * update memory usage * finalize memory tracing * save intermediate benchmark class * fix conflict * improve benchmark * improve benchmark * finalize * make style * improve benchmarking * correct typo * make train function more flexible * fix csv save * better repr of bytes * better print * fix __repr__ bug * finish plot script * rename plot file * delete csv and small improvements * fix in plot * fix in plot * correct usage of timeit * remove redundant line * remove redundant line * fix bug * add hf parser tests * add versioning and platform info * make style * add gpu information * ensure backward compatibility * finish adding all tests * Update src/transformers/benchmark/benchmark_args.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/benchmark/benchmark_args_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * delete csv files * fix isort ordering * add out of memory handling * add better train memory handling Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-05-27 23:22:16 +02:00
Suraj Patil	ec4cdfdd05	LongformerForSequenceClassification (#4580 ) * LongformerForSequenceClassification * better naming x=>hidden_states, fix typo in doc * Update src/transformers/modeling_longformer.py * Update src/transformers/modeling_longformer.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-05-27 22:30:00 +02:00
Lysandre Debut	6a17688021	per_device instead of per_gpu/error thrown when argument unknown (#4618 ) * per_device instead of per_gpu/error thrown when argument unknown * [docs] Restore examples.md symlink * Correct absolute links so that symlink to the doc works correctly * Update src/transformers/hf_argparser.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Warning + reorder * Docs * Style * not for squad Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-27 11:36:55 -04:00
Patrick von Platen	003c477129	[GPT2, CTRL] Allow input of input_ids and past of variable length (#4581 ) * revert convenience method * clean docs a bit	2020-05-26 19:43:58 +02:00
Bram Vanroy	8cc6807e89	Make transformers-cli cross-platform (#4131 ) * make transformers-cli cross-platform Using "scripts" is a useful option in setup.py particularly when you want to get access to non-python scripts. However, in this case we want to have an entry point into some of our own Python scripts. To do this in a concise, cross-platfom way, we can use entry_points.console_scripts. This change is necessary to provide the CLI on different platforms, which "scripts" does not ensure. Usage remains the same, but the "transformers-cli" script has to be moved (be part of the library) and renamed (underscore + extension) * make style & quality	2020-05-26 10:00:51 -04:00
Patrick von Platen	c589eae2b8	[Longformer For Question Answering] Conversion script, doc, small fixes (#4593 ) * add new longformer for question answering model * add new config as well * fix links * fix links part 2	2020-05-26 14:58:47 +02:00
ZhuBaohe	a163c9ca5b	[T5] Fix Cross Attention position bias (#4499 ) * fix * fix1	2020-05-26 08:57:24 -04:00
ZhuBaohe	1d69028989	fix (#4410 )	2020-05-26 08:51:28 -04:00
Sam Shleifer	b86e42e0ac	[ci] fix 3 remaining slow GPU failures (#4584 )	2020-05-25 19:20:50 -04:00
Patrick von Platen	3e3e552125	[Reformer] fix reformer num buckets (#4564 ) * fix reformer num buckets * fix * adapt docs * set num buckets in config	2020-05-25 16:04:45 -04:00
Elman Mansimov	3dea40b858	fixing tokenization of extra_id symbols in T5Tokenizer. Related to issue 4021 (#4353 )	2020-05-25 16:04:30 -04:00
Suraj Patil	5139733623	LongformerTokenizerFast (#4547 )	2020-05-25 16:03:55 -04:00
Sho Arora	adab7f8332	Add nn.Module as superclass (#4533 )	2020-05-25 15:29:33 -04:00
Suraj Patil	03d8527de0	Longformer for question answering (#4500 ) * added LongformerForQuestionAnswering * add LongformerForQuestionAnswering * fix import for LongformerForMaskedLM * add LongformerForQuestionAnswering * hardcoded sep_token_id * compute attention_mask if not provided * combine global_attention_mask with attention_mask when provided * update example in docstring * add assert error messages, better attention combine * add test for longformerForQuestionAnswering * typo * cast gloabl_attention_mask to long * make style * Update src/transformers/configuration_longformer.py * Update src/transformers/configuration_longformer.py * fix the code quality * Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-05-25 18:43:36 +02:00
Bharat Raghunathan	a34a9896ac	DOC: Fix typos in modeling_auto (#4534 )	2020-05-23 09:40:59 -04:00
Bijay Gurung	e19b978151	Add Type Hints to modeling_utils.py Closes #3911 (#3948 ) * Add Type Hints to modeling_utils.py Closes #3911 Add Type Hints to methods in `modeling_utils.py` Note: The coverage isn't 100%. Mostly skipped internal methods. * Reformat according to `black` and `isort` * Use typing.Iterable instead of Sequence * Parameterize Iterable by its generic type * Use typing.Optional when None is the default value * Adhere to style guideline * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-22 19:10:22 -04:00
Funtowicz Morgan	996f393a86	Warn the user about max_len being on the path to be deprecated. (#4528 ) * Warn the user about max_len being on the path to be deprecated. * Ensure better backward compatibility when max_len is provided to a tokenizer. * Make sure to override the parameter and not the actual instance value. * Format & quality	2020-05-22 18:08:30 -04:00
Sam Shleifer	ab44630db2	[Summarization Pipeline]: Fix default tokenizer (#4506 ) * Fix pipelines defaults bug * one liner * style	2020-05-22 17:49:45 -04:00
Julien Chaumond	2c1ebb8b50	Re-apply #4446 + add packaging dependency As discussed w/ @lysandrejik packaging is maintained by PyPA (the Python Packaging Authority), and should be lightweight and stable	2020-05-22 17:29:03 -04:00
Lysandre	e6aeb0d3e8	Style	2020-05-22 17:20:03 -04:00
Anthony MOI	35df911485	Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503 )	2020-05-22 12:45:10 -04:00
Lysandre	10d72390c0	Revert #4446 Since it introduces a new dependency Some checks failed GitHub-hosted runner / check_code_quality (push) Has been cancelled Details	2020-05-22 10:49:45 -04:00
Lysandre	e0db6bbd65	Release: v2.10.0	2020-05-22 10:37:44 -04:00
Frankie Liuzzi	bd6e301832	added functionality for electra classification head (#4257 ) * added functionality for electra classification head * unneeded dropout * Test ELECTRA for sequence classification * Style Co-authored-by: Frankie <frankie@frase.io> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-05-22 09:48:21 -04:00
Lysandre	a086527727	Unused Union should not be imported	2020-05-21 09:42:47 -04:00
Lysandre Debut	9d2ce253de	TPU hangs when saving optimizer/scheduler (#4467 ) * TPU hangs when saving optimizer/scheduler * Style * ParallelLoader is not a DataLoader * Style * Addressing @julien-c's comments	2020-05-21 09:18:27 -04:00
Zhangyx	49296533ca	Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463 ) * Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website. * Use Split enum + always output the label name Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-21 09:17:44 -04:00
Cola	eacea530c1	🚨 Remove warning of deprecation (#4477 ) Remove warning of deprecated overload of addcdiv_ Fix #4451	2020-05-20 16:48:29 -04:00
Julien Plu	fa2fbed3e5	Better None gradients handling in TF Trainer (#4469 ) * Better None gradients handling * Apply Style * Apply Style	2020-05-20 16:46:21 -04:00
Oliver Åstrand	e708bb75bf	Correct TF formatting to exclude LayerNorms from weight decay (#4448 ) * Exclude LayerNorms from weight decay * Include both formats of layer norm	2020-05-20 16:45:59 -04:00
Rens	49c06132df	pass on tokenizer to pipeline (#4489 )	2020-05-20 22:23:21 +02:00
Sam Shleifer	efbc1c5a9d	[MarianTokenizer] implement save_vocabulary and other common methods (#4389 )	2020-05-19 19:45:49 -04:00
Patrick von Platen	48c3a70b4e	[Longformer] Docs and clean API (#4464 ) * add longformer docs * improve docs	2020-05-19 21:52:36 +02:00
Patrick von Platen	aa925a52fa	[Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468 ) * fix gpu slow tests in pytorch * change model to device syntax	2020-05-19 21:35:04 +02:00
Sam Shleifer	07dd7c2fd8	[cleanup] test_tokenization_common.py (#4390 )	2020-05-19 10:46:55 -04:00
Iz Beltagy	8f1d047148	Longformer (#4352 ) * first commit * bug fixes * better examples * undo padding * remove wrong VOCAB_FILES_NAMES * License * make style * make isort happy * unit tests * integration test * make `black` happy by undoing `isort` changes!! * lint * no need for the padding value * batch_size not bsz * remove unused type casting * seqlen not seq_len * staticmethod * `bert` selfattention instead of `n2` * uint8 instead of bool + lints * pad inputs_embeds using embeddings not a constant * black * unit test with padding * fix unit tests * remove redundant unit test * upload model weights * resolve todo * simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_ * increase unittest coverage	2020-05-19 16:04:43 +02:00
Shaoyen	384f0eb2f9	Map optimizer to correct device after loading from checkpoint. (#4403 ) * Map optimizer to correct device after loading from checkpoint. * Make style test pass Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-18 23:16:05 -04:00
Julien Chaumond	bf14ef75f1	[Trainer] move model to device before setting optimizer (#4450 )	2020-05-18 23:13:33 -04:00
Julien Chaumond	5e7fe8b585	Distributed eval: SequentialDistributedSampler + gather all results (#4243 ) * Distributed eval: SequentialDistributedSampler + gather all results * For consistency only write to disk from world_master Close https://github.com/huggingface/transformers/issues/4272 * Working distributed eval * Hook into scripts * Fix #3721 again * TPU.mesh_reduce: stay in tensor space Thanks @jysohn23 * Just a small comment * whitespace * torch.hub: pip install packaging * Add test scenarii	2020-05-18 22:02:39 -04:00
Julien Chaumond	4c06893610	Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300 ) * Test case for #3936 * multigpu tests pass on pytorch 1.4.0 * Fixup * multigpu tests pass on pytorch 1.5.0 * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * rename multigpu to require_multigpu * mode doc	2020-05-18 20:34:50 -04:00
Rakesh Chada	9de4afa897	Make get_last_lr in trainer backward compatible (#4446 ) * makes fetching last learning late in trainer backward compatible * split comment to multiple lines * fixes black styling issue * uses version to create a more explicit logic	2020-05-18 20:17:36 -04:00
Funtowicz Morgan	ca4a3f4da9	Adding optimizations block from ONNXRuntime. (#4431 ) * Adding optimizations block from ONNXRuntime. * Turn off external data format by default for PyTorch export. * Correct the way use_external_format is passed through the cmdline args.	2020-05-18 20:32:33 +02:00
Patrick von Platen	d39bf0ac2d	better naming in tf t5 (#4401 )	2020-05-18 11:34:00 -04:00
Patrick von Platen	590adb130b	improve docstring (#4422 )	2020-05-18 11:31:35 -04:00
Patrick von Platen	026a5d0888	[T5 fp16] Fix fp16 in T5 (#4436 ) * fix fp16 in t5 * make style * refactor invert_attention_mask fn * fix typo	2020-05-18 17:25:58 +02:00
Patrick von Platen	a27c795908	fix (#4419 )	2020-05-18 15:51:40 +02:00
Mehrad Moradshahi	8581a670e3	[MbartTokenizer] save to sentencepiece.bpe.model (#4335 )	2020-05-18 08:54:04 -04:00
Lorenzo Ampil	18d233d525	Allow the creation of "entity groups" for NerPipeline #3548 (#3957 ) * Add index to be returned by NerPipeline to allow for the creation of * Add entity groups * Convert entity list to dict * Add entity to entity_group_disagg atfter updating entity gorups * Change 'group' parameter to 'grouped_entities' * Add unit tests for grouped NER pipeline case * Correct variable name typo for NER_FINETUNED_MODELS * Sync grouped tests to recent test updates	2020-05-17 09:25:17 +02:00
Julien Chaumond	3e0f062106	Fix addcmul_	2020-05-15 17:44:17 -04:00

1 2 3 4 5 ...

609 Commits