HuggingFace_transformer

Author	SHA1	Message	Date
Zhangyx	3a5d1ea2a5	Fix two bugs: 1. Index of test data of SST-2. 2. Label index of MNLI data. (#4546)	2020-05-29 11:12:24 -04:00
Zhangyx	49296533ca	Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463) * Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website. * Use Split enum + always output the label name Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-21 09:17:44 -04:00
Lysandre Debut	7defc6670f	p_mask in SQuAD pre-processing (#4049) * Better p_mask building * Addressing @mfuntowicz comments	2020-05-14 17:07:52 -04:00
Julien Chaumond	448c467256	Fix: unpin flake8 and fix cs errors (#4367) * Fix: unpin flake8 and fix cs errors * Ok we still need to quote those	2020-05-14 13:14:26 -04:00
Julien Chaumond	c547f15a17	Use Filelock to ensure distributed barriers (see context in https://github.com/huggingface/transformers/pull/4223)	2020-05-14 11:58:32 -04:00
Julien Chaumond	7b75aa9fa5	[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) * [TPU] Doc, fix xla_spawn.py, only preprocess dataset once * Update examples/README.md * [xla_spawn] Add `_mp_fn` to other Trainer scripts * [TPU] Fix: eval dataloader was None	2020-05-08 14:10:05 -04:00
peterandluc	8e093e5981	Remove 50k limits bug	2020-04-23 11:15:09 -04:00
Julien Chaumond	dd9d483d03	Trainer (#3800) * doc * [tests] Add sample files for a regression task * [HUGE] Trainer * Feedback from @sshleifer * Feedback from @thomwolf + logging tweak * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes * [glue] Use default max_seq_length of 128 like before * [glue] move DataTrainingArguments around * [ner] Change interface of InputExample, and align run_{tf,pl} * Re-align the pl scripts a little bit * ner * [ner] Add integration test * Fix language_modeling with API tweak * [ci] Tweak loss target * Don't break console output * amp.initialize: model must be on right device before * [multiple-choice] update for Trainer * Re-align to `827d6d6ef0`	2020-04-21 20:11:56 -04:00
Funtowicz Morgan	2c05b8a56c	Remove tqdm logging when using pipelines. (#3833) Introduce tqdm_enabled parameter on squad_convert_examples_to_features() default to True and set to False in QA pipelines.	2020-04-20 22:58:52 +02:00
Jared T Nielsen	c79b550dd0	Add `qas_id` to SquadResult and SquadExample (#3745) * Add qas_id * Fix incorrect name in squad.py * Make output files optional for squad eval	2020-04-20 16:08:57 -04:00
Julien Chaumond	f98d0ef2a2	Big cleanup of `glue_convert_examples_to_features` (#3688) * Big cleanup of `glue_convert_examples_to_features` * Use batch_encode_plus * Cleaner wrapping of glue_convert_examples_to_features for TF @lysandrejik * Cleanup syntax, thanks to @mfuntowicz * Raise explicit error in case of user error	2020-04-10 10:20:18 -04:00
Julien Chaumond	cc598b312b	[InputExample] Unfreeze for now, cf. #3423	2020-03-30 10:41:49 -04:00
Lysandre Debut	ffcffebe85	Force the return of token type IDs (#3439)	2020-03-26 09:41:36 +01:00
Julien Chaumond	83272a3853	Experiment w/ dataclasses (including Py36) (#3423) * [ci] Also run test_examples in py37 (will revert at the end of the experiment) * InputExample: use immutable dataclass * [deps] Install dataclasses for Py<3.7 * [skip ci] Revert "[ci] Also run test_examples in py37" This reverts commit d29afd9959786b77759b0b8fa4e6b4335b952015.	2020-03-25 11:10:20 -04:00
Serkan Karakulak	b2c2c31c60	Minor Bug Fix for Running Roberta on Glue (#3240) * added return_token_type_ids argument for tokenizers which do not generate return_type_ids by default * fixed styling * Style Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>	2020-03-19 12:08:31 -04:00
maximeilluin	c749a543fa	Added CamembertForQuestionAnswering (#2746) * Added CamembertForQuestionAnswering * fixed camembert tokenizer case	2020-02-21 12:01:02 -05:00
Scott Gigante	ea8eba35e2	Fix InputExample docstring (#2891)	2020-02-20 15:25:15 -05:00
jiyeon	bed38d3afe	Fix typo in src/transformers/data/processors/squad.py	2020-02-11 11:22:24 -05:00
Lysandre	125a75a121	Correctly compute tokens when padding on the left	2020-02-10 10:47:42 -05:00
Lysandre	15579e2d55	[SQuAD v2] Code quality	2020-01-21 11:36:46 -05:00
Lysandre	073219b43f	Manage impossible examples SQuAD v2	2020-01-21 11:24:43 -05:00
James Betker	cefd51c50c	Fix glue processor failing on tf datasets	2020-01-20 11:46:43 -05:00
Nafise Sadat Moosavi	99d4515572	HANS evaluation	2020-01-16 13:21:30 +01:00
alberduris	81d6841b4b	GPU text generation: Moved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Lysandre Debut	1efc208ff3	Complete DataProcessor class	2020-01-06 15:02:25 +01:00
Simone Primarosa	c45d0cf60f	Improve logging message in the single sentence classification processor	2020-01-06 14:54:36 +01:00
Simone Primarosa	bf89be77b9	Improve logging message in the single sentence classification processor	2020-01-06 14:54:36 +01:00
Simone Primarosa	bf8d4bc674	Improve logging message in glue feature conversion	2020-01-06 14:54:36 +01:00
Aymeric Augustin	71f94a8a1c	Remove unused variables in src.	2019-12-23 22:38:09 +01:00
Aymeric Augustin	c8b0c1e551	Improve exception type. ImportError isn't really appropriate when there's no import involved.	2019-12-23 21:27:38 +01:00
Aymeric Augustin	5565dcdd35	Remove warning when scikit-learn isn't available. Most users don't need it.	2019-12-23 21:16:26 +01:00
Aymeric Augustin	1c62e87b34	Use built-in open(). On Python 3, `open is io.open`.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	798b3b3899	Remove sys.version_info[0] == 2 or 3.	2019-12-22 18:38:42 +01:00
Aymeric Augustin	c824d15aa1	Remove __future__ imports.	2019-12-22 17:47:54 +01:00
Aymeric Augustin	6be7cdda66	Move source code inside a src subdirectory. This prevents transformers from being importable simply because the CWD is the root of the git repository, while not being importable from other directories. That led to inconsistent behavior, especially in examples. Once you fetch this commit, in your dev environment, you must run `pip uninstall transformers`, then `pip install -e .`	2019-12-22 14:15:13 +01:00

36 Commits