Mike Arpaia
|
8b5c63e4de
|
Fixes to the TensorFlow conversion tool
|
2019-04-01 13:17:54 -06:00 |
|
Thomas Wolf
|
694e2117f3
|
Merge pull request #388 from ananyahjha93/master
Added remaining GLUE tasks to 'run_classifier.py'
|
2019-03-28 09:06:53 +01:00 |
|
Thomas Wolf
|
cc8c2d2332
|
Merge pull request #396 from IndexFziQ/IndexFziQ
add tqdm to the process of eval in examples/run_swag.py
|
2019-03-27 12:03:26 +01:00 |
|
thomwolf
|
361aff6de5
|
typos
|
2019-03-27 11:54:59 +01:00 |
|
thomwolf
|
cea8ba1d59
|
adjusted formatting and some wording in the readme
|
2019-03-27 11:53:44 +01:00 |
|
Matthew Carrigan
|
24e67fbf75
|
Minor README update
|
2019-03-25 12:33:30 +00:00 |
|
Matthew Carrigan
|
8d1d1ffde2
|
Corrected the displayed loss when gradient_accumulation_steps > 1
|
2019-03-25 12:15:19 +00:00 |
|
Matthew Carrigan
|
abb7d1ff6d
|
Added proper context management to ensure cleanup happens in the right
order.
|
2019-03-21 17:50:03 +00:00 |
|
Matthew Carrigan
|
06a30cfdf3
|
Added a --reduce_memory option to the training script to keep training
data on disc as a memmap rather than in memory
|
2019-03-21 17:04:12 +00:00 |
|
Matthew Carrigan
|
7d1ae644ef
|
Added a --reduce_memory option to the training script to keep training
data on disc as a memmap rather than in memory
|
2019-03-21 17:02:18 +00:00 |
|
Matthew Carrigan
|
2bba7f810e
|
Added a --reduce_memory option to shelve docs to disc instead of keeping them in memory.
|
2019-03-21 16:50:16 +00:00 |
|
Matthew Carrigan
|
8733ffcb5e
|
Removing a couple of other old unnecessary comments
|
2019-03-21 14:09:57 +00:00 |
|
Matthew Carrigan
|
8a861048dd
|
Fixed up the notes on a possible future low-memory path
|
2019-03-21 14:08:39 +00:00 |
|
Matthew Carrigan
|
a8a577ba93
|
Reduced memory usage for pregenerating the data a lot by writing it
out on the fly without shuffling - the Sampler in the finetuning script
will shuffle for us.
|
2019-03-21 14:05:52 +00:00 |
|
Matthew Carrigan
|
0ae59e662d
|
Reduced memory usage for pregenerating the data a lot by writing it
out on the fly without shuffling - the Sampler in the finetuning script
will shuffle for us.
|
2019-03-21 14:04:17 +00:00 |
|
Matthew Carrigan
|
6a9038ba53
|
Removed an old irrelevant comment
|
2019-03-21 13:36:41 +00:00 |
|
Yuqiang Xie
|
77944d1b31
|
add tqdm to the process of eval
Maybe better.
|
2019-03-21 20:59:33 +08:00 |
|
Matthew Carrigan
|
29a392fbcf
|
Small README changes
|
2019-03-20 17:35:17 +00:00 |
|
Matthew Carrigan
|
832b2b0058
|
Adding README
|
2019-03-20 17:31:49 +00:00 |
|
Matthew Carrigan
|
934d3f4d2f
|
Syncing up argument names between the scripts
|
2019-03-20 17:23:23 +00:00 |
|
Matthew Carrigan
|
f19ba35b2b
|
Move old finetuning script into the new folder
|
2019-03-20 16:47:06 +00:00 |
|
Matthew Carrigan
|
7de5c6aa5e
|
PEP8 and formatting cleanups
|
2019-03-20 16:44:04 +00:00 |
|
Matthew Carrigan
|
1798e98e5a
|
Added final TODOs
|
2019-03-20 16:42:37 +00:00 |
|
Matthew Carrigan
|
c64c2fc4c2
|
Fixed embarrassing indentation problem
|
2019-03-20 15:42:57 +00:00 |
|
Matthew Carrigan
|
0540d360f2
|
Fixed logging
|
2019-03-20 15:36:51 +00:00 |
|
Matthew Carrigan
|
976554a472
|
First commit of the new LM finetuning
|
2019-03-20 14:23:51 +00:00 |
|
Ananya Harsh Jha
|
e5b63fb542
|
Merge branch 'master' of https://github.com/ananyahjha93/pytorch-pretrained-BERT
pull current master to local
|
2019-03-17 08:30:13 -04:00 |
|
Ananya Harsh Jha
|
8a4e90ff40
|
corrected folder creation error for MNLI-MM, verified GLUE results
|
2019-03-17 08:16:50 -04:00 |
|
Ananya Harsh Jha
|
e0bf01d9a9
|
added hack for mismatched MNLI
|
2019-03-16 14:10:48 -04:00 |
|
Ananya Harsh Jha
|
4c721c6b6a
|
added eval time metrics for GLUE tasks
|
2019-03-15 23:21:24 -04:00 |
|
tseretelitornike
|
83857ffeaa
|
Added missing imports.
|
2019-03-15 12:45:48 +01:00 |
|
Yongbo Wang
|
d1e4fa98a9
|
typo in annotation
modify `heruistic` to `heuristic` in line 660, `charcter` to `character` in line 661.
|
2019-03-14 17:32:15 +08:00 |
|
Yongbo Wang
|
3d6452163d
|
typo
modify `mull` to `null` in line 474 annotation.
|
2019-03-14 17:03:38 +08:00 |
|
thomwolf
|
a98dfe4ced
|
fixing #377 (empty nbest_predictions.json)
|
2019-03-14 09:57:06 +01:00 |
|
Ananya Harsh Jha
|
043c8781ef
|
added code for all GLUE task processors
|
2019-03-14 04:24:04 -04:00 |
|
Yongbo Wang
|
22a465a91f
|
Simplify code, delete redundant line
delete redundant line `if args.train`, simplify code.
|
2019-03-13 09:42:06 +08:00 |
|
Elon Musk
|
66d8206809
|
Update run_gpt2.py
|
2019-03-08 11:59:08 -05:00 |
|
thomwolf
|
7cc35c3104
|
fix OpenAI GPT example and update readme
|
2019-03-06 11:43:21 +01:00 |
|
thomwolf
|
994d86609b
|
fixing PYTORCH_PRETRAINED_BERT_CACHE use in examples
|
2019-03-06 10:21:24 +01:00 |
|
thomwolf
|
5c85fc3977
|
fix typo - logger info
|
2019-03-06 10:05:21 +01:00 |
|
Thomas Wolf
|
8e36da7acb
|
Merge pull request #347 from jplehmann/feature/sst2-processor
Processor for SST-2 task
|
2019-03-06 09:48:27 +01:00 |
|
Thomas Wolf
|
3c01dfb775
|
Merge pull request #338 from CatalinVoss/patch-3
Fix top k generation for k != 0
|
2019-03-06 09:47:33 +01:00 |
|
John Lehmann
|
0f96d4b1f7
|
Run classifier processor for SST-2.
|
2019-03-05 13:38:28 -06:00 |
|
Catalin Voss
|
4b4b079272
|
Fix top k generation for k != 0
|
2019-03-02 21:54:44 -08:00 |
|
Catalin Voss
|
c0cf0a04d5
|
Fix typo
|
2019-02-27 18:01:06 -08:00 |
|
Ben Johnson
|
8607233679
|
Update run_openai_gpt.py
|
2019-02-20 13:58:54 -05:00 |
|
thomwolf
|
0202da0271
|
remove unnecessary example
|
2019-02-18 13:51:42 +01:00 |
|
thomwolf
|
690a0dbf36
|
fix example - masking
|
2019-02-18 10:50:30 +01:00 |
|
thomwolf
|
fbb248a2e4
|
examples testing
|
2019-02-18 01:28:18 +01:00 |
|
thomwolf
|
b65f07d8c0
|
adding examples
|
2019-02-18 00:55:33 +01:00 |
|