* Add TapexTokenizer * Improve docstrings and provide option to provide answer * Remove option for pretokenized inputs * Add TAPEX to README * Fix copies * Remove option for pretokenized inputs * Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification. * - Draft a README file for running the script and introducing some background. - Remove unused code lines in tabfact script. - Disable the deafult `pad_to_max_length` option which is memory-consuming. * * Support `as_target_tokenizer` function for TapexTokenizer. * Fix the do_lower_case behaviour of TapexTokenizer. * Add unit tests for target scenarios and cased/uncased scenarios for both source and target. * * Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function. * Fix typos in tapex example README. * * fix the evaluation script - remove the property `task_name` * * Make the label space more clear for tabfact tasks * * Using a new fine-tuning script for tapex-base on tabfact. * * Remove the lowercase code outside the tokenizer - we use the tokenizer to control whether do_lower_case * Guarantee the hyper-parameter can be run without out-of-memory on 16GB card and report the new reproduced number on wikisql * * Remove the default tokenizer_name option. * Provide evaluation command. * * Support for WikiTableQuestion dataset. * Fix a typo in README. * * Fix the datasets's key name in WikiTableQuestions * Run make fixup and move test to folder * Fix quality * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply some more suggestions from code review * Improve docstrings * Overwrite failing test * Improve comment in example scripts * Fix rebase * Add TAPEX to Auto mapping * Add TAPEX to auto config mappings * Put TAPEX higher than BART in auto mapping * Add TAPEX to doc tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain> Co-authored-by: SivilTaram <qianlxc@outlook.com> Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
44 lines
2.2 KiB
Plaintext
44 lines
2.2 KiB
Plaintext
docs/source/en/quicktour.mdx
|
|
docs/source/en/task_summary.mdx
|
|
docs/source/en/model_doc/speech_to_text.mdx
|
|
docs/source/en/model_doc/tapex.mdx
|
|
src/transformers/generation_utils.py
|
|
src/transformers/models/bart/modeling_bart.py
|
|
src/transformers/models/beit/modeling_beit.py
|
|
src/transformers/models/bigbird_pegasus/modeling_bigbird_pegasus.py
|
|
src/transformers/models/blenderbot/modeling_blenderbot.py
|
|
src/transformers/models/blenderbot_small/modeling_blenderbot_small.py
|
|
src/transformers/models/convnext/modeling_convnext.py
|
|
src/transformers/models/data2vec/modeling_data2vec_audio.py
|
|
src/transformers/models/deit/modeling_deit.py
|
|
src/transformers/models/dpt/modeling_dpt.py
|
|
src/transformers/models/glpn/modeling_glpn.py
|
|
src/transformers/models/hubert/modeling_hubert.py
|
|
src/transformers/models/marian/modeling_marian.py
|
|
src/transformers/models/mbart/modeling_mbart.py
|
|
src/transformers/models/pegasus/modeling_pegasus.py
|
|
src/transformers/models/plbart/modeling_plbart.py
|
|
src/transformers/models/poolformer/modeling_poolformer.py
|
|
src/transformers/models/resnet/modeling_resnet.py
|
|
src/transformers/models/roberta/modeling_roberta.py
|
|
src/transformers/models/roberta/modeling_tf_roberta.py
|
|
src/transformers/models/segformer/modeling_segformer.py
|
|
src/transformers/models/sew/modeling_sew.py
|
|
src/transformers/models/sew_d/modeling_sew_d.py
|
|
src/transformers/models/speech_encoder_decoder/modeling_speech_encoder_decoder.py
|
|
src/transformers/models/speech_to_text/modeling_speech_to_text.py
|
|
src/transformers/models/speech_to_text_2/modeling_speech_to_text_2.py
|
|
src/transformers/models/swin/modeling_swin.py
|
|
src/transformers/models/trocr/modeling_trocr.py
|
|
src/transformers/models/unispeech/modeling_unispeech.py
|
|
src/transformers/models/unispeech_sat/modeling_unispeech_sat.py
|
|
src/transformers/models/van/modeling_van.py
|
|
src/transformers/models/vilt/modeling_vilt.py
|
|
src/transformers/models/vision_encoder_decoder/modeling_vision_encoder_decoder.py
|
|
src/transformers/models/vit/modeling_vit.py
|
|
src/transformers/models/vit_mae/modeling_vit_mae.py
|
|
src/transformers/models/wav2vec2/modeling_wav2vec2.py
|
|
src/transformers/models/wav2vec2/tokenization_wav2vec2.py
|
|
src/transformers/models/wav2vec2_with_lm/processing_wav2vec2_with_lm.py
|
|
src/transformers/models/wavlm/modeling_wavlm.py
|