HuggingFace_transformer

Author	SHA1	Message	Date
Younes Belkada	163ac3d3ee	Add Switch transformers (#19323 ) * first commit * add more comments * add router v1 * clean up - remove `tf` modeling files * clean up - remove `tf` modeling files * clean up * v0 routers * added more router - Implemented `ExpertsChooseMaskedRouter` - added tests - 2 more routers to implement * last router * improved docstring - completed the docstring in `router.py` - added more args in the config * v0 sparse mlp * replace wrong naming * forward pass run * update MOE layer * small router update * fixup * consistency * remove scatter router * remove abstract layer * update test and model for integration testing * v1 conversion * update * hardcode hack * all keys match * add gin conversion, without additional libraries * update conversion sctipy * delete router file * update tests wrt router deletion * fix router issues * update expert code * update, logits match, code needsREFACTORING * Refactor code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * add generate tests Co-authored-by: younesbelkada <younesbelkada@gmail.com> * add support for router loss Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fix forward error * refactor a bit * remove `FlaxSwitchTransformers` modules * more tests pass * Update code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fixup * fix tests * fix doc * fix doc + tokenization * fix tokenizer test * fix test * fix loss output * update code for backward pass * add loss support * update documentation * fix documentation, clean tokenizer * more doc fix, cleanup example_switch * fix failing test * fix test * fix test * fix loss issue * move layer * update doc and fix router capacity usage * fixup * add sparse mlp index for documentation on hub * fixup * test sparse mix architecture * Apply suggestions from code review * Update docs/source/en/model_doc/switch_transformers.mdx * fixup on update * fix tests * fix another test * attempt fix * 
Update src/transformers/models/switch_transformers/configuration_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * try * all tests pass * fix jitter noise * Apply suggestions from code review * doc tests pass * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * remove assert * change config order * fix readme japanese * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove parallelizable tests + add one liners * remove ONNX config * fix nits - add `T5Tokenizer` in auto mapping - remove `Switch Transformers` from ONNX supported models * remove `_get_router` * remove asserts * add check in test for `router_dtype` * add `SwitchTransformersConfig` in `run_pipeline_test` * Update tests/pipelines/test_pipelines_summarization.py * add huge model conversion script * fix slow tests - add better casting for `Linear8bitLt` - remove `torchscript` tests * add make dir * style on new script * fix nits - doctest - remove `_keys_to_ignore_on_load_unexpected` * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py * add google as authors * fix year * remove last `assert` statements * standardize vertical spaces * fix failing import * fix another failing test * Remove strange `authorized_keys` * removing todo and padding that is never used Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: ybelkada <younes@huggingface.co> Co-authored-by: Younes Belkada 
<younesbelkada@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur Zucker <arthur@huggingface.co>	2022-11-15 13:06:45 +01:00
Matthijs Hollemans	f711d683b5	add MobileNetV2 model (#17845 ) * add model files etc for MobileNetV2 * rename files for MobileNetV1 * initial implementation of MobileNetV1 * fix conversion script * cleanup * write docs * tweaks * fix conversion script * extract hidden states * fix test cases * make fixup * fixup it all * rename V1 to V2 * fix checkpoints * fixup * implement first block + weight conversion * add remaining layers * add output stride and dilation * fixup * add tests * add deeplabv3+ head * a bit of fixup * finish deeplab conversion * add link to doc * fix issue with JIT trace in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value. * cleanup * fix order of models * fix rebase error * remove main from doc link * add image processor * remove old feature extractor * fix converter + other issues * fixup * fix unit test * add to onnx tests (but these appear broken now) * add post_process_semantic_segmentation * use google org * remove unused imports * move args * replace weird assert	2022-11-14 01:00:10 -05:00
Arthur	61a51f5f23	Add Jukebox model (replaces #16875 ) (#17826 )	2022-11-10 21:05:27 +01:00
NielsRogge	9f0c72f93b	Add doc tests (#20158 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>	2022-11-10 15:25:30 +01:00
Joao Gante	f270b960d6	Generate: move generation_.py src files into generation/.py (#20096 ) * move generation_.py src files into generation/.py * populate generation.__init__ with lazy loading * move imports and references from generation.xxx.object to generation.object	2022-11-09 15:34:08 +00:00
Yih-Dar	316bf04d3d	Improve tiny model creation script (#20119 ) * Improve tiny model creation script * sort the list of models to upload Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-09 11:34:35 +01:00
amyeroberts	4eb918e656	AutoImageProcessor (#20111 ) * AutoImageProcessor skeleton * Update references * Add mapping in init * Add model image processors to __init__ for importing * Add AutoImageProcessor tests * Fix up * Image Processor documentation * Remove pdb * Update docs/source/en/model_doc/mobilevit.mdx * Update docs * Don't add whitespace on json files * Remove fixtures * Move checking model config down * Fix up * Add check for image processor * Remove FeatureExtractorMixin in docstrings * Rename model_tmpfile to config_tmpfile * Don't make None if not in image processor map	2022-11-08 19:54:41 +00:00
Weiwe Shi	efa889d2e4	Add RocBert (#20013 ) * add roc_bert * update roc_bert readme * code style * change name and delete unuse file * udpate model file * delete unuse log file * delete tokenizer fast * reformat code and change model file path * add RocBertForPreTraining * update docs * delete wrong notes * fix copies * fix make repo-consistency error * fix files are not present in the table of contents error * change RocBert -> RoCBert * add doc, add detail test Co-authored-by: weiweishi <weiweishi@tencent.com>	2022-11-08 10:03:43 -05:00
NielsRogge	258963062b	Add CLIPSeg (#20066 ) * Add first draft * Update conversion script * Improve conversion script * Improve conversion script some more * Add conditional embeddings * Add initial decoder * Fix activation function of decoder * Make decoder outputs match original implementation * Make decoder outputs match original implementation * Add more copied from statements * Improve model outputs * Fix auto tokenizer file * Fix more tests * Add test * Improve README and docs, improve conditional embeddings * Fix more tests * Remove print statements * Remove initial embeddings * Improve conversion script * Add interpolation of position embeddings * Finish addition of interpolation of position embeddings * Add support for refined checkpoint * Fix refined checkpoint * Remove unused parameter * Improve conversion script * Add support for training * Fix conversion script * Add CLIPSegFeatureExtractor * Fix processor * Fix CLIPSegProcessor * Fix conversion script * Fix most tests * Fix equivalence test * Fix README * Add model to doc tests * Use better variable name * Convert other checkpoint as well * Update config, add link to paper * Add docs * Update organization * Replace base_model_prefix with clip * Fix base_model_prefix * Fix checkpoint of config * Fix config checkpoint * Remove file * Use logits for output * Fix tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-08 10:55:47 +01:00
Saad Mahmud	9617b1304e	[Doctest] Add configuration_dpr.py (#20080 ) * Add example docstring for DPRConfig * Add DPRConfig to documentation_tests	2022-11-07 14:49:59 +01:00
Saad Mahmud	790ff2544a	[Doctest] Add configuration_camembert.py (#20039 ) * Add example docstring for CamembertConfig * Add configuration_camembert to documentation_tests	2022-11-03 14:50:42 +01:00
Saad Mahmud	7487743793	[Doctest] Add configuration_deberta_v2.py (#19995 ) * Add example docstring for DebertaV2Config * Add DebertaV2Config to documentation_tests * Fix mistake with directory name	2022-11-02 16:22:11 +01:00
Ikko Ashimine	38e5b71abb	Add Japanese translated README (#19945 ) * Add japanese translated README.md * Add README_ja.md link * Add japanese transkate to check_copies.py * Add guide to Japanese README.md * Update README_ja.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update utils/check_copies.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-11-01 09:18:08 -04:00
Matt	7f9b7b3f0e	Add ESMFold (#19977 ) * initial commit * First draft that gets outputs without crashing! * Add all the ported openfold dependencies * testing * Restructure config files for ESMFold * Debugging to find output discrepancies * Mainly style * Make model runnable without extra deps * Remove utils and merge them to the modeling file * Use correct gelu and remove some debug prints * More cleanup * Update esm docs * Update conversion script to support ESMFold properly * Port some top-level changes from ESMFold repo * Expand EsmFold docstrings * Make attention_mask optional (default to all 1s) * Add inference test for ESMFold * Use config and not n kwargs * Add modeling output class * Remove einops * Remove chunking in ESM FFN * Update tests for ESMFold * Quality * REpo consistency * Remove tree dependency from ESMFold * make fixup * Add an error in case my structure map function breaks later * Remove needless code * Stop auto-casting the LM to float16 so CPU tests pass * Stop auto-casting the LM to float16 so CPU tests pass * Final test updates * Split test file * Copyright and quality * Unpin PyTorch to see built doc * Fix config file to_dict() method * Add some docstrings to the output * Skip TF checkpoint tests for ESM until we reupload those * make fixup * More docstrings * Unpin to get even with main * Flag example to write Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>	2022-10-31 21:32:58 -04:00
Saad Mahmud	6176e13612	[Doctest] Add configuration_deberta.py (#19968 ) * Add Example docstring to DebertaConfig * Add configuration_deberta to documentation_tests * Add microsoft/deberta-base to example docstring * Fix example docstring mistake	2022-10-31 17:22:01 +01:00
Yih-Dar	dcca71be61	Create dummy models (#19901 ) * create dummy models * quality * update * update * Make Wav2Vec2Conformer work * style * deal with models with text_config and vision_config * apply suggestions * Composite models * style * style * fix shape issue * fix shape issue * For VisionTextDualEncoderModel * show_progress=False when converting tokenizers * Fix for OwlViT * Fix for VisualBert * Update * final Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-28 13:05:41 +02:00
Yih-Dar	803475fb69	Add checkpoint links in a few config classes (#19910 ) * For CLIP * Others * update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-27 09:26:10 +02:00
Revanth09	1f1cc09df6	[DOCTEST] Config doctest for `MCTCT`, `MBart` and `LayoutLM` (#19889 ) * Update documentation_tests.txt * Update configuration_mbart.py * Update configuration_mctct.py * Update configuration_layoutlm.py * Update configuration_layoutlmv2.py * Update configuration_layoutlmv3.py * Update documentation_tests.txt	2022-10-26 12:05:44 +02:00
Rak Alexey	1f7e40d04f	Improve check copies (#19829 ) * print first diff line intead of first code part line * fix style	2022-10-24 11:24:18 -04:00
Ayaka	536f338441	[Doctest] Add `configuration_nezha.py` (#19810 ) * [Doctest] Add `configuration_nezha.py` * Revert line order	2022-10-24 13:50:43 +02:00
Ayaka	f58b211ed3	[Doctest] Add `configuration_electra.py` (#19807 )	2022-10-24 12:34:43 +02:00
Ayaka	c949188b9d	[Doctest] Add `configuration_poolformer.py` (#19808 )	2022-10-24 12:33:46 +02:00
Ayaka	82df83a96b	[Doctest] Add `configuration_plbart.py` (#19809 ) Additionally, I updated the doctest format to be consistent with BERT.	2022-10-24 12:32:55 +02:00
sha016	22502ebb85	[Doctest] `MaskFormerConfig` doctest (#19817 )	2022-10-24 11:08:32 +02:00
Rohith ND	eb98da9880	[Doctest] OpenAIGPTConfig and OPTConfig (#19763 )	2022-10-20 10:22:00 +02:00
Partho	506355ca75	[Doctest] SpeechToTextTransformer2 Config for doctest (#19756 )	2022-10-20 10:19:06 +02:00
Partho	123f65eea6	[Doctest] SqueezeBERT Config for doctest (#19758 )	2022-10-20 10:16:39 +02:00
Partho	cc03063366	[Doctest] SpeechToTextTransformer Config for doctest (#19757 )	2022-10-20 10:15:07 +02:00
Xabier Lahuerta Vazquez	0a03741590	[Doctest] Add `configuration_detr.py` (#19752 )	2022-10-19 18:13:34 +02:00
Xabier Lahuerta Vazquez	65d36ee861	[Doctest] Add `configuration_decision_transformer.py` (#19751 )	2022-10-19 18:12:34 +02:00
JuanCarlosPi	c206fc8779	[Doctest] Add `configuration_wavlm.py` (#19749 ) * Change the import order of the model and configuration classes * Add (with random weights) in the comment before model initialization * Add configuration_wavlm to doctest	2022-10-19 16:10:13 +02:00
Rohith ND	7df0751cc6	[Doctest] GPTNeoConfig , GPTNeoXConfig , GPTNeoXJapaneseConfig (#19741 )	2022-10-19 14:22:41 +02:00
Partho	fc5fdc109d	[Doctest] Add `configuration_clip.py` (#19647 ) * CLIP Config for doctest * add doc example to CLIPConfig * add from_text_vision_configs example * added comment explaining objective	2022-10-19 09:51:26 +02:00
AymenBer99	c9a0da1e12	[Doctest] XLM Config for doctest (#19685 )	2022-10-19 07:10:30 +02:00
JuanCarlosPi	eccbdbcd4d	[Doctest] Add wav2vec2_conformer for doctest (#19734 )	2022-10-19 06:47:41 +02:00
Sylvain Gugger	ebee0a2794	Remove debug statement	2022-10-18 13:58:09 -04:00
Rohith ND	fa8ed9ca76	[Doctest] Add doctest for `FlavaConfig` and `FNetConfig` (#19724 )	2022-10-18 19:56:49 +02:00
Sylvain Gugger	a929f81e92	Repo utils test (#19696 ) * Create repo utils test job * Last occurence * Add tests for tests_fetcher * Better filtering * Let's learn more * Should fix * Should fix * Remove debug * Style * WiP WiP WiP WiP WiP WiP WiP WiP WiP * Quality * address review comments * Fix link	2022-10-18 13:47:36 -04:00
Mukesh K	f3ed26a3fb	[Doctest] Fixing doctest `configuration_pegasus_x.py` (#19725 ) * Fixed pegasus_x config doctest * Test commit Co-authored-by: mukesh663 <mukesh13034@gmail.com>	2022-10-18 17:19:31 +02:00
Somasree Majumder	5864051109	[Doctest] Adding config files for convnext (#19717 ) * Adding config files for configuration_clip.py * Adding config files for convnext * Undoing * making the required changes * Update documentation_tests.txt	2022-10-18 17:10:09 +02:00
Mukesh K	fac1f4b188	Fixed pegasus config doctest (#19722 ) Co-authored-by: mukesh663 <mukesh13034@gmail.com>	2022-10-18 15:38:57 +02:00
NielsRogge	dd523da577	Add table transformer [v2] (#19614 ) * First draft * Add conversion script * Make conversion work * Upload checkpoints * Add final fixes * Revert changes of conditional and deformable detr * Fix toctree, add and remove copied from * Use model type * Improve docs * Improve code example * Update copies * Add copied formt * Don't update conditional detr * Don't update deformable detr	2022-10-18 15:20:09 +02:00
JuanCarlosPi	fd99ce3329	[Doctest] Add configuration_wav2vec2.py to documentation_tests.py (#19698 )	2022-10-18 14:57:34 +02:00
AymenBer99	8fcbbd3d53	[Doctest] CVT config for doctest (#19695 )	2022-10-18 14:55:56 +02:00
Partho	2add2007c1	[Doctest] Add `configuration_data2vec_vision.py` (#19637 ) * Data2Vec Vision Config for doctest * made suggested changes Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2022-10-17 20:56:42 +02:00
Akshita Prasanth	684165b882	[Doctest] Add configuration_realm.py (#19646 ) * Update configuration_realm.py * realm config for doctest * Update configuration_realm.py doc * Update documentation_tests * clean up Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-17 20:53:24 +02:00
AymenBer99	5ac2f82267	[Doctest] Add configuration_convbert.py (#19643 ) * ConvBERT config for doctest * Add empty lines	2022-10-17 20:29:18 +02:00
Partho	4d77f18cba	[Doctest] Data2VecAudio Config for doctest (#19635 )	2022-10-17 18:39:15 +02:00
Thomas	0027edf905	[Doctest] Add configuration_transfo_xl.py (#19651 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-17 16:47:54 +02:00
Thomas	9bb26f2505	[Doctest] Add `configuration_trocr.py` (#19658 ) * trocr Config for doctest * ran make style	2022-10-17 10:53:36 +02:00

483 Commits