Files
HuggingFace_transformer/docs/source/en/model_doc
Aritra Roy Gosthipaty 965e98dc54 [Port] TensorFlow implementation of Mistral (#29708)
* chore: initial commit

* chore: adding imports and inits

* chore: adding the causal and classification code

* chore: adding names to the layers

* chore: using single self attn layer

* chore: built the model and layers

* chore: start with testing

* chore: docstring change, transpose fix

* fix: rotary embedding

* chore: adding cache implementation

* remove unused torch

* chore: fixing the indexing issue

* make fix-copies

* Use modeling_tf_utils.keras

* make fixup

* chore: fixing tests

* chore: adding past key value logic

* chore: adding multi label classfication test

* fix: switching on the built parameters in the layers

* fixing repo consistency

* ruff formats

* style changes

* fix: tf and pt equivalence

* removing returns from docstrings

* fix docstrings

* fix docstrings

* removing todos

* fix copies

* fix docstring

* fix docstring

* chore: using easier rotate_half

* adding integration tests

* chore: addressing review related to rotary embedding layer

* review changes

* [run-slow] mistral

* skip: test save load after resize token embedding

* style

---------

Co-authored-by: Matt <rocketknight1@gmail.com>
2024-05-23 17:48:49 +01:00
..
2024-04-26 16:23:44 +01:00
2023-11-06 19:45:03 +00:00
2023-11-10 13:49:10 +00:00
2024-03-15 14:29:11 +01:00
2024-04-26 20:52:24 +01:00
2023-11-23 17:44:08 +00:00
2023-11-23 17:44:08 +00:00
2024-03-12 10:16:21 +00:00
2024-05-21 19:38:02 +02:00
2024-05-13 15:59:46 +01:00
2024-04-18 11:04:02 +02:00
2024-05-14 16:32:01 +02:00
2023-10-30 21:42:19 +01:00
2023-12-20 14:25:07 +05:30
2024-05-10 12:40:57 +01:00
2024-04-17 17:59:07 +02:00
2024-05-14 08:36:11 +02:00
2024-05-14 22:07:15 +02:00
2024-02-19 15:22:29 +01:00
2024-02-19 15:22:29 +01:00
2024-03-13 19:05:20 +00:00
2023-11-06 19:45:03 +00:00
2023-07-13 11:46:54 -04:00
2024-04-19 21:03:07 +02:00
2024-02-19 15:22:29 +01:00
2024-04-19 18:31:43 +01:00
2023-11-06 19:45:03 +00:00
2023-11-23 17:02:16 +00:00
2023-12-15 20:16:47 +01:00