Fix license (#26931)

This commit is contained in:
Mohamed Aymane Farhi
2023-10-19 14:36:41 +01:00
committed by GitHub
parent ad08137e47
commit 73dc23f786

View File

@@ -22,7 +22,7 @@ The Fuyu model was created by [ADEPT](https://www.adept.ai/blog/fuyu-8b), and au
The authors introduced Fuyu-8B, a decoder-only multimodal model based on the classic transformers architecture, with query and key normalization. A linear encoder is added to create multimodal embeddings from image inputs. The authors introduced Fuyu-8B, a decoder-only multimodal model based on the classic transformers architecture, with query and key normalization. A linear encoder is added to create multimodal embeddings from image inputs.
By treating image tokens like text tokens and using a special image-newline character, the model knows when an image line ends. Image positional embeddings are removed. This avoids the need for different training phases for various image resolutions. With 8 billion parameters and licensed under Apache, Fuyu-8B is notable for its ability to handle both text and images, its impressive context size of 16K, and its overall performance. By treating image tokens like text tokens and using a special image-newline character, the model knows when an image line ends. Image positional embeddings are removed. This avoids the need for different training phases for various image resolutions. With 8 billion parameters and licensed under CC-BY-NC, Fuyu-8B is notable for its ability to handle both text and images, its impressive context size of 16K, and its overall performance.
<Tip warning={true}> <Tip warning={true}>
@@ -112,4 +112,4 @@ The `LlamaTokenizer` is used as it is a standard wrapper around sentencepiece.
## FuyuProcessor ## FuyuProcessor
[[autodoc]] FuyuProcessor [[autodoc]] FuyuProcessor
- __call__ - __call__