Map model_type and doc pages names (#14944)
* Map model_type and doc pages names
* Add script
* Fix typo
* Quality
* Manual check for Auto

Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
@@ -40,7 +40,7 @@ Tips:
 
 - Demo notebooks for ImageGPT can be found
 [here](https://github.com/NielsRogge/Transformers-Tutorials/tree/master/ImageGPT).
-- ImageGPT is almost exactly the same as [GPT-2](./model_doc/gpt2), with the exception that a different activation
+- ImageGPT is almost exactly the same as [GPT-2](gpt2), with the exception that a different activation
 function is used (namely "quick gelu"), and the layer normalization layers don't mean center the inputs. ImageGPT
 also doesn't have tied input- and output embeddings.
 - As the time- and memory requirements of the attention mechanism of Transformers scales quadratically in the sequence