Removes images to put them in a dataset (#14781)
* First try
* Update instructions
@@ -248,7 +248,7 @@ Here are the commonly used floating point data types choice of which impacts bot
Here is a diagram that shows how these data types correlate to each other.
(source: [NVIDIA Blog](https://developer.nvidia.com/blog/accelerating-ai-training-with-tf32-tensor-cores/))
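
As a rough textual stand-in for the diagram, the sketch below tabulates the standard sign/exponent/mantissa bit splits of fp32, tf32, fp16 and bf16 and derives the largest finite value and the precision step from them. The `FloatFormat` class and the `FORMATS` table are hypothetical names introduced for this illustration only; they are not part of the documentation being changed or of any library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FloatFormat:
    """Bit layout of a binary floating point format (illustrative helper)."""

    sign_bits: int
    exponent_bits: int
    mantissa_bits: int

    @property
    def total_bits(self) -> int:
        return self.sign_bits + self.exponent_bits + self.mantissa_bits

    @property
    def max_finite(self) -> float:
        # Largest normal value: (2 - 2^-m) * 2^bias, with bias = 2^(e-1) - 1.
        bias = 2 ** (self.exponent_bits - 1) - 1
        return (2 - 2.0 ** -self.mantissa_bits) * 2.0 ** bias

    @property
    def epsilon(self) -> float:
        # Gap between 1.0 and the next representable value.
        return 2.0 ** -self.mantissa_bits


# Standard bit splits (sign, exponent, mantissa).
FORMATS = {
    "fp32": FloatFormat(1, 8, 23),
    "tf32": FloatFormat(1, 8, 10),
    "fp16": FloatFormat(1, 5, 10),
    "bf16": FloatFormat(1, 8, 7),
}

if __name__ == "__main__":
    print(f"{'type':<6}{'bits':>5}{'exp':>5}{'mant':>6}{'max finite':>14}{'epsilon':>12}")
    for name, fmt in FORMATS.items():
        print(
            f"{name:<6}{fmt.total_bits:>5}{fmt.exponent_bits:>5}"
            f"{fmt.mantissa_bits:>6}{fmt.max_finite:>14.4g}{fmt.epsilon:>12.3g}"
        )
```

Read this way, bf16 and tf32 keep the 8-bit exponent (and therefore the dynamic range) of fp32 while giving up mantissa bits, whereas fp16 keeps more mantissa bits than bf16 but has a much smaller range.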
@@ -524,7 +524,7 @@ Since it has been discovered that more parameters lead to better performance, th
In this approach, every other FFN layer is replaced with an MoE layer that consists of many experts, with a gating function that trains each expert in a balanced way depending on the input token's position in the sequence.
(source: [GLAM](https://ai.googleblog.com/2021/12/more-efficient-in-context-learning-with.html))
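
To make the routing idea concrete, here is a minimal pure-Python sketch of a gated MoE layer for a single token. It is a toy under stated assumptions, not the GLaM implementation: the class name `ToyMoELayer`, the top-2 selection, and the use of one linear map per expert (instead of a full FFN) are all choices made for illustration, and the load-balancing loss used during training is left out.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def make_matrix(rows, cols, rng):
    """Random weights; stands in for trained parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class ToyMoELayer:
    """Hypothetical MoE layer: a gate picks top_k experts per token."""

    def __init__(self, d_model, num_experts, top_k=2, seed=0):
        rng = random.Random(seed)
        self.top_k = top_k
        # Gate: one row of weights per expert, producing one logit each.
        self.gate = make_matrix(num_experts, d_model, rng)
        # Assumption: each expert is a single linear map, not a full FFN.
        self.experts = [make_matrix(d_model, d_model, rng) for _ in range(num_experts)]

    def route(self, token):
        """Return [(expert_index, weight)] for the top_k experts."""
        probs = softmax(matvec(self.gate, token))
        ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
        chosen = ranked[: self.top_k]
        norm = sum(probs[i] for i in chosen)
        # Renormalize so the selected experts' weights sum to 1.
        return [(i, probs[i] / norm) for i in chosen]

    def forward(self, token):
        """Weighted sum of the selected experts' outputs."""
        out = [0.0] * len(token)
        for idx, weight in self.route(token):
            expert_out = matvec(self.experts[idx], token)
            out = [o + weight * e for o, e in zip(out, expert_out)]
        return out


if __name__ == "__main__":
    layer = ToyMoELayer(d_model=4, num_experts=8)
    token = [0.1, 0.2, -0.3, 0.4]
    print("routing:", layer.route(token))
    print("output:", layer.forward(token))
```

Only the selected experts run for a given token, which is why the parameter count can grow with the number of experts while the per-token compute stays roughly constant.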