Fix examples titles and optimization doc page (#5408)

2020-07-01 08:11:25 -04:00
parent d60d231ea4
commit 4ade7491f4
4 changed files with 228 additions and 69 deletions
--- a/docs/source/main_classes/optimizer_schedules.rst
+++ b/docs/source/main_classes/optimizer_schedules.rst
@@ -1,4 +1,4 @@
-Optimizer
+Optimization
 ----------------------------------------------------

 The ``.optimization`` module provides:
@@ -7,24 +7,25 @@ The ``.optimization`` module provides:
 - several schedules in the form of schedule objects that inherit from ``_LRSchedule``:
 - a gradient accumulation class to accumulate the gradients of multiple batches

-``AdamW``
-~~~~~~~~~~~~~~~~
+``AdamW`` (PyTorch)
+~~~~~~~~~~~~~~~~~~~

 .. autoclass:: transformers.AdamW
    :members:

-``AdamWeightDecay``
-~~~~~~~~~~~~~~~~~~~
+``AdamWeightDecay`` (TensorFlow)
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

 .. autoclass:: transformers.AdamWeightDecay

 .. autofunction:: transformers.create_optimizer

 Schedules
----------------------------------------------------
+~~~~~~~~~~~~~~~~~~~
+
+Learning Rate Schedules (Pytorch)
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

-Learning Rate Schedules
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 .. autofunction:: transformers.get_constant_schedule


@@ -56,16 +57,16 @@ Learning Rate Schedules
    :target: /imgs/warmup_linear_schedule.png
    :alt:

-``Warmup``
-~~~~~~~~~~~~~~~~
+``Warmup`` (TensorFlow)
+^^^^^^^^^^^^^^^^^^^^^^^

 .. autoclass:: transformers.WarmUp
    :members:

 Gradient Strategies
----------------------------------------------------
+~~~~~~~~~~~~~~~~~~~~

-``GradientAccumulator``
-~~~~~~~~~~~~~~~~~~~~~~~
+``GradientAccumulator`` (TensorFlow)
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

 .. autoclass:: transformers.GradientAccumulator