Fix pretrained models table

2019-11-26 15:40:03 -05:00
parent 44b82c777f
commit cf26a0c85e
2 changed files with 8 additions and 7 deletions
--- a/docs/source/pretrained_models.rst
+++ b/docs/source/pretrained_models.rst
@@ -162,31 +162,31 @@ Here is the full list of the currently provided pretrained models together with
 | ALBERT            | ``albert-base-v1``                                         | | 12 repeating layers, 128 embedding, 768-hidden, 12-heads, 11M parameters                                                            |
 |                   |                                                            | | ALBERT base model                                                                                                                   |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-large-v1``                                        | | 24 repeating layers, 128 embedding, 1024-hidden, 16-heads, 17M parameters                                                           |
 |                   |                                                            | | ALBERT large model                                                                                                                  |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-xlarge-v1``                                       | | 24 repeating layers, 128 embedding, 2048-hidden, 16-heads, 58M parameters                                                           |
 |                   |                                                            | | ALBERT xlarge model                                                                                                                 |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-xxlarge-v1``                                      | | 12 repeating layer, 128 embedding, 4096-hidden, 64-heads, 223M parameters                                                           |
 |                   |                                                            | | ALBERT xxlarge model                                                                                                                |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-base-v2``                                         | | 12 repeating layers, 128 embedding, 768-hidden, 12-heads, 11M parameters                                                            |
 |                   |                                                            | | ALBERT base model with no dropout, additional training data and longer training                                                     |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-large-v2``                                        | | 24 repeating layers, 128 embedding, 1024-hidden, 16-heads, 17M parameters                                                           |
 |                   |                                                            | | ALBERT large model with no dropout, additional training data and longer training                                                    |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-xlarge-v2``                                       | | 24 repeating layers, 128 embedding, 2048-hidden, 16-heads, 58M parameters                                                           |
 |                   |                                                            | | ALBERT xlarge model with no dropout, additional training data and longer training                                                   |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |
-+--------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
+|                   +------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------+
 |                   | ``albert-xxlarge-v2``                                      | | 12 repeating layer, 128 embedding, 4096-hidden, 64-heads, 223M parameters                                                           |
 |                   |                                                            | | ALBERT xxlarge model with no dropout, additional training data and longer training                                                  |
 |                   |                                                            | (see `details <https://github.com/google-research/google-research/tree/master/albert>`__)                                             |