From fa6113f9a0a2386adeb9d1fb67764585933c3c5e Mon Sep 17 00:00:00 2001
From: Soham Chatterjee <96soham96@gmail.com>
Date: Mon, 18 May 2020 23:23:29 +0800
Subject: [PATCH] Fixed spelling of training (#4416)

---
 docs/source/model_doc/albert.rst | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/source/model_doc/albert.rst b/docs/source/model_doc/albert.rst
index 8b06a9c1ae..8b78a336b5 100644
--- a/docs/source/model_doc/albert.rst
+++ b/docs/source/model_doc/albert.rst
@@ -6,7 +6,7 @@ Overview
 
 The ALBERT model was proposed in `ALBERT: A Lite BERT for Self-supervised Learning of Language Representations `_
 by Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, Radu Soricut. It presents
-two parameter-reduction techniques to lower memory consumption and increase the trainig speed of BERT:
+two parameter-reduction techniques to lower memory consumption and increase the training speed of BERT:
 
 - Splitting the embedding matrix into two smaller matrices
 - Using repeating layers split among groups