diff --git a/docs/source/model_summary.mdx b/docs/source/model_summary.mdx
index 2b57187b33..64b2b95c26 100644
--- a/docs/source/model_summary.mdx
+++ b/docs/source/model_summary.mdx
@@ -199,6 +199,9 @@ The library provides a version of the model for language modeling only.
+
+
+
[XLNet: Generalized Autoregressive Pretraining for Language Understanding](https://arxiv.org/abs/1906.08237), Zhilin
Yang et al.