update readme and few typos

This commit is contained in:
thomwolf
2019-01-10 01:40:00 +01:00
parent fa5222c296
commit e5c78c6684
3 changed files with 7 additions and 7 deletions

View File

@@ -584,7 +584,7 @@ class BertModel(BertPreTrainedModel):
to the last attention block of shape [batch_size, sequence_length, hidden_size],
`pooled_output`: a torch.FloatTensor of size [batch_size, hidden_size] which is the output of a
classifier pretrained on top of the hidden state associated to the first character of the
input (`CLF`) to train on the Next-Sentence task (see BERT's paper).
input (`CLS`) to train on the Next-Sentence task (see BERT's paper).
Example usage:
```python