Use HF papers (#38184)
* Use hf papers * Hugging Face papers * doi to hf papers * style
This commit is contained in:
committed by
GitHub
parent
1031ed5166
commit
de24fb63ed
@@ -493,7 +493,7 @@ Note that we have added a randomly initialized _adapter layer_ to `wav2vec2-base
|
||||
`encoder_add_adapter=True`. This adapter sub-samples the output sequence of
|
||||
`wav2vec2-base` along the time dimension. By default, a single
|
||||
output vector of `wav2vec2-base` has a receptive field of *ca.* 25ms (*cf.*
|
||||
Section *4.2* of the [official Wav2Vec2 paper](https://arxiv.org/pdf/2006.11477.pdf)), which represents a little less a single character. On the other hand, BART
|
||||
Section *4.2* of the [official Wav2Vec2 paper](https://huggingface.co/papers/2006.11477)), which represents a little less a single character. On the other hand, BART
|
||||
makes use of a sentence-piece tokenizer as an input processor, so that a single
|
||||
hidden vector of `bart-base` represents *ca.* 4 characters. To better align the
|
||||
receptive field of the *Wav2Vec2* output vectors with *BART*'s hidden-states in the cross-attention
|
||||
|
||||
Reference in New Issue
Block a user