[s2s] distributed eval allows num_return_sequences > 1 (#7254)

2020-09-24 17:30:09 -04:00
parent 0804d077c6
commit d9d0f1140b
5 changed files with 31 additions and 11 deletions
--- a/examples/seq2seq/README.md
+++ b/examples/seq2seq/README.md
@@ -235,7 +235,7 @@ export DATA_DIR=cnn_dm
    --fp16 \
    --bs 32
 ```
-### Multi-GPU Evalulation
+### Multi-GPU Evaluation
 here is a command to run xsum evaluation on 8 GPUS. It is more than linearly faster than run_eval.py in some cases 
 because it uses SortishSampler to minimize padding. You can also use it on 1 GPU. `data_dir` must have 
 `{type_path}.source` and `{type_path}.target`. Run `./run_distributed_eval.py --help` for all clargs.
@@ -250,7 +250,7 @@ python -m torch.distributed.launch --nproc_per_node=8  run_distributed_eval.py \

 Contributions that implement this command for other distributed hardware setups are welcome!

-#### run_eval tips and tricks
+#### Single-GPU Eval: Tips and Tricks

 When using `run_eval.py`, the following features can be useful: