* refactor TF beam search
* refactored generate can now properly use attention masks
* add force bos/eos logit processors