A paper on a new simple network architecture, the Transformer, based solely on attention mechanisms The NIPS 2017 accepted paper,…