Return to Article Details AAN+: Generalized Average Attention Network for Accelerating Neural Transformer Download Download PDF