Attention Is All You Need
Proposed the Transformer model, a novel architecture using self-attention to improve sequence transduction tasks.
Proposed the Transformer model, a novel architecture using self-attention to improve sequence transduction tasks.