[NLP] Transformer
Introduction

Attention Is All You Need (2017), A. Vaswani et al. https://arxiv.org/abs/1706.03762

From the abstract: "The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism."