Transformer and Self-Attention
The Transformer is the first transduction model relying entirely on self-attention to compute representations of its input and output without using sequence-aligned RNNs o…
2025-10-26 Transformer #note
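To make "relying entirely on self-attention to compute representations" concrete, here is a minimal sketch of single-head scaled dot-product self-attention in plain Python. It is an illustration under stated assumptions, not the note's own code: the function names (`self_attention`, `matmul`, `softmax`) and the projection matrices `w_q`, `w_k`, `w_v` are hypothetical, and masking, multiple heads, and positional encodings are left out.

```python
import math


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def self_attention(x, w_q, w_k, w_v):
    """Single-head self-attention (assumed shapes: x is n x d, each w_* is d x d_k).

    Every token's output is a weighted average of all value vectors, so each
    position can draw on every other position in one step, with no recurrence.
    """
    q = matmul(x, w_q)  # queries
    k = matmul(x, w_k)  # keys
    v = matmul(x, w_v)  # values
    d_k = len(k[0])
    scores = matmul(q, transpose(k))  # n x n pairwise similarities
    # Scale by sqrt(d_k), then normalize each row into attention weights.
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights


# Toy input: 3 tokens with 2 features each; identity projections for readability.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
outputs, attn = self_attention(tokens, identity, identity, identity)
```

Each row of `attn` sums to 1, and `outputs` has one vector per input token. In a trained model the three projections would be learned parameters rather than identity matrices.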