c 3
d 4
a, b c, d
print(a1:, a)
print(b1:, b)c [3, 4, 7]
d [4, 2, 9]
a, b c, d
print(a2:, a)
print(b2:, b)c [3, 4, 7]
d [4, 2, 9]
e [5, 2, 9]
a, b, bb c, d, e
print(a2:, a)
print(b2:, b)
print(bb:, bb)
A survey of the Vision Transformers and its CNN-Transformer based Variants 摘要1、介绍2、vit的基本概念2.1 patch嵌入2.2 位置嵌入2.2.1 绝对位置嵌入(APE)2.2.2 相对位置嵌入(RPE)2.2.3卷积位置嵌入(CPE) 2.3 注意力机制2.3.1多头自我注意(MSA) 2.4 Transformer层2.4.1 …