Unpaired I2I 之 QS-Attn

QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation cvpr2022

Contribution

随机选择patch对，不能完全关注利用domain-specific的信息。QS-attn 希望通过选择合适的anchor q，在包含更多domain-specific的patch中计算对比损失
Global attention
- 为patch定义量化函数计算该潜在位置与其他位置的相关度
- global attetion matrix: $A_g= softmax(QK), Q \in{R^{HW\times C},K=Q^T}$
- $H_g (i) = −\sum ^{HW}_{j=1}A_g (i, j) \cdot log A_g (i, j)$
- 为了选择所有重要的查询，选择 $H_g$ 最小的 N 个作为 QS-Attn 矩阵 $A_{QS} ∈ R^{N ×HW}$
Local attention
- 使用全局attention会对query周围的细节内容进行平滑，引入Local attention可以测量query与其周围区域的特征相似性，捕获局部区域的空间交互
- constant window 大小（w，w）步长 1
- $H_l (i) = −\sum ^{w^2}_{j=1}A_g (i, j) \cdot log A_g (i, j)$
Cross domain value routing for contrastive learning
- $L_{con} = − log[\dfrac{exp(q · k^+/τ )} {exp(q · k^+/τ ) + \sum ^{N −1}_{i=1} exp(q · k^−/τ )}]$
- One positive (N − 1) negative features are located in the real image x
- N anchors are from the fake image G(x)