0. 基本信息

引用信息

Z. Dou, K. Gao, X. Zhang, H. Wang and L. Han, "Band Selection of Hyperspectral Images Using Attention-Based Autoencoders," in IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 1, pp. 147-151, Jan. 2021, doi: 10.1109/LGRS.2020.2967815.

bibtex entry

@article{9032380,
  author  = {Z. {Dou} and K. {Gao} and X. {Zhang} and H. {Wang} and L. {Han}},
  journal = {IEEE Geoscience and Remote Sensing Letters},
  title   = {Band Selection of Hyperspectral Images Using Attention-Based Autoencoders},
  year    = {2021},
  volume  = {18},
  number  = {1},
  pages   = {147-151},
  doi     = {10.1109/LGRS.2020.2967815}
}

1. 总体思想

与BS-Net¹思想类似，均是将波段选择视作高光谱波段重建任务，同样根据原始的高光谱图像首先生成Attention map，而后根据Attention map与原始图像逐像素相乘的结构，使用Auto-encoder网络对于原始高光谱图像进行重建，得到重建后的高光谱图像，而后计算其与原始输入高光谱图像的差异计算损失函数。

总体流程

2. 具体方法

2.1 网络结构

在Attention Module中，网络结构实现与BS-Net几乎一致，只是网络层级和每层参数数量上的差异；对于用于重建的Auto-Encoder网络，则添加了跳链接，其余则是普通的auto-encoder结构。

网络结构

2.2 损失函数

最终的损失函数计算也和BS-Net类似，同样计算了逐像素的欧式距离相似性，并增加了Attention map权重的正则项，具体如下：

J(\theta) = \frac{1}{2}||f(\mathbf{X} \circ \mathbf{M}) - \mathbf{X}||^2_2 + \lambda||\mathbf{M}||_{2,1}

其中 $\mathbf{X}$ 是输入高光谱图像； $\mathbf{M}$ 是Attention Map，是根据 $\mathbf{X}$ 估算得到的， $||\mathbf{M}||_{2,1} = (\sum_{i=1}^n||\mathbf{M}_{i,:}||_1^2)^{1/2}$ ； $f$ 是AutoEncoder重建网络。

2.3 最优波段选择

与BS-Net不用，最后波段并不是根据Attention Map求和后选择最大得到，而是根据Attention Map进行相似性聚类（使用KMeans方法），而后根据聚类结果选择最优波段。Attention Map的相似性矩阵与原始高光谱图像的相似性矩阵有很大差异。

相似性矩阵（左：Attention Map；右：原始HSI）

Y. Cai, X. Liu and Z. Cai, "BS-Nets: An End-to-End Framework for Band Selection of Hyperspectral Image," in IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 3, pp. 1969-1984, March 2020, doi: 10.1109/TGRS.2019.2951433.

我的笔记：论文笔记：BS-Nets: An End-to-End Framework for Band Selection of Hyperspectral Image ↩

论文笔记：Band Selection of Hyperspectral Images Using Attention-Based Autoencoders