回答編集履歴

fix context

2023/09/30 05:06

投稿

ps_aux_grep

スコア1581

answer CHANGED Viewed

@@ -10,4 +10,4 @@
 Attentionは[Attention Is All You Need](https://arxiv.org/abs/1706.03762)で示されたようなものに限らず，多種多様にわたり独自開発が多く存在します．
 画像分類処理では[SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning](https://openaccess.thecvf.com/content_cvpr_2017/papers/Chen_SCA-CNN_Spatial_and_CVPR_2017_paper.pdf)で使われるような，Spatial AttentionとChannel-wise Attentionという2種のSelf Attentionが有名です．
-貴コードにある`VideoClassifier`はこれのどちらでもないっぽいですね．
+貴コードにある`VideoClassifier`は[ChatGPTに聞いたところ](https://chat.openai.com/share/c40c9c37-0295-4cd1-8b93-34c992f15686)Channel-wise Attentionぽいですね．

fix context

2023/09/30 05:02

投稿

ps_aux_grep

スコア1581

answer CHANGED Viewed

@@ -2,14 +2,12 @@
 > Self Attention = 多重パーセプトロン
 > なのでしょうか？
-その疑問を持つことは非常に正しい感覚です．
+その疑問を持つことはAttentionたる根源を知らないことによるものだと考えます．
-そもAttentionとは，入力してきたデータに0~1の値を掛けることで重みづけを行い，1であれば要注視データ，0であれば不要データとして捨てるような処理を意味します．
+そもAttentionとは，入力してきたデータに0~1の値を掛けることで重みづけを行い，1であれば要注視データ，0であれば不要データとして捨てるような処理を意味します．`VideoClassifier`では，`F.softmax()`の出力がこの重みに該当し，これを入力値`x.permute(0,2,1)`に掛けており，Attentionの体を成しています．
 ここでは，重みづけの処理を[Attention Is All You Need](https://arxiv.org/abs/1706.03762)で示されたようなScaled Dot-Product Attentionに使われる`torch.matmul`を使っておらず，`torch.bmm`が似たような処理を担当しているといった状態です．そも重みを作るための`dot-product`を多重パーセプトロンで代用しているので独自のSelf Attentionだと思ってください．
 Attentionは[Attention Is All You Need](https://arxiv.org/abs/1706.03762)で示されたようなものに限らず，多種多様にわたり独自開発が多く存在します．
-画像分類処理では次のようなattentionも存在します．
+画像分類処理では[SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning](https://openaccess.thecvf.com/content_cvpr_2017/papers/Chen_SCA-CNN_Spatial_and_CVPR_2017_paper.pdf)で使われるような，Spatial AttentionとChannel-wise Attentionという2種のSelf Attentionが有名です．
-[SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning](https://openaccess.thecvf.com/content_cvpr_2017/papers/Chen_SCA-CNN_Spatial_and_CVPR_2017_paper.pdf)では，Spatial AttentionとChannel-wise Attentionという2種のSelf Attentionを使っています．
 貴コードにある`VideoClassifier`はこれのどちらでもないっぽいですね．

fix context

2023/09/30 03:00

投稿

ps_aux_grep

スコア1581

answer CHANGED Viewed

@@ -7,3 +7,9 @@
 ここでは，重みづけの処理を[Attention Is All You Need](https://arxiv.org/abs/1706.03762)で示されたようなScaled Dot-Product Attentionに使われる`torch.matmul`を使っておらず，`torch.bmm`が似たような処理を担当しているといった状態です．そも重みを作るための`dot-product`を多重パーセプトロンで代用しているので独自のSelf Attentionだと思ってください．
+Attentionは[Attention Is All You Need](https://arxiv.org/abs/1706.03762)で示されたようなものに限らず，多種多様にわたり独自開発が多く存在します．
+画像分類処理では次のようなattentionも存在します．
+[SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning](https://openaccess.thecvf.com/content_cvpr_2017/papers/Chen_SCA-CNN_Spatial_and_CVPR_2017_paper.pdf)では，Spatial AttentionとChannel-wise Attentionという2種のSelf Attentionを使っています．
+貴コードにある`VideoClassifier`はこれのどちらでもないっぽいですね．