Question edit history

Code fix

2020/06/29 05:58

Posted

ima_chan1107

Score 1

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -1,4 +1,4 @@
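-In a certain paper (it is paywalled, so for the time being I will refrain from quoting its text or figures), there is an LSTM model that incorporates non-time-series data, and I am trying to implement it as a custom layer in Keras/TensorFlow.
+In a certain paper (it is paywalled, so I will refrain from quoting its text or figures), there is an LSTM model that incorporates non-time-series data, and I am trying to implement it as a custom layer in Keras/TensorFlow.
 Structurally, the time-series data is fed into the input gate, output gate, forget gate, and block input, just as in a conventional LSTM, whereas the non-time-series data is fed into only three of the four and skips the block input. As a result, non-time-series data that does not need to be remembered is not stored in the memory cell, which is said to improve training accuracy on data that includes non-time-series features.
 Because the layer takes both time-series and non-time-series data, I pass the two inputs as a list. For the computation inside the layer, following the Qiita article below, the call method returns the state h at the current time step, together with a list of the state h and the memory cell output c to be passed to the next time step. When I run it, an error occurs when the call method is invoked, and I am stuck. I suspect the mistake is somewhere in the __init__ method or the build method.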
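The gate wiring described above can be sketched in plain NumPy for a single time step. This is only an illustration under stated assumptions, not the paper's method or the code in this question: the function name `nonts_lstm_step`, the weight names, the gate ordering inside each weight matrix, and the choice of sigmoid for the gates and tanh for the block input are all hypothetical. The only parts taken from the description are that the non-time-series input reaches the three gates but not the block input, and that the cell state and output follow `c = f * c_tm1 + u * i` and `h = activation(c) * o`.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def nonts_lstm_step(x_t, x_n, h_tm1, c_tm1, W_t, W_n, U, b):
    """One step of a hypothetical LSTM cell with a non-time-series input.

    x_t:   (batch, d_t)        time-series input at this step
    x_n:   (batch, d_n)        non-time-series input
    h_tm1: (batch, units)      previous hidden state
    c_tm1: (batch, units)      previous cell state
    W_t:   (d_t, 4 * units)    assumed order: forget, block input, input, output
    W_n:   (d_n, 3 * units)    assumed order: forget, input, output
    U:     (units, 4 * units)  recurrent weights, same order as W_t
    b:     (4 * units,)        bias, same order as W_t
    """
    # Time-series and recurrent contributions reach all four blocks.
    z_f, z_u, z_i, z_o = np.split(x_t @ W_t + h_tm1 @ U + b, 4, axis=-1)
    # The non-time-series contribution reaches only the three gates.
    n_f, n_i, n_o = np.split(x_n @ W_n, 3, axis=-1)

    f = sigmoid(z_f + n_f)  # forget gate
    u = np.tanh(z_u)        # block input: no non-time-series term
    i = sigmoid(z_i + n_i)  # input gate
    o = sigmoid(z_o + n_o)  # output gate

    c = f * c_tm1 + u * i   # new cell state
    h = np.tanh(c) * o      # new hidden state
    return h, c
```

Splitting the non-time-series projection into three parts rather than four is what corresponds to a second kernel of width `units * 3` next to a first kernel of width `units * 4`.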
@@ -39,7 +39,7 @@
 ```Python
 import tensorflow as tf
 import numpy as np
-from tensorflow.keras.layers import Input, RNN, AbstractRNNCell
+from tensorflow.keras.layers import Input, Dense, RNN, AbstractRNNCell
 from tensorflow.python.keras import activations, constraints, initializers, regularizers
 from tensorflow.python.keras import backend as K
 from tensorflow.python.keras.utils import tf_utils
@@ -89,12 +89,12 @@
         input_dim1 = input_shape[0][-1]     # time-series input
         input_dim2 = input_shape[1][-1]     # non-time-series input
         self.kernel1          = self.add_weight(shape=(input_dim1, self.units * 4),
-                                                name='kernel',
+                                                name='kernel1',
                                                 initializer=self.kernel_initializer,
                                                 regularizer=self.kernel_regularizer,
                                                 constraint=self.kernel_constraint)
         self.kernel2          = self.add_weight(shape=(input_dim2, self.units * 3),
-                                                name='kernel',
+                                                name='kernel2',
                                                 initializer=self.kernel_initializer,
                                                 regularizer=self.kernel_regularizer,
                                                 constraint=self.kernel_constraint)
@@ -153,9 +153,9 @@
             x_o = K.bias_add(x_o, b_o)
         f = self.recurrent_activation(x_f + K.dot(h_tm1, self.recurrent_kernel[:, :self.units]))                    # forget gate
-        u = self.activation(x_u + K.dot(h_tm1, self.recurrene_kernel[:, self.units:self.units * 2]))                #
+        u = self.activation(x_u + K.dot(h_tm1, self.recurrene_kernel[:, self.units:self.units * 2]))                # block input
-        i = self.recurrent_activation(x_i + K.dot(h_tm1, self.recurrent_kernel[:, self.units * 2:self.units * 3]))  #
+        i = self.recurrent_activation(x_i + K.dot(h_tm1, self.recurrent_kernel[:, self.units * 2:self.units * 3]))  # input gate
-        o = self.recurrent_activation(x_o + K.dot(h_tm1, self.recurrent_kernel[:, self.units * 3:self.units * 4]))  #
+        o = self.recurrent_activation(x_o + K.dot(h_tm1, self.recurrent_kernel[:, self.units * 3:self.units * 4]))  # output gate
         c = f * c_tm1 + u * i
         h = self.activation(c) * o
@@ -169,11 +169,9 @@
 ```Python
 t_input = Input(shape=(30, 1))
-print(t_input.shape)
 n_input = Input(shape=(30, 1))
-print(n_input.shape)
 h = NontsLSTM(128)([t_input, n_input])
-output = Dense(1, activation='linear')
+output = Dense(1, activation='linear')(h)
 model = Model([t_input, n_input], output)
 model.compile(optimizer='adam', loss='mse', metrics='loss')