編集履歴

質問編集履歴

問題解決の糸口が見えてきましたが、未だに改善されません。

2019/01/26 02:35

投稿

JunyaKoga

スコア17

title CHANGED Viewed

File without changes

body CHANGED Viewed

@@ -1,31 +1,12 @@
-[GitHubの記事](https://github.com/kujirahand/book-mlearn-gyomu/blob/master/src/ch5/iris/tf-iris.py)を参考にTensorFlowでアヤメの分類問題をやっているのですが、恐らくデータ型のエラーがどうしても解消できません。
-コスト関数の最適化のとこでエラーが出ていますが、上の記事と見合わせても、データ型的にどこが間違えているのかがわかりません。
+### ＜1/26追記＞
-解決の方法がわかる方、是非ご教授くださると助かります。
+最下部の内容から進展があったので、記事を更新します。
-試行錯誤して色々書き換えると様々なエラーが生じるので、もしかしたら多くの個所がおかしいのかもしれません。
+問題があるであろう、グラフ作成の部分と実行部分を抜粋します（いくつか変更してます）。
+実行部ではfor文でエポック数を増やしたいのですが、2周目からエラーが出るようです。実行部のfor文をfor i in range(1)とした場合はエラーは生じません。
+**エラー内容は「sess.runにndarrayを入れるな。テンソルを入れてくれ」ということだと思いますが、何故、forの1周目でテンソルだったものが2周目からndarrayに変化してしまうのかがわかりません。**
-### 書いたコード
 ```python
-from urllib.request import urlretrieve
-import pandas as pd
-import numpy as np
-import tensorflow as tf
-from sklearn.model_selection import train_test_split
-# データの呼び出し
-url = '上記記事のコードのＵＲＬ'
-urlretrieve(url, 'iris.csv')
-df = pd.read_csv('iris.csv', encoding='utf-8')
-# データの準備
-names = sorted(set(df.Name.values))
-name2num = {w:i for i, w in enumerate(names)}
-df['label'] = df['Name'].map(name2num)
-X = df.iloc[:,:4].values
-y = df.label.values
-X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, stratify=y, random_state=0)
-# グラフの作成
+グラフの作成
 g = tf.Graph()
 with g.as_default():
   tf.set_random_seed(123)
@@ -34,37 +15,38 @@
   tf_y = tf.placeholder(tf.int32, shape=(None), name='tf_y')
   oh_y = tf.one_hot(tf_y, 3, dtype=tf.float32, name='oh_y')
   w = tf.Variable(tf.random_normal((4, 3)), name='weight')
   b = tf.Variable(tf.zeros(3), name='bias')
   logits = tf.add(tf.matmul(tf_x, w), b, name='logits')
   prediction = {'probabilities': tf.nn.softmax(logits, name='probabilities'),
-                'labels': tf.cast(tf.argmax(logits,1), tf.int32)}
+                'labels': tf.argmax(logits,1)}
-#   cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(logits=logits, labels=oh_y), name='cost')
+  cost = tf.losses.softmax_cross_entropy(logits=logits, onehot_labels=oh_y)
-  cost = -tf.reduce_sum(oh_y * tf.log(prediction['probabilities']))
+#   cost = -tf.reduce_sum(oh_y * tf.log(prediction['probabilities']))
   optimizer = tf.train.AdamOptimizer()
   train = optimizer.minimize(cost)
-  correct_predictions = tf.equal(prediction['labels'], tf.cast(tf.argmax(oh_y, 1), tf.int32))
+  correct_predictions = tf.equal(prediction['labels'], tf.argmax(oh_y, 1))
+#   tf.reduce_meanの中身はfloatにしてあげないと小数点の計算できない
-  accuracy = tf.reduce_mean(tf.cast(correct_predictions, tf.float32), name='accuracy')
+  accuracy = tf.reduce_mean(tf.cast(correct_predictions, tf.float32), name='accuracy')
   init = tf.global_variables_initializer()
-# 実行
+実行
 with tf.Session(graph=g) as sess:
   sess.run(init)
+# ↓のfor文のrange(1)にすると、エラーが生じません。
-  for step in range(300):
+  for step in range(2):
-    _, cost, accuracy = sess.run([train, cost, accuracy], feed_dict={tf_x: X_train, tf_y: y_train})
+    _, cost= sess.run([train, cost], feed_dict={tf_x: X_train, tf_y: y_train})
     if (step + 1) % 10 == 0:
-      print('Epoch%2d Cost:%.2f Accuracy:%.2f%%' %(step+1, cost, accuracy*100))
+      print('Epoch%2d Cost:%.2f' %(step+1, cost))
   print('Prediction Accuracy: %.2f' %(sess.run(accuracy, feed_dict={tf_x: X_test, tf_y: y_test}) * 100))
 ```
 ### 生じているエラー
-```ここに言語を入力
+```
 ---------------------------------------------------------------------------
 TypeError                                 Traceback (most recent call last)
 /usr/local/lib/python3.6/dist-packages/tensorflow/python/client/session.py in __init__(self, fetches, contraction_fn)
@@ -87,12 +69,12 @@
 During handling of the above exception, another exception occurred:
 TypeError                                 Traceback (most recent call last)
-<ipython-input-1-c6d4748fd830> in <module>()
+<ipython-input-134-171c87a28859> in <module>()
-     52
+      3 # ↓のfor文のrange(1)にすると、エラーが生じません。
-     53   for step in range(300):
+      4   for step in range(2):
----> 54     _, cost, accuracy = sess.run([train, cost, accuracy], feed_dict={tf_x: X_train, tf_y: y_train})
+----> 5     _, cost= sess.run([train, cost], feed_dict={tf_x: X_train, tf_y: y_train})
-     55     if (step + 1) % 10 == 0:
+      6     if (step + 1) % 10 == 0:
-     56       print('Epoch%2d Cost:%.2f Accuracy:%.2f%%' %(step+1, cost, accuracy*100))
+      7       print('Epoch%2d Cost:%.2f' %(step+1, cost))
 /usr/local/lib/python3.6/dist-packages/tensorflow/python/client/session.py in run(self, fetches, feed_dict, options, run_metadata)
     927     try:
@@ -150,6 +132,80 @@
     305       except ValueError as e:
     306         raise ValueError('Fetch argument %r cannot be interpreted as a '
+TypeError: Fetch argument 5.690171 has invalid type <class 'numpy.float32'>, must be a string or Tensor. (Can not convert a float32 into a Tensor or Operation.)
+```
+＜以下、更新前の原文＞
+[GitHubの記事](https://github.com/kujirahand/book-mlearn-gyomu/blob/master/src/ch5/iris/tf-iris.py)を参考にTensorFlowでアヤメの分類問題をやっているのですが、恐らくデータ型のエラーがどうしても解消できません。
+コスト関数の最適化のとこでエラーが出ていますが、上の記事と見合わせても、データ型的にどこが間違えているのかがわかりません。
+解決の方法がわかる方、是非ご教授くださると助かります。
+試行錯誤して色々書き換えると様々なエラーが生じるので、もしかしたら多くの個所がおかしいのかもしれません。
+### 書いたコード
+```python
+from urllib.request import urlretrieve
+import pandas as pd
+import numpy as np
+import tensorflow as tf
+from sklearn.model_selection import train_test_split
+# データの呼び出し
+url = '上記記事のコードのＵＲＬ'
+urlretrieve(url, 'iris.csv')
+df = pd.read_csv('iris.csv', encoding='utf-8')
+# データの準備
+names = sorted(set(df.Name.values))
+name2num = {w:i for i, w in enumerate(names)}
+df['label'] = df['Name'].map(name2num)
+X = df.iloc[:,:4].values
+y = df.label.values
+X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, stratify=y, random_state=0)
+# グラフの作成
+g = tf.Graph()
+with g.as_default():
+  tf.set_random_seed(123)
+  tf_x = tf.placeholder(tf.float32, shape=(None, 4), name='tf_x')
+  tf_y = tf.placeholder(tf.int32, shape=(None), name='tf_y')
+  oh_y = tf.one_hot(tf_y, 3, dtype=tf.float32, name='oh_y')
+  w = tf.Variable(tf.random_normal((4, 3)), name='weight')
+  b = tf.Variable(tf.zeros(3), name='bias')
+  logits = tf.add(tf.matmul(tf_x, w), b, name='logits')
+  prediction = {'probabilities': tf.nn.softmax(logits, name='probabilities'),
+                'labels': tf.cast(tf.argmax(logits,1), tf.int32)}
+#   cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits_v2(logits=logits, labels=oh_y), name='cost')
+  cost = -tf.reduce_sum(oh_y * tf.log(prediction['probabilities']))
+  optimizer = tf.train.AdamOptimizer()
+  train = optimizer.minimize(cost)
+  correct_predictions = tf.equal(prediction['labels'], tf.cast(tf.argmax(oh_y, 1), tf.int32))
+  accuracy = tf.reduce_mean(tf.cast(correct_predictions, tf.float32), name='accuracy')
+  init = tf.global_variables_initializer()
+# 実行
+with tf.Session(graph=g) as sess:
+  sess.run(init)
+  for step in range(300):
+    _, cost, accuracy = sess.run([train, cost, accuracy], feed_dict={tf_x: X_train, tf_y: y_train})
+    if (step + 1) % 10 == 0:
+      print('Epoch%2d Cost:%.2f Accuracy:%.2f%%' %(step+1, cost, accuracy*100))
+  print('Prediction Accuracy: %.2f' %(sess.run(accuracy, feed_dict={tf_x: X_test, tf_y: y_test}) * 100))
+```
+### 生じているエラー（追記時に字数が足りなくなったので削りました）
+```ここに言語を入力
+---------------------------------------------------------------------------
 TypeError: Fetch argument 682.8205 has invalid type <class 'numpy.float32'>, must be a string or Tensor. (Can not convert a float32 into a Tensor or Operation.)
 ```