回答編集履歴

誤字の修正

2021/11/18 01:58

投稿

HRCo4

スコア140

answer CHANGED Viewed

@@ -1,5 +1,5 @@
 （回答修正しました）
-コードと見ると、np.expand_dims を2回行っており、ネットワークの入力が1chということから、おそらくコードの参考元では1chのグレースケール画像を入力しており、今回試したのはカラーあるいは3chのグレースケール画像を入力しようとしたのではないでしょうか？
+コードと見ると、np.expand_dims を2回行っていますね。ネットワークの入力が1chということから、おそらくコードの参考元では1chのグレースケール画像を入力しており、今回試したのはカラーあるいは3chのグレースケール画像を入力しようとしたのではないでしょうか？
 おそらくPreprocessedDatasetクラスで画像をロードするときに np.expand_dims を行っており、推論実行するときにまた np.expand_dims を行っているため、入力する画像の shape が (1, 1, h, w, ch) になっています。

補足追加

2021/11/18 01:58

投稿

HRCo4

スコア140

answer CHANGED Viewed

@@ -1,16 +1,103 @@
+（回答修正しました）
-PreprocessedDatasetクラスで画像をロードするときに np.expand_dims を行っており、推論実行するときにまた np.expand_dims を行っているため、入力する画像の shape が (1, 1, h, w, ch) になっています。
+コードと見ると、np.expand_dims を2回行っており、ネットワークの入力が1chということから、おそらくコードの参考元では1chのグレースケール画像を入力しており、今回試したのはカラーあるいは3chのグレースケール画像を入力しようとしたのではないでしょうか？
-pred = model.predictor(np.expand_dims(img, axis=0))　を
+おそらくPreprocessedDatasetクラスで画像をロードするときに np.expand_dims を行っており、推論実行するときにまた np.expand_dims を行っているため、入力する画像の shape が (1, 1, h, w, ch) になっています。
-pred = model.predictor(img)　に
-書き換えればよろしいかと。
-あと、chainer は確か入力が (batch, ch, h, w) の形式だったかと思います。
+ネットワーク構成的に入力を1chにしなければならないので画像を読み込んだ際にグレースケール化すれば問題ないかと思います。
-また、skimage の imread は (h, w, ch) の形式で読み込んだはずですので、
+修正したコードを記入しておきます。
 ```
+import os
+import numpy as np
+import skimage. io as io
+from skimage.color import rgb2gray # 追加(グレースケール用)
+import chainer
+import chainer.links as L
+import chainer.functions as F
+class PreprocessedDataset(chainer.dataset.DatasetMixin):
+    def __init__(
+        self,
+        root_path,
+        split_list
+    ):
+        self.root_path = root_path
+        with open(split_list) as f:
+            self.split_list = [line.rstrip() for line in f]
+        self.dtype = np.float32
+    def __len__(self):
+        return len(self.split_list)
-def _get_image(self, i):
+    def _get_image(self, i):
         image = io.imread(os.path.join(self.root_path, self.split_list[i]))
         image = self._min_max_normalize_one_image(image)
+        if len(image.shape) == 3: # カラーあるいは3chグレスケ画像の場合は1chグレスケ化
-        image = image.transpose(2,0,1)
+            image = rgb2gray(image) # (h, w, ch) -> (h, w)
-        return np.expand_dims(image.astype(self.dtype), axis=0)
+        return np.expand_dims(image.astype(self.dtype), axis=0) # (h, w) -> (1, h, w)
+    def _min_max_normalize_one_image(self, image):
+        max_int = image.max()
+        min_int = image.min()
+        out = (image.astype(np.float32) - min_int) / (max_int - min_int)
+        return out
+    def _get_label(self, i):
+        label = 0 if 'false' in self.split_list[i] else 1
+        return label
+    def get_example(self, i):
+        x, y = self._get_image(i), self._get_label(i)
+        return x, y
+class ClassificationModel(chainer.Chain):
+  def __init__(self, n_class=2):
+    super(ClassificationModel, self).__init__()
+    with self.init_scope():
+      self.conv1 = L.Convolution2D(1, 32, 5, 1, 2)
+      self.bn1 = L.BatchNormalization(32)
+      self.conv2 = L.Convolution2D(32, 64, 5, 1, 2)
+      self.bn2 = L.BatchNormalization(64)
+      self.conv3 = L.Convolution2D(64, 128, 3, 1, 1)
+      self.bn3 = L.BatchNormalization(128)
+      self.conv4 = L.Convolution2D(128, 256, 3, 1, 1)
+      self.bn4 = L.BatchNormalization(256)
+      self.fc5 = L.Linear(16384, 1024)
+      self.fc6 = L.Linear(1024, n_class)
+  def __call__(self, x):
+    h = F.relu(self.conv1(x)) ←#エラー箇所
+    h = F.max_pooling_2d(self.bn1(h), 2, 2)
+    h = F.relu(self.conv2(x))
+    h = F.max_pooling_2d(self.bn2(h), 2, 2)
+    h = F.relu(self.conv3(x))
+    h = F.max_pooling_2d(self.bn3(h), 2, 2)
+    h = F.relu(self.conv4(x))
+    h = F.max_pooling_2d(self.bn4(h), 2, 2)
+    h = F.dropout(F.relu(self.fc5(h)))
+    return self.fc6(h)
+root_path = './dataset_cls'
+split_list = './dataset_cls/split_list/test.txt'
+test_dataset = PreprocessedDataset(root_path, split_list)
+model = L.Classifier(ClassificationModel(n_class=2))
+print('================')
+for i in range(10):
+  with chainer.using_config('train', False):
+    img, label = test_dataset.get_example(i)
+    pred = model.predictor(np.expand_dims(img, axis=0)) # (1, h, w) -> (1, 1, h, w)
+    pred = F.softmax(pred)
+  print('test {}'.format(i + 1))
+  print(' pred: {}'.format(np.argmax(pred.data)))
+  print(' label: {}'.format(label))
+  print('================')
-```
+```
-のようにして、shape を (1, ch, h, w) にする必要があります。