How to resize a two-dimensional NumPy array

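Suppose I have a two-dimensional NumPy array of size 256×256.
Plotted, it would appear as a square image. I would like to shrink it to 128×128, the same way an image is scaled up or down, but I have not been able to work out how despite a lot of searching. There are plenty of functions for resizing images, but I could not find one for plain arrays.
My explanation may be hard to follow, but any help would be appreciated!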

hayataka2049

2018/10/16 18:43

Several algorithms are possible. Which one do you want to use?

sasasho

2018/10/17 01:09

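Thank you. The use case is image generation with a GAN: 256 is too large, so I plan to resize to 128, generate images, and then resize the results back to 256. If there is a method suited to that, I would like to use it (sorry if this does not really answer your question).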

tiitoi

2018/10/17 02:06

For a GAN, wouldn't it work to use Max Pooling for downscaling and Upsampling for upscaling? I believe DCGAN and similar models actually do it that way.

sasasho

2018/10/17 02:58

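Apologies for my lack of background, but the DCGAN-based paper I am following ("Unlabeled Samples Generated by GAN") resized the images down before feeding them to the GAN. I also assumed that working with large images would increase the number of parameters and make the computation slow. I would appreciate your thoughts.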
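For reference, the Max Pooling / Upsampling idea raised in the comments can be illustrated in plain NumPy. This is only a minimal sketch of what the two operations do to a 2D array, not how a framework layer is implemented; the helper names `max_pool_2x2` and `upsample_2x` are hypothetical, and the height and width are assumed to be even.

```python
import numpy as np

def max_pool_2x2(x):
    """Halve height and width by taking the max of each 2x2 block."""
    h, w = x.shape
    # Split each axis into (blocks, 2) and reduce over the two inner axes.
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

def upsample_2x(x):
    """Double height and width by repeating each element (nearest neighbor)."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

a = np.arange(16).reshape(4, 4)
pooled = max_pool_2x2(a)
print(pooled.tolist())            # [[5, 7], [13, 15]]
print(upsample_2x(pooled).shape)  # (4, 4)
```

Note that upsampling a pooled array does not recover the original values; it only restores the shape.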

3 answers

Best answer

Convert the NumPy array to the type used by an image-processing library, shrink it there, and convert the result back to a NumPy array.

python
import numpy as np
from PIL import Image

a = np.arange(8*8).reshape((8,8))
print(a)
i = Image.fromarray(np.uint8(a))
# Choose whichever filter you prefer. NEAREST is given explicitly here because
# the default resampling filter of resize() differs between Pillow versions.
a = np.asarray(i.resize((4,4), resample=Image.NEAREST))
print(a)

""" =>
[[ 0  1  2  3  4  5  6  7]
 [ 8  9 10 11 12 13 14 15]
 [16 17 18 19 20 21 22 23]
 [24 25 26 27 28 29 30 31]
 [32 33 34 35 36 37 38 39]
 [40 41 42 43 44 45 46 47]
 [48 49 50 51 52 53 54 55]
 [56 57 58 59 60 61 62 63]]
[[ 9 11 13 15]
 [25 27 29 31]
 [41 43 45 47]
 [57 59 61 63]]
"""

References:
Converting between NumPy arrays and PIL - white wheelsのメモ
Batch-resizing (enlarging / shrinking) images with Python and Pillow | note.nkmk.me

Posted 2018/10/17 01:20

Edited 2018/10/17 01:21

hayataka2049

Total score 30939

sasasho

2018/10/17 01:42

Thank you. What should I do if I want to apply the same scaling to all of, say, 10000 arrays of 256×256, stored with shape (10000, 256, 256)?

hayataka2049

2018/10/17 01:59

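With the method in my answer, the only option is to run the loop 10000 times. That said, if this is for machine learning or similar, each framework most likely supports some form of preprocessing, so I suggest looking into using that.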

sasasho

2018/10/17 02:55

Thank you. I will look into both the loop and framework support.
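The loop approach discussed in this thread might look like the following. It is a minimal sketch that assumes the data is a `uint8` array of shape (N, H, W); the helper name `resize_batch` and the choice of the bilinear filter are illustrative assumptions, not part of the original answer.

```python
import numpy as np
from PIL import Image

def resize_batch(batch, size, resample=Image.BILINEAR):
    """Resize every 2D uint8 array in `batch` (N, H, W) to `size` = (width, height)."""
    out = np.empty((batch.shape[0], size[1], size[0]), dtype=np.uint8)
    for n, frame in enumerate(batch):
        # Round-trip each frame through a Pillow image, as in the answer above.
        out[n] = np.asarray(Image.fromarray(frame).resize(size, resample=resample))
    return out

# Small stand-in for the (10000, 256, 256) array from the question.
rng = np.random.default_rng(0)
batch = rng.integers(0, 256, size=(10, 256, 256), dtype=np.uint8)
small = resize_batch(batch, (128, 128))
print(small.shape)  # (10, 128, 128)
```

Preallocating the output array avoids building a Python list of 10000 arrays and stacking them afterwards.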

Is the paper mentioned in the comments "Unlabeled Samples Generated by GAN
Improve the Person Re-identification Baseline in vitro"?
It does say that the generated images are resized from (128, 128, 3) to (256, 256, 3).

Sample: resizing a single image

With TensorFlow / Keras, there is a function called tf.image.resize_bilinear() (a TensorFlow 1.x API) that can resize a whole mini-batch at once.

python
import cv2
import numpy as np
import tensorflow as tf  # TensorFlow 1.x API (tf.placeholder, tf.Session)

# Create an input image
img = np.random.randint(0, 10, (128, 128, 3), dtype=np.uint8)

# Resize with OpenCV
cv_resized = cv2.resize(img, (256, 256), interpolation=cv2.INTER_LINEAR)
print(cv_resized.shape)  # (256, 256, 3)

# Resize with TensorFlow (it expects a batch axis, so add one and strip it afterwards)
x = tf.placeholder(tf.float32)
resize = tf.image.resize_bilinear(x, (256, 256))
with tf.Session() as sess:
    tf_resized = sess.run(resize, feed_dict={x: img[np.newaxis, ...]})
    tf_resized = tf_resized[0]
print(tf_resized.shape)  # (256, 256, 3)

# Compare the OpenCV and TensorFlow results.
# Note: np.any only confirms that at least one element is close;
# np.allclose(cv_resized, tf_resized) would be the strict element-wise check.
print(np.any(np.isclose(cv_resized, tf_resized)))  # True

Speed

Resize N images one at a time with cv2.resize().
Resize N images all at once with TensorFlow's tf.image.resize_bilinear().

python
import time
import cv2
import numpy as np
import tensorflow as tf  # TensorFlow 1.x API (tf.placeholder, tf.Session)

# Create the input images
num_imgs = 100
img_batch = np.random.randint(0, 10, (num_imgs, 128, 128, 3), dtype=np.uint8)

# Resize with OpenCV, one image at a time
#------------------------------------
start = time.time()
for img in img_batch:
    cv2.resize(img, (256, 256), interpolation=cv2.INTER_LINEAR)
end = time.time() - start
print('resize {} images by using cv2.resize(): {:.4f} s'.format(num_imgs, end))

# Resize with TensorFlow, whole batch at once
#------------------------------------
x = tf.placeholder(tf.float32)
resize = tf.image.resize_bilinear(x, (256, 256))

# Note: the measured time includes creating the session
start = time.time()
with tf.Session() as sess:
    sess.run(resize, feed_dict={x: img_batch})
end = time.time() - start
print('resize {} images by using tf.image.resize_bilinear(): {:.4f} s'.format(num_imgs, end))

resize 100 images by using cv2.resize(): 0.0144 s
resize 100 images by using tf.image.resize_bilinear(): 0.0330 s

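OpenCV turned out to be faster in this measurement.
Perhaps TensorFlow also performs the resize on the CPU rather than the GPU?
OpenCV is heavily optimized for this kind of operation, which would explain why it is faster.

Posted 2018/10/17 03:48

Edited 2018/10/17 03:49

tiitoi

Total score 21960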

sasasho

2018/10/17 07:16

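Thank you for such a detailed explanation. My input is a two-dimensional NumPy array of shape (128, 128), a grayscale image with no channel axis, so I will use this for the resize after running the GAN!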
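For the 2D case without a channel axis described here, one possible route is Pillow's single-channel float mode. The sketch below assumes the generated sample is a `float32` array of shape (128, 128); the variable `gen` is a hypothetical stand-in for real GAN output, and bilinear interpolation is an assumed choice.

```python
import numpy as np
from PIL import Image

# Hypothetical stand-in for one generated grayscale sample of shape (128, 128).
gen = np.random.default_rng(0).random((128, 128)).astype(np.float32)

# A 2D float32 array maps to Pillow's single-channel "F" mode,
# so no channel axis and no conversion to uint8 is needed.
img = Image.fromarray(gen)
up = np.asarray(img.resize((256, 256), resample=Image.BILINEAR))
print(up.shape, up.dtype)  # (256, 256) float32
```

Staying in float mode avoids the rounding and clipping that a conversion to `uint8` would introduce.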

Something like this?

import numpy as np

A = np.empty((256, 256), float)

# Assign values to A (omitted)

# Shrink A into the 128x128 array B by averaging each 2x2 block
B = np.empty((128, 128), float)

for i in range(128):
    for j in range(128):
        B[i, j] = 0
        for k in range(2):
            for l in range(2):
                B[i, j] += A[2*i + k, 2*j + l] / 4
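The same 2x2 block averaging can also be written without Python loops by reshaping. This is a sketch under the assumption that height and width are even; the helper name `block_mean_2x2` is hypothetical. Because it only touches the last two axes, it also handles a stack such as (10000, 256, 256) in one call.

```python
import numpy as np

def block_mean_2x2(a):
    """Average non-overlapping 2x2 blocks over the last two axes."""
    *lead, h, w = a.shape
    # Split rows and columns into (blocks, 2), then average the two inner axes.
    return a.reshape(*lead, h // 2, 2, w // 2, 2).mean(axis=(-3, -1))

A = np.arange(16, dtype=float).reshape(4, 4)
print(block_mean_2x2(A).tolist())  # [[2.5, 4.5], [10.5, 12.5]]

stack = np.zeros((10, 256, 256))
print(block_mean_2x2(stack).shape)  # (10, 128, 128)
```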

Posted 2018/10/16 16:58

Edited 2018/10/16 16:59