cifar10プログラムについて

Question

cifar10の画像認識方法についてなのですが、まず
```python
import cv2
cv2.imread("画像ファイル")
```
によって、画像を読み取り、それをいろいろ変換して、32x32の３チャンネルの画像10枚を取得しました。
これらをimagesに入れて、images.shape = (10, 32, 32, 3)の状態です。(imagesはnumpy配列です。)
ここまでは前置きです。

この後、画像変換に使用される計算グラフは三次元（一枚の画像)を想定しているために、ひとまとめにではなく一枚ずつ送り込む必要がある、と書いてあったのですが、その計算グラフの部分は、
```python
#画像の配列を一枚ずつ供給するproducerの形成
image, = tf.train.slice_input_producer([images], shuffle = False)
#画像の切り落とし
reshaped_image = tf.cast(image, tf.float32) #何故キャスト？
resized_image = tf.image.resize_image_with_crop_or_pad(reshaped_image, 24, 24)
#画像の正規化
float_image = tf.image.per_image_standardization(resized_image)
```
でした。
この後、
```python
images = tf.train.batch([float_image], batch_size = FLAGS.batch_size)
```
によって、個々の画像を四次元配列に戻すため、ここまでが画像を三次元で扱わなくてはいけないところだと思うのですが、いろいろ調べてみると、
[https://www.tensorflow.org/api_docs/python/tf/image/resize_with_crop_or_pad](https://www.tensorflow.org/api_docs/python/tf/image/resize_with_crop_or_pad)
には、引数は3Dでも4Dでも良いと書いてあり、
[https://www.tensorflow.org/api_docs/python/tf/image/per_image_standardization](https://www.tensorflow.org/api_docs/python/tf/image/per_image_standardization)
には、引数はn次元と書いてあります。
つまり僕の解釈としては、どこにも画像を３次元で扱わなくてはいけないような要素はないと感じました。
本当に三次元で考える必要はあるのでしょうか。

そしてそもそも、slice_input_producerによって、imageには一枚目の画像データしか入らないので、batch()によって画像をまとめても、最初の一枚しか変換されていないように思うのですが、どういうことでしょうか。

それからもう一つ、コードの部分にも記述しましたように、何故浮動小数点にキャストする必要があるのでしょうか。

この三つについてご教授願います。

長くなるとアレなので、なるべく短く説明しましたので、不足があったら申し訳ありません。

### 追記
念の為、全コードを追記します。
```python
import os
import cv2
import numpy as np
import TensorFlow as tf
from フォルダ名 import cifar10

image_path = '画像フォルダ'
classes = ['airplane', 'automobile', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck']
#画像の読み込み
images = []
files = os.listdir(image_path)
for file in files:
    img = cv2.imread(os.path.join(image_path, file))
    img = img[:, :, ::-1]
    height = img.shape[0]
    width  = img.shape[1]
    cropped_size = min(width, height)
    sx = (width - cropped_size) // 2
    sy = (height - cropped_size) // 2
    cropped_img = img[sy:sy + cropped_size, sx:sx + cropped_size]
    resized_img = cv2.resize(cropped_img, (32, 32))
    images.append(resized_img)

images = np.array(images)

FLAGS = tf.app.flags.FLAGS
FLAGS.batch_size = len(images)

image, = tf.train.slice_imput_producer([images], shuffle = False)
reshaped_image = tf.cast(image, tf.float32)
resized_image = tf.image.resize_image_with_crop_or_pad(reshaped_image, 24, 24)
float_image = tf.image.per_image_standardization(resized_image)

#バッチ入力の設定
images = tf.train.batch([float_image], batch_size = FLAGS.batch_size)

#予測器の作成
logits = cifar10.inference(images)

softmax = tf.nn.softmax(logits)
prediction = tf.argmax(softmax, 1)

#移動平均版の学習データを復元するように設定
variable_averages = tf.train.ExponentialMovingAverage(cifar10.MOVING_AVERAGE_DECAY)
variables_to_restore = variable_averages.variables_to_restore()
saver = tf.train.Saver(variables_to_restore)

sess = tf.Session()

checkpoint = tf.train.latest_checkpoint('cifar10_train')
if checkpoint:
    saver.restore(session, checkpoint)

#複数の画像がキューに詰められた状態なので、一つずつ取り出して処理するランナーの生成
coord = tf.train.Coordinator()
try:
    #処理を行うスレッドの生成
    threads = []
    for qr in tf.get_collection(tf.GraphKeys.QUEUE_RUNNERS):
        threads.extend(qr.create_threads(sess, coord = coord, daemon = True, start = True))

    softmaxs, predictions = session.run([softmax, prediction])
    for f, s, p in zip(files, softmaxs, predictions):
        print(f, classes[p])
        print(list(s))
        print()

except Exception as e:
    coord.request_stop(e)

#スレッドを止める
coord.request_stop()
coord.join(threads, stop_grace_period_secs = 10)
```

結果は,
画像ファイル名 dog
[0.02342324234, 0.00324532453, ..., ..., ..., ..., ..., ..., ..., ...]

以下省略

のような形です。

Accepted Answer

解決したので、簡単に載せます。
まず、何故三次元に直して扱わなくてはならないのかについてですが、
これは、tensorflowのバージョンによっては三次元で扱わなくてはいけない、ということでした。
そして、imageには一枚目の画像データしか入らないので、batch()によって画像をまとめても、最初の一枚しか変換されていないように思える、という点についてなのですが、
これは資料の記述ミスで、本当はslice_input_producer()からper_image_standardization()までを、画像の枚数分ループしなくてはいけないらしいです。