AttributeError: 'Tensor' object has no attribute 'numpy' in Tensorflow 2.1

Question

我正在尝试在 Tensorflow 2.1 中转换 Tensor 的 shape 属性，但出现此错误：

AttributeError: 'Tensor' object has no attribute 'numpy'

我已经检查过 tf.executing eagerly() 的输出是 True,

一些上下文：我从 TFRecords 加载 tf.data.Dataset，然后应用 map。映射函数正在尝试将数据集样本之一 Tensor 的 shape 属性转换为 numpy:

def _parse_and_decode(serialized_example):
    """ parse and decode each image """
    features = tf.io.parse_single_example(
        serialized_example,
        features={
            'encoded_image': tf.io.FixedLenFeature([], tf.string),
            'kp_flat': tf.io.VarLenFeature(tf.int64),
            'kp_shape': tf.io.FixedLenFeature([3], tf.int64),
        }
    )
    image = tf.io.decode_png(features['encoded_image'], dtype=tf.uint8)
    image = tf.cast(image, tf.float32)

    kp_shape = features['kp_shape']

    kp_flat = tf.sparse.to_dense(features['kp_flat'])
    kp = tf.reshape(kp_flat, kp_shape)

    return image, kp


def read_tfrecords(records_dir, batch_size=1):
    # Read dataset from tfrecords
    tfrecords_files = glob.glob(os.path.join(records_dir, '*'))
    dataset = tf.data.TFRecordDataset(tfrecords_files)
    dataset = dataset.map(_parse_and_decode, num_parallel_calls=batch_size)
    return dataset


def transform(img, labels):
    img_shape = img.shape  # type: <class 'tensorflow.python.framework.ops.Tensor'>`
    img_shape = img_shape.numpy()  # <-- Throws the error
    # ...    

dataset = read_tfrecords(records_dir)

这会引发错误：

dataset.map(transform, num_parallel_calls=1)

虽然这非常有效：

for img, labels in dataset.take(1):
    print(img.shape.numpy())

编辑： 尝试访问 img.numpy() 而不是 img.shape.numpy() 会在上面的转换器和代码中导致相同的行为。

我检查了 img_shape 的类型，它是 <class 'tensorflow.python.framework.ops.Tensor'>。

有没有人在新版本的 Tensorflow 中解决了此类问题？

Answer 1

您的代码中的问题是您不能在映射到 tf.data.Datasets 的函数内部使用 .numpy()，因为 .numpy() 是 Python 代码而不是纯 TensorFlow 代码.

当您使用像 my_dataset.map(my_function) 这样的函数时，您只能在 my_function 函数中使用 tf.* 函数。

这不是 TensorFlow 2.x 版本的错误，而是关于如何在幕后出于性能目的生成静态图的错误。

如果您想在映射到数据集的函数中使用自定义 Python 代码，则必须使用 tf.py_function()，文档：https://www.tensorflow.org/api_docs/python/tf/py_function。在数据集上映射时，确实没有其他方法可以混合 Python 代码和 TensorFlow 代码。

您也可以参考这个问题了解更多信息；这正是我几个月前问过的问题：

AttributeError: 'Tensor' object has no attribute 'numpy' in Tensorflow 2.1

AttributeError: 'Tensor' object has no attribute 'numpy' in Tensorflow 2.1

python

numpy

tensorflow

tfrecord

tensorflow2.x