使用 Word2Vec 和深度卷积自编码器的电影相似度

Question

我是 python 的新手，我正在尝试创建一个模型，该模型可以根据电影描述来衡量电影的相似程度，到目前为止我遵循的步骤是：

1.turn 使用 Word2Vec 将每个电影描述转换为 100*（电影描述可能的最大单词数）值的向量，这会为每个电影描述生成一个 21300 个值的向量。 2.create 尝试压缩每个向量（并希望从中提取含义）的深度卷积自动编码器。

虽然第一步成功了，但我仍在努力使用自动编码器，这是我目前的代码：

encoder_input = keras.Input(shape=(21300,), name='sum')
encoded= tf.keras.layers.Reshape((150,142,1),input_shape=(21300,))(encoder_input)
x = tf.keras.layers.Conv2D(128, (3, 3), activation="relu", padding="same",input_shape=(1,128,150,142))(encoded)
x = tf.keras.layers.MaxPooling2D((2, 2), padding="same")(x)
x = tf.keras.layers.Conv2D(64, (3, 3), activation="relu", padding="same")(x)
x = tf.keras.layers.MaxPooling2D((2, 2), padding="same")(x)#49*25*64
x = tf.keras.layers.Conv2D(32, (3, 3), activation="relu", padding="same")(x)
x = tf.keras.layers.MaxPooling2D((2, 2), padding="same")(x)#25*13*32
x = tf.keras.layers.Conv2D(16, (3, 3), activation="relu", padding="same")(x)
x = tf.keras.layers.MaxPooling2D((2, 2), padding="same")(x)
x = tf.keras.layers.Conv2D(8, (3, 3), activation="relu", padding="same")(x)
x = tf.keras.layers.MaxPooling2D((2, 2), padding="same")(x)
x=tf.keras.layers.Flatten()(x)
encoder_output=keras.layers.Dense(units=90, activation='relu',name='encoder')(x)
x= tf.keras.layers.Reshape((10,9,1),input_shape=(28,))(encoder_output)

# Decoder

decoder_input=tf.keras.layers.Conv2D(8, (3, 3), activation='relu', padding='same')(x)
x = tf.keras.layers.UpSampling2D((2, 2))(decoder_input)
x = tf.keras.layers.Conv2D(16, (3, 3), activation='relu')(x)
x = tf.keras.layers.UpSampling2D((2, 2))(x)
x = tf.keras.layers.Conv2D(32, (3, 3), activation='relu')(x)
x = tf.keras.layers.UpSampling2D((2, 2))(x)
x = tf.keras.layers.Conv2D(64, (3, 3), activation='relu')(x)
x = tf.keras.layers.UpSampling2D((2, 2))(x)
x = tf.keras.layers.Conv2D(128, (3, 3), activation='relu')(x)
x = tf.keras.layers.UpSampling2D((2, 2))(x)
decoder_output = keras.layers.Conv2D(1, (3, 3), activation='relu', padding='same')(x)

autoencoder = keras.Model(encoder_input, decoder_output)
opt = tf.keras.optimizers.Adam(learning_rate=0.001, decay=1e-6)

autoencoder = keras.Model(encoder_input, decoder_output, name='autoencoder')

autoencoder.compile(opt, loss='mse')
print("STARTING FITTING")


history = autoencoder.fit(
movies_vector,
movies_vector,
epochs=25,

        )


print("ENCODER READY")
#USING THE MIDDLE LAYER 
encoder = keras.Model(inputs=autoencoder.input,
                    outputs=autoencoder.get_layer('encoder').output)

运行此代码给我以下错误：

required broadcastable shapes [[node mean_squared_error/SquaredDifference (defined at tmp/ipykernel_52/3425712667.py:119) ]] [Op:__inference_train_function_1568]

我有两个问题：

1.how 我可以修复这个错误吗？

2.how 我可以改进我的自动编码器以便我可以使用压缩向量来测试电影相似性吗？

Answer 1

您模型的输出是 (batch_size, 260, 228, 1)，而您的目标似乎是 (batch_size, 21300)。您可以通过在模型末尾添加 tf.keras.layers.Flatten() 层或不展平输入来解决该问题。
您可能不应该使用 2D 卷积，因为在大多数文本嵌入中，相邻特征通道之间没有空间或时间相关性。您应该能够安全地重塑为 (150,142) 而不是 (150, 142, 1) 并使用一维卷积、池化和上采样层。

使用 Word2Vec 和深度卷积自编码器的电影相似度

movie similarity using Word2Vec and deep Convolutional Autoencoders

autoencoder

word2vec

deep-learning

keras

tensorflow