lstm_attention.py 训练问题 #4

sue2415535899 · 2018-12-13T11:53:32Z

您好，我在执行您的代码时运行到
out, attn, alpha = tf.split(lstm_net.outputs, [n_hidden, d_local, a_local ** 2], axis=2)
这一步报错：tensorflow.python.framework.errors_impl.InvalidArgumentError: Sum of output sizes must match the size of the original Tensor along the split dimension or the sum of the positive sizes must be less if it contains a -1 for 'split_1' (op: 'SplitV') with input shapes: [?,?,512], [3], [] and with computed input tensors: input[1] = <512 512 49>, input[2] = <2>.
我理解的是lstm_net.outputs 输出的是[?,?,512]维的tensor ,不能拆分为512，512，49维的out ,attn ,alpha。
我不知道是源代码问题，还是tensorflow 版本问题。
想问您原网络lstm_net输出的是[?,?,1073]还是[?,?,512]呢？

qiongxiao · 2019-06-24T03:49:55Z

我运行lstm_attention.py也遇到同样的问题，期待作者来解答。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lstm_attention.py 训练问题 #4

lstm_attention.py 训练问题 #4

sue2415535899 commented Dec 13, 2018

qiongxiao commented Jun 24, 2019

lstm_attention.py 训练问题 #4

lstm_attention.py 训练问题 #4

Comments

sue2415535899 commented Dec 13, 2018

qiongxiao commented Jun 24, 2019