
sg.sample_sequence returns context after pre-trained model #12

Open
bytes-commerce opened this issue Oct 6, 2020 · 3 comments

@bytes-commerce

First of all, thanks for providing this amazing repository bringing GPT-2 to TF2!
Secondly, I was using the README to pre-train my model and then used sequence_generator.py to pass some context to the model.

However, the response is always identical to the context, except that the capital letters are replaced with ??s. The question now is: what am I doing wrong? Have I maybe forgotten something? Is there an edge case leading to this that could be prevented?

Please let me know any additional information you might need! Thanks a lot!

@akanyaani akanyaani self-assigned this Oct 10, 2020
@akanyaani akanyaani added the bug Something isn't working label Oct 10, 2020
@jzl0166

jzl0166 commented Oct 13, 2020

same problem

@jspangl3r

Also getting weird output like this.

@vedranbajic

vedranbajic commented Dec 11, 2020

First of all, thank you for sharing your code! It helped me a lot getting started with GPT-2.
I really do not know if this is relevant, but I just debugged sample.py.

output will only append zeros:
tf.Tensor([[ 3 13727 5825 0 0 0 0 0 ...]], shape=(1, 515), dtype=int32)

If my sequence length is 512, I get 512 zeros (plus the 3 non-zero token ids at the start, from my context).
My output is just the words I provided as context, because the rest is 0.

edit 1:
logits is always NaN in my case, resulting in 0s.
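
As a side note, here is a minimal NumPy sketch (not the repository's actual sampling code, and `next_token` is a hypothetical helper) of why all-NaN logits collapse into a run of token id 0:

```python
import numpy as np

def next_token(logits):
    """Hypothetical stand-in for one greedy decoding step; the real
    repo samples with TensorFlow, but the NaN failure is the same idea."""
    if np.isnan(logits).any():
        raise ValueError("logits contain NaN - upstream weights are broken")
    return int(np.argmax(logits))

# vocab-sized logits, all NaN (the observed bug)
bad_logits = np.full(50257, np.nan)
# np.argmax over an all-NaN array silently returns index 0, which is
# why the generated tail becomes a run of zeros:
print(int(np.argmax(bad_logits)))  # 0

# healthy logits pick the expected index
good_logits = np.zeros(50257)
good_logits[42] = 1.0
print(next_token(good_logits))  # 42
```

A guard like the `ValueError` above would at least fail loudly instead of silently appending zeros.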

edit 2:
self.embedding_weights is NaN. Maybe something's wrong with the initializer?
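
For anyone debugging this, a quick sanity check worth trying right after construction (a sketch only; the variable names and the stddev-0.02 normal init, as used in the original GPT-2 code, are assumptions, not necessarily what this repo does):

```python
import numpy as np

VOCAB_SIZE, EMBED_DIM = 50257, 768  # GPT-2 small dimensions

# A sane random-normal initialization (stddev 0.02) should never
# contain NaN immediately after the weights are created.
rng = np.random.default_rng(0)
embedding_weights = rng.normal(0.0, 0.02, size=(VOCAB_SIZE, EMBED_DIM))

assert not np.isnan(embedding_weights).any(), "NaN right after init"
print(float(embedding_weights.std()))  # should be close to 0.02
```

If the equivalent check on `self.embedding_weights` already fails before any training step, the initializer (or a weight-loading path) is the culprit rather than the sampling loop.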
