
How are sentence embeddings calculated from the corpus using the Sent2vec command? #86

Open
tqx94 opened this issue Sep 12, 2019 · 0 comments

tqx94 commented Sep 12, 2019

Hi,

When using the sent2vec command, a model is produced via a CBOW-style training procedure.
According to the paper, sent2vec averages the word vectors based on the weights learned during the corpus training phase.
But how does CBOW initialise and update the weights, and what n-grams are used?
For instance, when training on a Wikipedia corpus, what goes on under the hood to calculate the different weights and dimensions for the sentence 'I ate my breakfast in the morning'?
What are the unigrams and bigrams that get averaged here? How is the initialisation of the weights done? And what is the target/source word in the sentence above? Thanks
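For concreteness, here is a rough sketch of what I assume happens at inference time, based on my reading of the paper: the sentence embedding is just the average of the unigram and bigram vectors of the sentence. The lookup table, function names, and the uniform initialisation below are my own assumptions for illustration, not the actual C++ implementation:

```python
import numpy as np

DIM = 700  # embedding dimension, e.g. what -dim is set to at training time

# Hypothetical lookup table: one learned vector per unigram and per bigram.
# Here I just initialise them uniformly (fastText-style) for illustration;
# in a trained model these would be the learned input vectors.
vectors = {token: np.random.uniform(-1.0 / DIM, 1.0 / DIM, DIM)
           for token in [
               "i", "ate", "my", "breakfast", "in", "the", "morning",
               "i ate", "ate my", "my breakfast", "breakfast in",
               "in the", "the morning",
           ]}

def extract_ngrams(sentence, word_ngrams=2):
    """Unigrams plus contiguous n-grams up to word_ngrams (bigrams here)."""
    words = sentence.lower().split()
    ngrams = list(words)  # unigrams
    for n in range(2, word_ngrams + 1):
        ngrams += [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]
    return ngrams

def sentence_embedding(sentence):
    """Average of the unigram and bigram vectors, as I understand the paper."""
    grams = extract_ngrams(sentence)
    return np.mean([vectors[g] for g in grams], axis=0)

emb = sentence_embedding("I ate my breakfast in the morning")
print(emb.shape)  # (700,)
```

And my understanding of training is that each word of the sentence in turn acts as the target, the remaining unigrams/bigrams of the sentence are averaged to form the source/context (like CBOW with the window being the whole sentence), and the vectors are updated with negative sampling. Is that right?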
