Concepts

Embedding

nn.Embedding(num_embeddings: int, embedding_dim: int)

num_embeddings : size of the dictionary of embeddings
embedding_dim : the size of each embedding vector
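As a minimal sketch of how these two parameters shape the layer, assuming PyTorch is installed (the sizes 5 and 3 are arbitrary values chosen for illustration):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)  # make the random initial weights reproducible

vocab_size = 5      # hypothetical: number of distinct tokens
embedding_dim = 3   # hypothetical: length of each embedding vector

embedding = nn.Embedding(num_embeddings=vocab_size, embedding_dim=embedding_dim)

# The layer stores one learnable row per token.
print(embedding.weight.shape)  # torch.Size([5, 3])

# Looking up token ids returns the matching rows of the weight matrix.
token_ids = torch.tensor([0, 2, 2, 4])
vectors = embedding(token_ids)
print(vectors.shape)  # torch.Size([4, 3])
```

The same token id always returns the same row, so positions 1 and 2 of vectors are identical here.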

Lookup table

Like a hashmap, but instead of hashing the key to locate the value, we use the key directly as an index to access the value.
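The difference can be sketched in plain Python, using a dict as the hashmap and a list as the lookup table (the stored values are made up for illustration):

```python
# Hashmap: the key is hashed to find where the value is stored.
hash_map = {"a": 10, "b": 20, "c": 30}

# Lookup table: the integer key is itself the position of the value.
lookup_table = [10, 20, 30]

key = 1
value = lookup_table[key]
print(value)  # 20
```

This only works when the keys are small consecutive integers, which is exactly what a tokenizer produces.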

Embedding lookup table in machine learning

Transforms one integer token into a vector of fixed size (embedding_dim) whose entries are learnable weights. These weights are trained using the gradient of the loss.
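A small sketch of how the gradient reaches the table, assuming PyTorch. The "loss" below is a hypothetical stand-in (a plain sum), not the loss used in the repository:

```python
import torch
import torch.nn as nn

embedding = nn.Embedding(num_embeddings=4, embedding_dim=2)

token_ids = torch.tensor([1, 3])
vectors = embedding(token_ids)

# Hypothetical stand-in for a real loss: the sum of the looked-up values.
loss = vectors.sum()
loss.backward()

# Rows 1 and 3 were looked up, so only they receive a non-zero gradient.
print(embedding.weight.grad)
```

Rows that were not looked up keep a zero gradient, so an optimizer step only changes the vectors of tokens that appeared in the input.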

integer token : A tokenizer transforms each token (here, a single letter) into a number. This number then becomes the identifier of that token. Signature: encode(input: str)
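A character-level tokenizer along these lines could look like the sketch below. The training text, the stoi/itos names and the decode helper are assumptions made for illustration; only the encode(input: str) signature comes from this README:

```python
text = "hello world"  # hypothetical training text

# Vocabulary: every distinct character, sorted for a stable order.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # character -> integer
itos = {i: ch for ch, i in stoi.items()}      # integer -> character

def encode(input: str) -> list[int]:
    """Map each character to its integer token."""
    return [stoi[ch] for ch in input]

def decode(tokens: list[int]) -> str:
    """Map integer tokens back to characters."""
    return "".join(itos[i] for i in tokens)

tokens = encode("hello")
print(tokens)          # [3, 2, 4, 4, 5]
print(decode(tokens))  # hello
```

The integers carry no meaning on their own; they only identify which row of the embedding table to read.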

LOSS

Cross Entropy :

LOGITS

In machine learning, logits are the raw, unnormalized outputs of the model. They represent its predictions before a function such as softmax turns them into probabilities.
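To make the link between logits and the cross-entropy loss concrete, here is a sketch in plain Python. The three logit values and the target class are made-up numbers:

```python
import math

logits = [2.0, 1.0, 0.1]  # hypothetical raw model outputs for 3 classes
target = 0                # hypothetical index of the correct class

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax(logits)

# Cross entropy for one example: negative log-probability of the correct class.
loss = -math.log(probs[target])

print(probs)
print(loss)  # about 0.417
```

The loss shrinks toward 0 as the probability assigned to the correct class approaches 1.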

Batch, Temporal, Channels
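In the referenced video, tensors are commonly shaped (B, T, C): B sequences per batch, T positions per sequence, and C values per position. The sketch below assumes PyTorch and uses arbitrary sizes and random data to show how such logits can be flattened before computing the cross-entropy loss:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical sizes: 2 sequences, 4 tokens each, 5-token vocabulary.
B, T, C = 2, 4, 5

logits = torch.randn(B, T, C)          # one score per vocabulary entry, per position
targets = torch.randint(0, C, (B, T))  # the correct next token at each position

# F.cross_entropy accepts (N, C) logits with (N,) targets,
# so batch and time are merged into a single dimension.
loss = F.cross_entropy(logits.view(B * T, C), targets.view(B * T))

print(logits.shape)  # torch.Size([2, 4, 5])
print(loss.item())
```

Merging B and T treats every position of every sequence as an independent classification example.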

References :

Andrej Karpathy. Let's build GPT: from scratch, in code, spelled out. https://www.youtube.com/watch?v=kCc8FmEb1nY

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin. Attention Is All You Need. https://arxiv.org/abs/1706.03762

