This is an implementation from Transformer from scratch for self-educative purposes. This was created following the tutorial from Umar Jamil