Transformer Decoder Model, Neural Network, Backpropagation

Gradient Descent, Masked Attention, Positional Encoding, Transformer Decoder Model
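
As a rough illustration of two of the topics listed above, here is a minimal pure-Python sketch of causal (masked) attention weights and sinusoidal positional encoding. The function names (`causal_mask`, `masked_attention_weights`, `positional_encoding`) and the base of 10000 are assumptions chosen for this example; they are not taken from this repository's code.

```python
import math


def causal_mask(n):
    """Return an n x n mask: True where query i may attend to key j (j <= i)."""
    return [[j <= i for j in range(n)] for i in range(n)]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def masked_attention_weights(scores):
    """Mask future positions of a square score matrix, then softmax each row."""
    n = len(scores)
    mask = causal_mask(n)
    weights = []
    for i in range(n):
        # Future positions get -inf so they receive zero weight after softmax.
        row = [scores[i][j] if mask[i][j] else float("-inf") for j in range(n)]
        weights.append(softmax(row))
    return weights


def positional_encoding(num_positions, d_model):
    """Sinusoidal encoding: sine on even dimensions, cosine on odd dimensions."""
    table = []
    for pos in range(num_positions):
        row = []
        for k in range(d_model):
            # Dimensions 2i and 2i+1 share the same frequency.
            angle = pos / (10000 ** ((2 * (k // 2)) / d_model))
            row.append(math.sin(angle) if k % 2 == 0 else math.cos(angle))
        table.append(row)
    return table
```

With uniform scores, each token spreads its attention evenly over itself and earlier tokens only, which is the property a decoder relies on during training.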