Recent posts

Transformer Architecture

5 minute read

A transformer is a neural network architecture proposed in Google’s 2017 paper Attention is All You Need. It is the basis of many NLP models such as BERT, GP...