Advertisement

Transformer

The neural network architecture, based on attention, behind modern large language models.

Transformers process sequences in parallel, unlike older recurrent networks.

Advertisement

Related terms

Back to Fundamentals of Generative AI

Advertisement