Term of the Moment

electronic


Look Up Another Term


Definition: AI transformer


An AI deep learning model that is used in a wide range of applications, including language processing, content generation and answering questions. The transformer is a major advancement over the recurrent neural network (RNN). Initially presented in a 2017 paper by eight scientists at Google, entitled "Attention Is All You Need," transformers are able to understand the relationshps between data that are far apart much more efficiently than RNNs. Instead of using labeled data, transformers find the patterns in the data mathematically.

The Transformer Token Process
First cleaned by removing punctuation and symbols, the text is formatted and turned into "tokens," which are mathematical representations of the words. Important words are identified by using "attention mechanisms." To generate results, the tokens are decoded and the output is formatted back into readable text with symbols and punctuation. See GPT and recurrent neural network.