Deepmind released overview of transformer architecture and algorithm

George S
Apr 17, 2023

This describes what they are, how they are trained and what they are used for. Also key architectural components.

https://arxiv.org/abs/2207.09238

--

--

George S

senior ML researcher, sharing knowledge and news in AI