Attention is all you need (2017)
* Our model achieves 28.4 [[BLEU]] on the [[WMT]] 2014 English-to-German translation task
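As a rough illustration of how a BLEU score like 28.4 is computed (modified n-gram precision combined with a brevity penalty), the following is a minimal sentence-level sketch. This is a toy, not the evaluation used in the paper — published WMT numbers come from corpus-level tooling such as sacreBLEU.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    # Multiset of all n-grams in the token sequence
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(candidate, reference, max_n=4):
    """Toy sentence-level BLEU (0-100): geometric mean of modified
    1..max_n-gram precisions, scaled by a brevity penalty.
    Illustrative only; real evaluations are corpus-level."""
    if not candidate:
        return 0.0
    precisions = []
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        ref = ngrams(reference, n)
        # Clip each candidate n-gram count by its count in the reference
        overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
        total = max(sum(cand.values()), 1)
        precisions.append(overlap / total)
    if min(precisions) == 0:
        return 0.0
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / max_n)
    # Brevity penalty: punish candidates shorter than the reference
    bp = 1.0 if len(candidate) >= len(reference) else math.exp(1 - len(reference) / len(candidate))
    return 100 * bp * geo_mean
```

A perfect match scores 100; a candidate sharing no n-grams with the reference scores 0.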
Revision as of 14:18, 24 January 2023
wikipedia:Attention is all you need
== See also ==
- Transformer, GPT, Transformer 8, Etched, Megatron-Core
- Artificial neural networks, Neural network (NN), CNN, Micrograd, NPU, ConvNet, AlexNet, GoogLeNet, Apache MXNet, Neural architecture search, DAG, Feedforward neural network, NeurIPS, Feature Pyramid Network, TPU, Apple Neural Engine (ANE), LLM, TFLOPS