GPT-3
The architecture is a decoder-only transformer network with a 2048-token context window and a then-unprecedented 175 billion parameters, requiring 800 GB of storage.
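The relationship between parameter count and storage size is simple arithmetic: each weight occupies a fixed number of bytes depending on numeric precision. A minimal sketch (the ~800 GB figure cited above presumably reflects a particular checkpoint format with overhead beyond raw weights, which is an assumption here):

```python
# Rough storage estimate for a 175-billion-parameter model.
N_PARAMS = 175_000_000_000

def storage_gb(n_params: int, bytes_per_param: int) -> float:
    """Gigabytes (10**9 bytes) needed to store n_params weights."""
    return n_params * bytes_per_param / 1e9

print(f"fp32: {storage_gb(N_PARAMS, 4):.0f} GB")  # 700 GB at 4 bytes/param
print(f"fp16: {storage_gb(N_PARAMS, 2):.0f} GB")  # 350 GB at 2 bytes/param
```

At 4 bytes per parameter (fp32) the raw weights alone come to 700 GB, consistent with a full checkpoint on the order of 800 GB once optimizer state or metadata is included.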