Difference between revisions of "GPT-3"
Jump to navigation
Jump to search
Tags: Mobile web edit, Mobile edit |
|||
(11 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
− | [[wikipedia:GPT-3]] | + | [[wikipedia:GPT-3]] (Jun 2020) |
[[wikipedia:Generative Pre-trained Transformer 3]] | [[wikipedia:Generative Pre-trained Transformer 3]] | ||
+ | The architecture is a decoder-only [[transformer network]] with a 2048-token-long context and 175 billion [[parameters]], requiring 800GB to store. | ||
− | [[Deep learning]] | + | * [[Deep learning]] |
+ | * [[NLP]] | ||
− | {{OpenAI}} | + | * [[Transformer (machine learning model)]] |
+ | * [[ChatGPT]] | ||
+ | * [[GPT-2]] | ||
+ | |||
+ | == See also == | ||
+ | * {{GPT}} | ||
+ | * {{NLP}} | ||
+ | * {{OpenAI}} | ||
+ | |||
+ | [[Category:IT]] |
Latest revision as of 15:22, 9 April 2023
wikipedia:GPT-3 (Jun 2020)
wikipedia:Generative Pre-trained Transformer 3
The architecture is a decoder-only transformer network with a 2048-token-long context and 175 billion parameters, requiring 800GB to store.
See also[edit]
Advertising: