
GPT-2: 1.5B release - OpenAI
Nov 5, 2019 · As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection …
GPT-2 - Wikipedia
Generative Pre-trained Transformer 2 (GPT-2) is a large language model by OpenAI and the second in their foundational series of GPT models. GPT-2 was pre-trained on a dataset of 8 …
openai-community/gpt2 - Hugging Face
GPT-2 is a transformers model pretrained on a very large corpus of English data in a self-supervised fashion. This means it was pretrained on the raw texts only, with no humans …
GitHub - openai/gpt-2: Code for the paper "Language Models are ...
This repository is meant to be a starting point for researchers and engineers to experiment with GPT-2. For basic information, see our model card.
OpenAI GPT2 - Hugging Face
GPT-2 is a large transformer-based language model with 1.5 billion parameters, trained on a dataset[1] of 8 million web pages. GPT-2 is trained with a simple objective: predict the next …
GPT-2: 6-month follow-up - OpenAI
Aug 20, 2019 · We’re releasing the 774 million parameter GPT‑2 language model after the release of our small 124M model in February, staged release of our medium 355M model in …
Better language models and their implications - OpenAI
Feb 14, 2019 · GPT‑2 is a large transformer (opens in a new window)-based language model with 1.5 billion parameters, trained on a dataset A of 8 million web pages. GPT‑2 is trained …
Our largest model, GPT-2, is a 1.5B parameter Transformer that achieves state of the art results on 7 out of 8 tested lan-guage modeling datasets in a zero-shot setting but still underfits …
GPT-2 Explained - Papers With Code
GPT-2 is a Transformer architecture that was notable for its size (1.5 billion parameters) on its release. The model is pretrained on a WebText dataset - text from 45 million website links.
The Illustrated GPT-2 (Visualizing Transformer Language Models)
Aug 12, 2019 · The OpenAI GPT-2 exhibited impressive ability of writing coherent and passionate essays that exceed what we anticipated current language models are able to produce. The …