Gpt-2 - Search

About 106,000,000 results

Open links in new tab

Any time

openai.com
https://openai.com › index
GPT-2: 1.5B release - OpenAI
Nov 5, 2019 · As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection …
wikipedia.org
https://en.wikipedia.org › wiki
GPT-2 - Wikipedia
Generative Pre-trained Transformer 2 (GPT-2) is a large language model by OpenAI and the second in their foundational series of GPT models. GPT-2 was pre-trained on a dataset of 8 …
huggingface.co
https://huggingface.co › openai-community
openai-community/gpt2 - Hugging Face
GPT-2 is a transformers model pretrained on a very large corpus of English data in a self-supervised fashion. This means it was pretrained on the raw texts only, with no humans …
github.com
https://github.com › openai
GitHub - openai/gpt-2: Code for the paper "Language Models are ...
This repository is meant to be a starting point for researchers and engineers to experiment with GPT-2. For basic information, see our model card.
huggingface.co
https://huggingface.co › docs › transformers › model_doc
OpenAI GPT2 - Hugging Face
GPT-2 is a large transformer-based language model with 1.5 billion parameters, trained on a dataset[1] of 8 million web pages. GPT-2 is trained with a simple objective: predict the next …
openai.com
https://openai.com › index
GPT-2: 6-month follow-up - OpenAI
Aug 20, 2019 · We’re releasing the 774 million parameter GPT‑2 language model after the release of our small 124M model ⁠ in February, staged release of our medium 355M model ⁠ in …
openai.com
https://openai.com › index › better-language-models
Better language models and their implications - OpenAI
Feb 14, 2019 · GPT‑2 is a large transformer ⁠ (opens in a new window)-based language model with 1.5 billion parameters, trained on a dataset A of 8 million web pages. GPT‑2 is trained …
openai.com
https://cdn.openai.com › better-language-models › ...
[PDF]
Language Models are Unsupervised Multitask Learners - OpenAI
Our largest model, GPT-2, is a 1.5B parameter Transformer that achieves state of the art results on 7 out of 8 tested lan-guage modeling datasets in a zero-shot setting but still underfits …
paperswithcode.com
https://paperswithcode.com › method
GPT-2 Explained - Papers With Code
GPT-2 is a Transformer architecture that was notable for its size (1.5 billion parameters) on its release. The model is pretrained on a WebText dataset - text from 45 million website links.
jalammar.github.io
https://jalammar.github.io
The Illustrated GPT-2 (Visualizing Transformer Language Models)
Aug 12, 2019 · The OpenAI GPT-2 exhibited impressive ability of writing coherent and passionate essays that exceed what we anticipated current language models are able to produce. The …

Pagination
- 1
- 2
- 3
- 4
- Next

GPT-2: 1.5B release - OpenAI

GPT-2 - Wikipedia

openai-community/gpt2 - Hugging Face

GitHub - openai/gpt-2: Code for the paper "Language Models are ...

OpenAI GPT2 - Hugging Face

GPT-2: 6-month follow-up - OpenAI

Better language models and their implications - OpenAI

Language Models are Unsupervised Multitask Learners - OpenAI

GPT-2 Explained - Papers With Code

The Illustrated GPT-2 (Visualizing Transformer Language Models)