
GPT-2: 1.5B release - OpenAI
Nov 5, 2019 · As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection of outputs of GPT-2 models. While there have been larger language models released since August, we’ve continued with our original staged release plan in order to provide the community with a …
OpenAI GPT2 - Hugging Face
To get proper results, you should use the openai-community/gpt2 checkpoint. If you get out-of-memory when loading that checkpoint, you can try adding device_map="auto" in the from_pretrained call.
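A minimal loading sketch of the device_map="auto" tip above, assuming the transformers and accelerate packages are installed; only the checkpoint name comes from this result, the rest is illustrative:

    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("openai-community/gpt2")
    # device_map="auto" asks accelerate to place the weights across available
    # GPUs/CPU memory, which is the out-of-memory workaround the docs describe.
    model = AutoModelForCausalLM.from_pretrained("openai-community/gpt2", device_map="auto")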
GitHub - openai/gpt-2: Code for the paper "Language Models are ...
GPT-2 models' robustness and worst case behaviors are not well-understood. As with any machine-learned model, carefully evaluate GPT-2 for your use case, especially if used without fine-tuning or in safety-critical applications where reliability is important.
openai-community/gpt2 - Hugging Face
Test the whole generation capabilities here: https://transformer.huggingface.co/doc/gpt2-large. Pretrained model on English language using a causal language modeling (CLM) objective. It was introduced in this paper and first released at this page. Disclaimer: The team releasing GPT-2 also wrote a model card for their model. Content from this ...
Training a GPT-2 Model Step by Step - Zhihu - Zhihu Column
from transformers import pipeline
generator = pipeline("text-generation", model="./my_gpt2", tokenizer="./my_gpt2")
print(generator("今天天气真好,"))
The above is the complete workflow for training and deploying a GPT-2-style model.
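The snippet shows only the deployment half; below is a hypothetical sketch of how the local ./my_gpt2 directory could be produced. The dataset, hyperparameters, and Trainer setup are illustrative assumptions, not details from the original post:

    from datasets import load_dataset
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer, TrainingArguments)

    tokenizer = AutoTokenizer.from_pretrained("openai-community/gpt2")
    tokenizer.pad_token = tokenizer.eos_token  # GPT-2 ships without a pad token
    model = AutoModelForCausalLM.from_pretrained("openai-community/gpt2")

    # Any plain-text corpus works; wikitext-2 is only a placeholder here.
    dataset = load_dataset("wikitext", "wikitext-2-raw-v1", split="train")
    tokenized = dataset.map(
        lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
        batched=True, remove_columns=["text"])

    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="./my_gpt2", num_train_epochs=1,
                               per_device_train_batch_size=2),
        train_dataset=tokenized,
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False))
    trainer.train()
    trainer.save_model("./my_gpt2")         # model weights + config
    tokenizer.save_pretrained("./my_gpt2")  # tokenizer files the pipeline call needs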
GPT-2 Reproduction Notes: Let's reproduce GPT-2 - Zhihu - Zhihu Column
GPT-2 open-sourced its weights and inference code, but not its training code or training details; the GPT-3 paper describes the parameters and details fairly thoroughly, but its weights and code were never released. So below, GPT-2 is configured using the parameters from the GPT-3 paper.
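As one illustration of that idea, the sketch below shapes a GPT-2 model after the smallest ("Small", 125M) row of the GPT-3 paper's model-size table: 12 layers, d_model 768, 12 heads, and a 2048-token context window. These numbers come from the GPT-3 paper, not from the Zhihu post itself:

    from transformers import GPT2Config, GPT2LMHeadModel

    config = GPT2Config(
        n_layer=12,        # layers (GPT-3 Small)
        n_embd=768,        # d_model
        n_head=12,         # attention heads
        n_positions=2048)  # GPT-3 context window
    model = GPT2LMHeadModel(config)
    print(f"{sum(p.numel() for p in model.parameters()) / 1e6:.0f}M parameters")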
Pretrained Language Models: GPT-1, GPT-2 and GPT-3 - Zhihu - Zhihu Column
Preface. The Generative Pre-trained Transformer (GPT) series from OpenAI is a family of very powerful pretrained language models that achieve striking results on complex NLP tasks such as article generation, code generation, machine translation, and Q&A, without requiring supervised fine-tuning. For a new task, GPT needs only a very small ...
GPT-2 - Wikipedia, the free encyclopedia
GPT-2 - Wikipedia
GPT-2 was first announced on 14 February 2019. A February 2019 article in The Verge by James Vincent said that, while "[the] writing it produces is usually easily identifiable as non-human", it remained "one of the most exciting examples yet" of language generation programs:[17] "Give it a fake headline, and it'll write the rest of the article, complete with fake quotations and statistics."
The Illustrated GPT-2 (Visualizing Transformer Language Models)
Aug 12, 2019 · GPT-2 wasn't a particularly novel architecture; its architecture is very similar to the decoder-only transformer. GPT-2 was, however, a very large, transformer-based language model trained on a massive dataset. In this post, we'll look at the architecture that enabled the model to produce its results.
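A minimal sketch of the causal (masked) self-attention that makes GPT-2 "decoder-only": each position may attend only to itself and earlier positions. This is a single-head, NumPy-only illustration, not GPT-2's actual multi-head implementation:

    import numpy as np

    def causal_self_attention(x, w_q, w_k, w_v):
        # x: (seq_len, d_model); w_q/w_k/w_v: (d_model, d_head)
        q, k, v = x @ w_q, x @ w_k, x @ w_v
        scores = q @ k.T / np.sqrt(k.shape[-1])            # (seq_len, seq_len)
        mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
        scores = np.where(mask, -1e9, scores)              # hide future tokens
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)     # softmax over visible positions
        return weights @ v                                 # (seq_len, d_head)

    rng = np.random.default_rng(0)
    x = rng.normal(size=(5, 8))                            # 5 tokens, toy d_model = 8
    w_q, w_k, w_v = (rng.normal(size=(8, 4)) for _ in range(3))
    print(causal_self_attention(x, w_q, w_k, w_v).shape)   # (5, 4)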