Code and models from the paper “Language Models are Unsupervised Multitask Learners”.
gpt-2 code of above paper