1Model pretrained on C4 using T5's unsupervised objective for ~500k steps, model 2size is comparable to T5's base ~770m parameters. 3 4