Colossal Clean Crawled Corpus (C4): Open-Source NLP Pretraining Corpus by Google tensorflow.org 4 points by Riccardo_G 6 years ago · 0 comments Reader PiP Save No comments yet.