OpenAI Multimodal Research
openai.com
Fascinating stuff. The idea that visual and language generation could be generalized with the same underlying model was the most interesting part of Lex Fridman's podcast with Ilya Sutskever in May: https://www.youtube.com/watch?v=13CZPWmke6A
https://news.ycombinator.com/item?id=25649557
and https://news.ycombinator.com/item?id=25649740
point to the two specific releases.
Very cool stuff. However, I didn't find any links to the code on GitHub. Is this release also part of their GPT-2 strategy?