LLM
Attention Is All You Need
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Learning Transferable Visual Models From Natural Language Supervision (OpenAI CLIP): https://openai.com/index/clip/