DALL-E: Zero-Shot Text-to-Image Generation | Paper Explained
❤️ Become The AI Epiphany Patreon ❤️ ►
In this video I cover DALL-E, or the “Zero-Shot Text-to-Image Generation” paper, by the OpenAI team.
They train a VQ-VAE to learn compressed image representations and then train an autoregressive transformer on top of that discrete latent space combined with BPE-encoded text.
The model learns to combine distinct concepts in a plausible way, image-to-image translation capabilities emerge, and more.
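The core idea, a single autoregressive sequence mixing text and image tokens, can be sketched in a few lines. This is a minimal illustration, not the paper's code: the vocabulary sizes match those reported by OpenAI, but the token-offset helper and toy values below are assumptions for demonstration.

```python
# Minimal sketch of the DALL-E token layout: BPE text tokens and discrete
# VQ-VAE image tokens are concatenated into one stream, which an
# autoregressive transformer then models left to right.

TEXT_VOCAB = 16384   # BPE text vocabulary size from the paper
IMAGE_VOCAB = 8192   # VQ-VAE codebook size from the paper

def build_sequence(text_tokens, image_tokens):
    """Concatenate text and image tokens into one sequence.

    Image tokens are offset by TEXT_VOCAB so both vocabularies can
    share a single embedding table without index collisions.
    (Hypothetical helper, for illustration only.)
    """
    return list(text_tokens) + [t + TEXT_VOCAB for t in image_tokens]

# Toy example: 3 BPE text tokens followed by 4 image codebook indices
seq = build_sequence([5, 42, 7], [0, 8191, 12, 3])
print(seq)  # [5, 42, 7, 16384, 24575, 16396, 16387]
```

During training the transformer simply predicts the next token in this combined stream; at inference time, the text tokens are fixed and the image tokens are sampled one by one, then decoded back to pixels by the VQ-VAE decoder.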
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
✅ Paper:
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
⌚️ Timetable:
00:00 What is DALL-E?
03:25 VQ-VAE blur problems
05:15 Transformers