20/11/2023 Implementing Vision-Enabled Dialogue in Social Robots

Hey there!👋 Imagine a world where your favourite chatbot or social robot isn't just responding to text-based inputs but is also getting a real-time visual sneak peek into the conversation. Exciting, right? Well, we implemented just that with the help of GPT-4:


Combining live visual input from a webcam or social robot, and large language models results in a conversational experience that's actually context-aware, and you can do it too! Check out our Tutorial and read our paper I Was Blind but Now I See: Implementing Vision-Enabled Dialogue in Social Robots for more details.