Chapter 1. Introduction to Vision and Language
We are social and visual animals. We evolved in groups, depending on one another for survival. Our brains are wired for communication, empathy, and connection. Even our emotional well-being is deeply tied to our relationships.
To strengthen our bonds, we developed language, a tool to optimize communication, and, soon after, writing systems that allowed us to reach beyond the limits of face-to-face interaction.
Yet, among all our senses, vision dominates our perception of the world. A huge part of our brain is dedicated to processing visual information. We rely on sight to understand our environment, navigate spaces, recognize others, and pick up subtle social cues like facial expressions and body language.
It’s no surprise, then, that when we set out to build intelligent systems, we were captivated by two grand challenges: teaching them language to communicate with us, and vision to perceive the world as we do.
At first, our systems were limited ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access