How do different kinds of visual information come together?