Connecting images and natural language