The study investigates how text-based models like LLMs perceive and interpret visual information in exploring the intersection of language models and visual understanding. The research ventures into uncharted territory, probing the extent to which models designed for text processing can encapsulate and depict visual concepts, a challenging area considering the inherent non-visual nature of these…
