Large Language Models (LLMs) have advanced considerably in generating and understanding text, and recent developments have extended these capabilities to multimodal LLMs that integrate both visual and ...