There are multiple definitions:
vastus lateralis muscle(English)
Vision-Language Model(English)
- an advanced AI system that jointly processes and interprets both visual data (images and videos) and text. By merging computer vision with natural language processing, VLMs can "see" a visual input, understand its context, and reason or generate text about it
- AI, CLIP, GLIP, UDA
- Medical informatics, Medical imaging, Machine learning, Model(ing), Speaking / language [incl. disorders]
- https://doi.org…7701.2024.00267