Vision Language Model

A Vision Language Model (VLM) is a multi-modal AI model that works with both images and text, a step towards more fluid human-computer interaction.

1. Turn on your camera.
2. Start live vision.