visual-language-models

Here are 2 public repositories matching this topic...

NxtGenLegend / TreeHacks-ZoneOut

#3 Winner of Best Use of Zoom API at Stanford TreeHacks 2025! An AI-powered meeting assistant that captures video, audio and textual context from Zoom calls using multimodal RAG.

Updated Feb 16, 2025
JavaScript

alessioborgi / RealTime-VLM

Star

RealTime-VLM brings real-time VLM inference to the browser. It continuously captures webcam frames, sends image+text to an OpenAI-compatible API, and displays responses with sub-second latency. Works with local or hosted VLMs.

computer-vision vlm vision-language-model visual-language-models real-time-vlm

Updated Aug 11, 2025
JavaScript

Improve this page

Add a description, image, and links to the visual-language-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the visual-language-models topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly