nanoVLM

Lightweight vision-language model training for embodied AI

View on GitHub

Overview

nanoVLM is a minimal, educational implementation for training vision-language models. Ideal for rapid prototyping and understanding VLM architectures for embodied AI applications.