Models
Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub
NVIDIA has released the Llama Nemotron Nano, a new vision-language model (VLM) now available on the Hugging Face Hub. This model features a compact architecture with 7 billion parameters and demonstrates state-of-the-art performance on multiple benchmark tasks, including zero-shot image captioning and visual question answering. Its integration into the Hugging Face ecosystem facilitates easy access and deployment for practitioners looking to leverage efficient VLMs in their applications.
nvidiallamavlm