Posts by Pavlo Molchanov
Generative AI
Nov 22, 2024
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Generative AI
May 03, 2024
Visual Language Models on NVIDIA Hardware with VILA
Note: As of January 6, 2025, VILA is part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently....
11 MIN READ