Pavlo Molchanov

Pavlo Molchanov is a distinguished research scientist and manager at NVIDIA Research. He leads the Deep Learning Efficiency Research team. His main areas of interest include LLM and VLM efficiency, novel architecture design, post-training model compression, and adaptive/conditional inference.

Posts by Pavlo Molchanov

Generative AI

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,... 12 MIN READ
Generative AI

Visual Language Models on NVIDIA Hardware with VILA

Note: As of January 6, 2025, VILA is part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently.... 11 MIN READ