Author: Pavlo Molchanov | NVIDIA Technical Blog

Pavlo Molchanov

Pavlo Molchanov is a distinguished research scientist and manager at NVIDIA Research. He leads the Deep Learning Efficiency Research team. His main areas of interest include LLM and VLM efficiency, novel architecture design, post-training model compression, and adaptive/conditional inference.

Posts by Pavlo Molchanov

Generative AI Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,... 12 MIN READ

Generative AI May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Note: As of January 6, 2025 VILA is now part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently.... 11 MIN READ