AI Inference / Inference Microservices

2025? 4? 16?
NVIDIA, Meta Llama 4 Scout ? Maverick??? ?? ???
?? ??? ??? Llama AI ??? ?? ??, Llama 4 Scout? Llama 4 Maverick? ??? ??????.
3 MIN READ

2025? 3? 12?
Spotlight: NVIDIA TensorRT-LLM? ??? NAVER Place? SLM Vertical Service ?? ????
NAVER Place??? Place ??? ??? SLM Vertical Service? ???? ???? ???? ??(????, ??…
7 MIN READ

2025? 2? 13?
DeepSeek-R1 ? ?? ?? ????? ?? GPU ?? ?? ???
AI ??? ?? ? ??? ??? ???? ?? ??? ?????, ??? ?? ?? ?? ?? ?? ????? ??? ???? ????.
4 MIN READ

2025? 2? 7?
OpenAI Triton, NVIDIA Blackwell?? AI ?? ? ??????? ??
?? ??? ??? ????? ?? AI ????? ??? ?????. NVIDIA cuDNN? ?? ?????? ??? ???? ??? ????…
3 MIN READ

2024? 12? 13?
NVIDIA TensorRT-LLM, ????? ??? ???-??? ?? ???
NVIDIA? ?? NVIDIA TensorRT-LLM? ???-??? ?? ????? ?????? ??????.
3 MIN READ

2024? 11? 15?
NVSwitch? TensorRT-LLM ????? 3? ?? AllReduce ??
??? ?? ?? ??? ??? ??? ???? ? ??? ?? ??? ??? ?? ???? ???? ??? AI ????? ???? ?? ??…
3 MIN READ