[???] NVIDIA AI ??? ???? ??? ‘? ?? ??’

‘NVIDIA AI ??? ?? – TensorRT/Triton Inference Server’? 4? 21? ?? 2??? 4??? ????? ?????. ?? ????? ?? ?? ? ?? ?? ??? ??? ?? ???? ?????.

NVIDIA GPU? ??? ? ?? ??? ?? ??? ?? ??? ???? ???? ?? ??? ????? ???? .predict(), .forward() ??? ???? ???? ???? ???? ???? ?? ??? ?? ??? ???? ?? ??, ???? ????? ???? NVIDIA? ??? ?? ??? ?????.

?? ????? NVIDIA ?? ?????? ???? ?????. ? ?? ?? ???? ?? TensorRT? ?? ???? ??? ?? ??? ?? NVIDIA Triton Inference Server? ?? ?? ????, ??? ?? ??? ?? ?? ?? ? ?? ?????? ???? ?? ??? ?? ? ????.

??? NVIDIA ??? ???? ??? ??? Developer Relations ??? ??? ??? ????.

??? NVIDIA ?????? ??? ?? ???? ??? ?? ? ?? ???? ??? ????? ?? ???? ?????. ????? ???? ? ??? ??? ???? ??? ? ????. ??? ???? ??? ??? ?? ??? ???? ??? ??????.

?? ??

??? ??

NVIDIA? ??? ??? ?? ????, ?? ????? ????, ?? ?? ?? ? ?? ?? ?? ?? ??? ???? Developer Relations ??? ??????. ?? ??(Samsung Electronics)/??? ??????(Lucent Technologies)?? ??? ?????, ????(Xilinx)?? FAE(field application engineer)? ??????. ??????? ???? ?? ? ?? ??? ??????.

??? ??

NVIDIA? ??? ?????, ???????? ???? ??? ??? ?? ??? ??????. ?? NVIDIA ???? GPU ??? ???? ??? ??? ???? ????

Platform & SDK

TensorRT

AI ??? ?? ?????? ???? ???? ??? ??? ?? ?? ??? ?? ??? ? ?? ??? ??????. ??? ??? ???, ???? ?? ??, ?? ????? ???? ??? ?????. ?? ??????? ? ?? ???? ???? ???? NVIDIA TensorRT? ?????. ????? ????? ?? ??? ??? ??????? ???(throughput)? ????? CNN, RNN? ?????? ??? ?? ????? ??????. ?? TensorRT? ???? ?? ??? ?? SDK? 25,000? ??? ??? ?????? ??? ????? ??? ??? ?? ???? ????.

Triton Inference Server

NVIDIA Triton Inference Server(Triton)? ??? ????? ??? ?? ?? ??? ????? ?? ?? ?? ??? ????????. ?? ??(inference serving)? ???? ??? ???? ?? ???? ?????? ?? ??? ??? ? ????. Triton? ????, ?????, ??? GPU/CPU ?? ????? ??? ?? CUDA? ??? ???? ???? ?? ?????(TensorRT, TensorFlow, ONNX, PyTorch ?)? ?????. Triton? ??? ??, ??? ??, ?? ??? ?? ?? ?????? ??? ?????? ????? ???? ?? ??? ? ?? ???? ?????.

