• <xmp id="om0om">
  • <table id="om0om"><noscript id="om0om"></noscript></table>
  • ?? AI ??????? GPU ?? SDK? NVIDIA Riva? ?????

    Reading Time: 4 minutes

    ?? AI? ??? ?? ??, ?? ?????? ?? ?????, ?? ?? ? ??? ?? ? ??? ??????? ?????. ?? AI?? ?? ?? ??(ASR)? ?? ?? ??(TTS) ????? ??????. ASR ?????? ?? ???? ???? ????, TTS ?????? ???? ???? ????.

    ? ?? ??? ?? AI ???? ??? ??? ???? ??????. ?? AI ??????? ????? ??? ?? ??? ??? ???, ???? ??? ?? ??? ?? ??? ??? ??????? ??? ?, ??? ?? ??? ?????. ?? ????? ??? ????? ????? ???? ???? ??? ??? ?? ?? 300 ???(ms)? ?? ???? ??. NVIDIA Riva? ?? AI ??? ??? ?? ? ?? ????? ?????, ??? ??? ????? ???? ?? ??? ??? ?????.

    Riva SDK

    NVIDIA Riva? ?? AI ?????? ??? ?? GPU ?? SDK???. ??? AI? ?? ??? ?? ??? ?????? ????. ?? ?????? API? ??? ??? ???? ?????, ??? ??? ? ????.

     Diagram shows workflow starting from pretrained models in NGC, TAO toolkit for retraining the models and Riva for optimized speech AI skills to generate high-performance inference.
    ?? 1. Riva? ?? ?????? ?? ?????

    Riva SDK?? ?? ??? ??/?? ??? ? ??? ??? ??????? ?? ??? NVIDIA TAO Toolkit, ??? ?? ??? ?? ??, ?? ??? ???? ?? ? ?? ???? ??? ????.

    Riva? ???? ??? ???? ???? ????? ??? ?? ??? ??? ??? ??? ??? ?? ? ????. ??? ???? ?? CPU ?? ????? 25?? ??? ??? ???? ?? ??? 150 ???? ????? ???.

    ??? AI ???? gRPC ?????(endpoint)? ?? ??? ???? ?? ?? ??? ?? ?? ??? ??? ??(NLP), ?? ?? ?? ????? ?????. ??? AI ????? ?? ?? ??? ??/?? ?????? ???? ?? ???? ????. ?? ?? ??? ??? ??? ? ???? ?????? ?? ??? ??? ??? ??? ?? ??? ?? ????.

    Riva? NVIDIA Triton Inference Server ? ?? ??? ??? ??? ??? ???? ???? ??, ???? ??? ??, ???? ??? ???? ??? ??? ?????.

    Riva ?? ??

    Riva? ??? ??(transcription)? ?? ????? ?? ?? ??? ??? ?? ??? ?? ??? ??? ???? ???? ?????. Riva? ?? ?? ??? ???? ?? ??? ??????? ??? ??? ?????. ??, ????(podcast), ???? ??? ???? ?? ??? ????? ?? ???? ??? ??? ???? ?????.

    Riva? ?? ?? ???? ?? ?? ??? ??, ??????(non-autoregressive) ??? ??? ??? ??? ??? ?????. NVIDIA V100 GPU ?? Tacotron 2/WaveGlow ??? ??? ?, Riva? ?????? ???? NVIDIA A100 GPU??? 12? ??? ??? ????. ? ??? 30? ??? ?? ??? ???? ???? ? ???? ?? ???? ?? ?????? ??? ????? ??? ??? ??? ? ????.

    Diagram shows capabilities in Riva Speech Recognition and Speech Synthesis services such as support for both batch and streaming mode, multiple sampling rates (8,16,22,32, 44, 48 kHz), and models such as Jasper, QuartzNet, CitriNet, Tacotron2, WaveGlow, Fastpitch, and HiFiGAN.
    ?? 2. Riva ??? ??

    GPU? ??? ??? ??? ????? Riva? NVIDIA Triton Inference Server? ??? ?? ???? ??? ?????(ensemble pipeline)? NVIDIA TensorRT? ?? ????? ????? ?????.

    Riva? ????? gRPC ?????? ???? ??? API? ?? ????? ???? ??? ???? ????. ?? 3? Riva ???? ?? ??? ?????. gRPC API? ??(Docker) ?????? ???? API ??? ?????. ??? ????? ??/NLP ??? ??? ??? ????.

    Diagram shows the Riva client applications such as desktop, mobile, and laptop interactions with Riva speech recognition and speech synthesis pipelines.
    ?? 3. Riva ??? ?????

    ?? API ??? NVIDIA Triton? ?? ??? ??? ??? ?????.

    NVIDIA Triton? ?? GPU?? ?? ???? ??? ?????? ???? ??? ?? ??? ??? ???? ???(backend) ?????.

    ??? AI ??????? ??, ????? ??? ??? ???? ???? ?? ?? ?????. ?? ?? ??? ?????? ???? ??? ????. GPU? ??? ???? ??? ?????? ?? ???(batch size)? ??? ??, ? ?? ??? ???? ? ? ??? ??? ??? ?? ??? ???? ???.

    ?? NVIDIA Triton? ?? ?? ? ??? ????? ?? ??? ?????.

    Riva? NGC?? ??? ??? ????? ?????? ??? ????? ?? ????(bare-metal)? ?? ?????, ?? ???? ?? ??(Helm chart)? ?????? ??? ? ????.

    ???? Riva? ?????? ??? ??? ???????. ???(Python) ?????? ??? ??? API ??? ?? ????? ???? Riva ???? ??? ? ???? ????. ?? ?? ?? TTS Riva ???? ??? ???? 3?? ??? ??? ????.

    ?? Riva API? ???? ???:

    import src.riva_proto.riva_tts_pb2 as rtts 
    import src.riva_proto.riva_tts_pb2_grpc as rtts_srv 
    import src.riva_proto.riva_audio_pb2 as ri

    ???? Riva ?????? gRPC ??? ?????:

    channel = grpc.insecure_channel('localhost:50051') 
    riva_tts = rtts_srv.RivaSpeechSynthesisStub(channel)

    ?? TTS ??? ?????:

    req = rtts.SynthesizeSpeechRequest() 
    req.text = "We know what we are, but not what we may be?" 
    req.language_code = "en-US" 
    req.encoding = ri.AudioEncoding.LINEAR_PCM 
    req.sample_rate_hz = 22050 
    req.voice_name = "ljspeech" 
    resp = riva_tts.Synthesize(req) 
    audio_samples = np.frombuffer(resp.audio, dtype=np.float32)

    ???? ???? ?? ????????

    NVIDIA TAO Toolkit? ??? ??? Riva?? ??? ? ?? ????(?? 4). ?? ??? ??? ??? ??????? ??? ?? ??? ? ??.

    Diagram shows the workflow of TAO toolkit starting from NGC pretrained model to adding your custom data and deploying it as a Riva skill.
    ??4. NVIDIA TAO Toolkit ?????

    ?? ASR? ???? ???? ???? ???? ???? ?? ??, ?? ASR ???? ???? ???? ??? ??? ??? ?? ??? ???? ?? ????? ???? ??? ????? ??? ? ????.

    ?? ??? BERT ??? ???? ?? 1??? ?????? ?????. ??? ?????? ?? ??? ?? ??? ?? ??? ?????.

    • ?? ?? ?? ???? ? ???.
    • ?? ??? ???? ??.

    ?????? ???? ??? ??? ????? ??? ??? ?????. ??? ???? ?? ???? ???? ??? ???? ?? ??? ????? ?? NVIDIA Triton? ?? ???? ?????.

    NVIDIA Riva Speech Skills ???????? ?? ???? ??? ?? ?? ??? ??? ??? ????. ? ???? TAO Toolkit? ??? ??????? ??? ??? ? ? ??? ??? ?? ????.

    Riva?? ?? ????

    Riva? ??? ??? AI? ?? ?????. ? ??? ??? ?? ???? ????? ??? ? ??? ?? ??? ??? ??? ?? ??? ?? ?????(?? 5).

    Diagram shows the workflow for deploying TAO model as a Riva skill and highlights the TensorRT optimizations and Triton Inference Server under the hood.
    ?? 5. Riva??? ??? ?? ??? ???? ???? ??? ?????

    NGC ????? ?? ?? ??? ??? ?? ??? ?? ??? ?????. ??? ??? ?? ??? ??, ???? ?? ??? ??? ???? ??? ? ????.

    ??

    Riva? NVIDIA Developer Program? ?????? ?? ??? ???? ??? ??? ?? ?????, ??? ??? ??? ?????. ??? ??? ??? ???? AI ????? ??? ?? ???? Riva Enterprise? ??? ? ????.

    ? ??? ??? Riva Getting Started?? ?????.

    Discuss (0)
    +1

    Tags

    ?? ???

    人人超碰97caoporen国产