Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size, they are often impractical given the resource constraints many companies face. Small language models (SLMs) bridge quality and cost by offering a smaller resource footprint. SLMs are a subset of language models that tend to…
NVIDIA has consistently developed automatic speech recognition (ASR) models that set the benchmark in the industry. Earlier versions of NVIDIA Riva, a collection of GPU-accelerated speech and translation AI microservices for ASR, TTS, and NMT, support English-Spanish and English-Japanese code-switching ASR models based on the Conformer architecture, along with a model supporting multiple…
Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and technical terminology handling. The emergence of sovereign AI has highlighted critical challenges in large language models (LLMs), particularly their struggle to capture nuanced cultural and linguistic contexts beyond English-dominant…
The new model by Mistral excels at a variety of complex tasks, including text summarization, multilingual translation, reasoning, programming, question answering, and conversational AI.
Scientists have enabled a stroke survivor, who is unable to speak, to communicate in both Spanish and English by training a neuroprosthesis implant to decode his bilingual brain activity. The research, published in Nature Biomedical Engineering, comes from the lab of University of California, San Francisco professor Dr. Edward Chang. It builds on his groundbreaking work from 2021 with the…
Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition (ASR) family of models and the NVIDIA Canary multilingual, multitask ASR and translation model currently top the Hugging Face Open ASR Leaderboard. In addition, a multilingual P-Flow-based text-to-speech (TTS) model won the LIMMITS '24…
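As a hedged illustration only (not code from the post), the sketch below shows how a pretrained Parakeet checkpoint might be loaded for offline transcription with the NVIDIA NeMo toolkit; the checkpoint name and the audio file name are placeholders, not anything the post specifies.

```python
# A hedged sketch, not from the post: assumes NVIDIA NeMo is installed
# (pip install "nemo_toolkit[asr]"); the checkpoint name and WAV file are placeholders.
import nemo.collections.asr as nemo_asr

# Download a pretrained Parakeet checkpoint and run offline transcription on a local file.
asr_model = nemo_asr.models.ASRModel.from_pretrained(model_name="nvidia/parakeet-tdt-1.1b")
transcripts = asr_model.transcribe(["meeting_recording.wav"])  # hypothetical 16 kHz mono WAV
print(transcripts[0])
```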
Large language models (LLMs) are a class of generative AI models built using transformer networks that can recognize, summarize, translate, predict, and generate language using very large datasets. LLMs hold the promise of transforming society as we know it, yet training these foundation models is incredibly challenging. This blog articulates the basic principles behind LLMs…
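As a minimal sketch of one task from that list (summarization), and assuming the Hugging Face transformers library and a small publicly available checkpoint rather than anything described in the post:

```python
# A minimal sketch, assuming the Hugging Face transformers library; the checkpoint
# (sshleifer/distilbart-cnn-12-6) is an illustrative choice, not one named in the post.
from transformers import pipeline

summarizer = pipeline("summarization", model="sshleifer/distilbart-cnn-12-6")
article = (
    "Large language models are transformer networks trained on very large datasets. "
    "They can recognize, summarize, translate, predict, and generate language, and "
    "training these foundation models requires substantial compute and data."
)
print(summarizer(article, max_length=40, min_length=10)[0]["summary_text"])
```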
Generative AI is revolutionizing how organizations across all industries are leveraging data to increase productivity, advance personalized customer engagement, and foster innovation. Given its tremendous value, enterprises are looking for tools and expertise that help them integrate this new technology into their business operations and strategies effectively and reliably.
In the realm of generative AI, building enterprise-grade large language models (LLMs) requires expertise in collecting high-quality data, setting up accelerated infrastructure, and optimizing the models. Developers can begin with pretrained models and fine-tune them for their use case, saving time and getting their solutions to market faster. Developers need an easy way to try out models…
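A minimal fine-tuning sketch along those lines, assuming the Hugging Face transformers and datasets libraries; the small classifier, dataset, and hyperparameters are illustrative stand-ins, not the enterprise LLM workflow the post refers to:

```python
# A minimal fine-tuning sketch, assuming the Hugging Face transformers and datasets
# libraries; the model (distilbert-base-uncased), dataset (imdb), and hyperparameters
# are illustrative placeholders rather than an enterprise-grade LLM workflow.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2)

# Tokenize a public sentiment dataset so the pretrained model can be adapted to it.
dataset = load_dataset("imdb")
def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)
encoded = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="finetuned-distilbert",
                           num_train_epochs=1,
                           per_device_train_batch_size=8),
    train_dataset=encoded["train"].shuffle(seed=42).select(range(2000)),  # small subset for a quick run
    eval_dataset=encoded["test"].select(range(500)),
)
trainer.train()
```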
From start-ups to large enterprises, businesses use cloud marketplaces to find the new solutions needed to quickly transform their businesses. Cloud marketplaces are online storefronts where customers can purchase software and services with flexible billing models, including pay-as-you-go, subscriptions, and privately negotiated offers. Businesses further benefit from committed spending at…
According to Gartner, "Nearly half of digital workers struggle to find the data they need to do their jobs, and close to one-third have made a wrong business decision due to lack of information awareness."1 To address this challenge, more and more enterprises are deploying AI in customer service, as it helps to provide more efficient and information-based personalized services.
Generative AI has captured the attention and imagination of the public over the past couple of years. From a given natural language prompt, these generative models are able to generate human-quality results, from well-articulated children's stories to product prototype visualizations. Large language models (LLMs) are at the center of this revolution. LLMs are universal language comprehenders…
Explore the latest advances in accurate and customizable automatic speech recognition, multi-language translation, and text-to-speech.
As the global service economy grows, companies rely increasingly on contact centers to drive better customer experiences, increase customer satisfaction, and lower costs with increased efficiencies. Customer demand has increased far more rapidly than contact center employment ever could. Combined with the high agent churn rate, customer demand creates a need for more automated real-time customer…
Speech AI is the ability of intelligent systems to communicate with users through a voice-based interface, which has become ubiquitous in everyday life. People regularly interact with smart home devices, in-car assistants, and phones through speech. Speech interface quality has improved by leaps and bounds in recent years, making these interfaces a much more pleasant, practical, and natural experience than just a…
If you've used a chatbot, predictive text to finish a thought in an email, or pressed "0" to speak to an operator, you've come across natural language processing (NLP). As more enterprises adopt NLP, the sub-field is developing beyond those popular use cases of machine-human communication to machines interpreting both human and non-human language. This creates an exciting opportunity for…
Artificial intelligence (AI) has transformed synthesized speech from monotone robocalls and decades-old GPS navigation systems to the polished tone of virtual assistants in smartphones and smart speakers. It has never been so easy for organizations to use customized state-of-the-art speech AI technology for their specific industries and domains. Speech AI is being used to power virtual…
To help localize subtitles from English to other languages, such as Russian, Spanish, or Portuguese, Netflix developed a proof-of-concept AI model that can automatically simplify and translate subtitles into multiple languages. The work is presented in a paper, Simplify-then-Translate: Automatic Preprocessing for Black-Box Machine Translation, published this month on the preprint platform…
Neural machine translation powers a wide variety of consumer applications, including websites, road signs, foreign-language subtitle generation, and more. TensorRT, NVIDIA's programmable inference accelerator, helps optimize and generate runtime engines for deploying deep learning inference applications to production environments. NVIDIA released TensorRT 4 with new features to accelerate…
NVIDIA released TensorRT 4 with new features to accelerate inference of neural machine translation (NMT) applications on GPUs. Neural machine translation offers AI-based text translation for a large number of consumer applications, including websites, road signs, foreign-language subtitle generation, and more. The new TensorRT 4 release brings support for new RNN layers such as Batch…
NVIDIA released TensorRT 4 at CVPR 2018. This new version of TensorRT, NVIDIA's powerful inference optimizer and runtime engine, provides several new capabilities. Additional features include the ability to execute custom neural network layers using FP16 precision and support for the Xavier SoC through NVIDIA DRIVE AI platforms. TensorRT 4 speeds up deep learning inference applications such as neural machine…
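As a hedged sketch of the general workflow these TensorRT posts describe (building a reduced-precision inference engine), the example below uses the current TensorRT Python API and an ONNX export rather than the TensorRT 4-era API; the file names are placeholders.

```python
# A hedged sketch of building an FP16 TensorRT engine from an ONNX export of an NMT
# model. It uses the current TensorRT Python API rather than the TensorRT 4-era API
# described above; "nmt_encoder.onnx" and "nmt_encoder.plan" are placeholder file names.
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_fp16_engine(onnx_path):
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)

    # Parse the exported model into a TensorRT network definition.
    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            raise RuntimeError(parser.get_error(0))

    # Request reduced-precision (FP16) kernels, one of the optimizations the posts highlight.
    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.FP16)
    return builder.build_serialized_network(network, config)

if __name__ == "__main__":
    serialized = build_fp16_engine("nmt_encoder.onnx")
    with open("nmt_encoder.plan", "wb") as f:
        f.write(serialized)
```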
Researchers from Microsoft recently announced they've created the first deep learning translation system capable of translating sentences of news articles from Chinese to English with the same level of accuracy as a person. Microsoft used NVIDIA Tesla GPUs and millions of sentences from various online newspapers to train their neural network. The team used a dual learning system where the AI…