Speech and Translation AI

NVIDIA Riva

Build and deploy fully customizable multilingual speech and translation AI for your large language model and retrieval-augmented generation based applications.

What Is NVIDIA Riva?

NVIDIA® Riva is a set of GPU-accelerated multilingual speech and translation microservices for building fully customizable, real-time conversational AI pipelines. Riva includes automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) and is deployable in all clouds, in data centers, at the edge, or on embedded devices. With Riva, organizations can add speech and translation interfaces with large language models (LLMs) and retrieval-augmented generation (RAG) to transform chatbots into engaging and expressive multilingual assistants and avatars.

Unveiling End-To-End Speech and Translation AI Magic

Deliver AI chatbots with the state-of-the-art multilingual transcription, translation, and voices.

See Riva in Action

Try NVIDIA Riva Automatic Speech Recognition

Select the language and check out how Riva ASR delivers highly accurate transcription in real time by providing an input through your microphone or uploading a .wav file from your device.

Note: The duration of each sample is limited to 30 seconds.

Try saying something

Try NVIDIA Riva Text-to-Speech

Select a voice and type in a test sentence to hear Riva’s out-of-the-box English female or male voice.

Note: Input text is limited to 400 characters.

Use of Riva skills is subject to NVIDIA Riva terms of use. Your data will be used to improve NVIDIA products and services.

NVIDIA Riva Benefits

Highly Accurate and Expressive Multilingual Voices

Achieve high transcription accuracy for bilingual and multilingual translations and deploy out-of-the-box expressive professional female and male voices with state-of-the-art models pretrained on thousands of hours of audio on NVIDIA supercomputers.

Fully Customizable

Customize across ASR pipelines for different languages, accents, domains, vocabulary, and context for the best possible accuracy for your use case and across TTS pipelines for the voice and intonation you want.

Flexible Deployments

Provide consistent experiences to your customers for hundreds of thousands of input streams with higher inference performance versus existing technology and on deployment of your choice—in data centers, on premises, in the cloud, at the edge, or in embedded devices.

Use Cases of Riva

Q&A Assistants

Companies are deploying Q&A assistants to automatically address the queries of millions of customers and employees around the clock. With Riva’s speech and translation AI microservices, these assistants provide helpful and natural responses at every turn of the conversation despite background noise, poor sound quality, and diverse speaker dialects and accents.

Q&A assistants to automatically address the queries of millions of customers and employees around the clock.

Contact Center Agent Assists

Consumers expect contact center agents to resolve their issues quickly and efficiently. To support agents to deliver the best experiences possible, enterprises across industries are deploying agent assist technology based on Riva speech and translation AI, which can provide facts and suggestions in real time.

Agent assist technology based on Riva speech and translation AI

Digital Avatars and Brand Ambassadors

To improve customer service experiences and build relationships with their customers, businesses are building avatars with recognizable brand voices. With Riva, they can create a unique, high-quality, personalized voice with just three seconds of speech data.

Learn more about digital avatars

Video-Conferencing Transcription

With hundreds of millions of online meetings held daily, video conferencing has become an indispensable tool for enterprises. Through Riva's real-time transcription, video conferencing applications achieve impressive accuracy in live captioning and meeting summarizations, accommodating users with worldwide accents and diverse domain-specific vocabularies.

Learn more about Riva's real-time transcription, video conferencing applications

Translation

In the global economy, businesses operate across countries and serve customers with diverse linguistic and cultural backgrounds. This diversity in global languages poses a unique challenge, as hiring native speakers and training employees in multiple languages isn't scalable, cost-effective, or efficient. Riva translation empowers accurate and effective communication applications, facilitating smooth global interactions.

Riva translation empowers accurate and effective communication applications, facilitating smooth global interactions.

Service Robots

Service robots are increasingly found in hospitals, airports, and retail stores worldwide. They aid frontline workers by handling daily repetitive tasks in restaurants and manufacturing facilities, assist customers in locating items in stores, and support physicians and nurses in patient care. With Riva, it’s easy to add speech and translation AI to service robots.

Service robots in retail stores

Starting Options

Get Started With NVIDIA Riva

Use the right tools to build and deploy fully customizable, multilingual speech and translation AI applications.

Experience APIs and Interactive Demos

For individuals looking to experience Riva, the API catalog offers a UI-based playground and access to NVIDIA-managed API endpoints for free as a great starting point.

Try Before You Buy

For enterprises looking to try Riva before purchasing NVIDIA AI Enterprise for production, there are two options to get started for free:

Without Infrastructure:
For those without existing infrastructure, NVIDIA offers free hands-on labs through NVIDIA LaunchPad.

With Infrastructure:
For those with existing infrastructure, NVIDIA offers a free evaluation license to try NVIDIA AI Enterprise for 90 days.

Leading Adopters Across All Industries

Resources

Using Speech AI for Transcription, Translation, and Voice

Build world-class, fully customizable, speech AI applications such as intelligent virtual assistants, audio transcription services, digital avatars, and more.

Reinvent Contact Center Experiences With NVIDIA Riva Transcription

By generating an accurate transcript of customer interactions in real time, Riva enables AI to provide contextual insights, measure sentiment, and recommend the next-best action to an agent, ensuring a great personalized experience.

Robot Dog Fetches Snacks Across Town

Watch as Spot uses AI and super-accurate GPS to order and pick up snacks.

Try Riva on NVIDIA LaunchPad

Have an existing speech AI project? Apply to get hands-on experience testing and prototyping your conversation-based solutions with speech skills in the high-performance Riva software stack that’s deployable today.

Get Started With Highly Accurate Custom ASR for Speech AI

Learn to build, train, fine-tune, and deploy a GPU-accelerated automatic speech recognition (ASR) service with Riva that includes customized features.

Talk to Your Data in Your Native Language

Join AI experts to learn how to build, fine-tune, and deploy production-ready multilingual speech and translation AI on top of LLM-based applications to unmute your chatbots, enable them to speak in the language of your choice, and provide better services.

NVIDIA Parlays Win in Voice Challenge

Read how a team of NVIDIANs won the LIMMITS ’24 challenge, which asked contestants to recreate in real time a speaker’s voice in English or any of six languages spoken in India with the appropriate accent.

An Introduction to NVIDIA Riva

Learn about Riva’s architecture, key features, and components for building speech and translation AI services.

Building Speech AI Applications

Explore how to get started with integrating and deploying Riva ASR and TTS models in production with high-performance inference and minimal effort.

GTC Sessions

Dive into the latest content and see how businesses are making powerful technologies like virtual assistants, real-time transcriptions, voice searches, and question-answering systems possible.

Speech AI Day

Speech AI Day offers you the opportunity to hear from renowned speech and translation AI leaders and experts as they share their groundbreaking research, explore real-world applications, and discuss open-source contributions.

Webinars

Explore how to kickstart your journey with Riva’s cutting-edge speech and translation AI and fully customize it to achieve the highest-accuracy agent assist solution. Demos by Infosys, Quantiphi, and NVIDIA conversational AI experts are featured.

T-Mobile

Speech AI for Award- Winning Customer Care

T-Mobile uses Riva ASR in their call center to accurately transcribe customer conversations and provide real-time recommendations to help agents quickly resolve customer queries.

NCS

Easy Speech AI Customization for Local Singaporean Voice

NCS used Riva TTS to customize a Singaporean voice with local pronunciation, tone, and accent for thousands of monthly active Breeze users—a driver’s companion app.

Tarteel

Automating Real-Time Arabic Speech Recognition

Tarteel uses Riva and NVIDIA NeMo™ to provide real-time feedback on Quran recitation at scale, enabling Muslims, instructors, content creators, and researchers to engage with the Quran.

Riva User Forum

Explore the online community for Riva, where you can browse how-to questions, learn best practices, engage with other developers, and report bugs.

NVIDIA Developer Program

Connect with millions of like-minded developers and access hundreds of GPU-accelerated containers, models, and SDKs—all the tools necessary to successfully build apps with NVIDIA technology—through the NVIDIA Developer Program.

Accelerate Your Startup

NVIDIA Inception is a free program for cutting-edge startups that offers critical access to go-to-market support, technical expertise, training, and funding opportunities.

AI2Labs

In 2021, AI2Labs spun off from Yoozoo Games as a local tech startup in Singapore. AI2Labs innovates, experiments, and develops AI products and applications, enabling efficient processes and improving sustainability and business outcomes.

AI2Labs integrated Riva into their Speakr—domain-specific speech AI—speech recognition API to accommodate the intricacies of Asian speech and business domains and achieved state-of-the-art Singlish translation accuracy.

Avaya

Avaya specializes in cloud communications and workstream collaboration solutions, providing unified communications, contact center, communications platform as a service (CPaaS), and services with their OneCloud platform.

Avaya integrated the NVIDIA Riva speech-to-text engine for real-time captions at scale. Riva enables better transcription quality, lower word-error rate, and economical delivery.

C-DAC

For over 10 years, the Applied AI Group at C-DAC in Pune, India, has focused on research and development of speech technology. They’ve successfully created a cutting-edge speech-to-text (STT) system for Indic languages such as Hindiand Marathi. The group continues to advance their work by exploring AI-enabled, open-source deep learning frameworks, libraries, and tools for creating STT and speech-enabled applications for other Indic and low-resource languages. Experiments were conducted using various neural network architectures and topologies from NVIDIA’s open-source NeMo framework, with Citrinet and Conformer-CTC network topologies proving to be effective in building and training neural acoustic models for speech recognition. These models were trained on single- and multi-node Param Siddhi AI systems, optimizing training time and performance. Finally, the models were deployed for real-time and batch-mode inference using the Riva GPU-accelerated production pipeline.

NCS

NCS, a subsidiary of Singtel Group, is a leading technology services firm with presence in Asia Pacific and partners with governments and enterprises to advance communities through technology. Combining the experience and expertise of its 12,000-strong team across 61 specialisations, NCS provides differentiated and end-to-end technology services to clients with its NEXT capabilities in digital, data, cloud and platforms, as well as core offerings in application, infrastructure, engineering and cybersecurity. NCS also believes in building a strong partner ecosystem with leading technology players, research institutions and start-ups to support open innovation and co-creation. 

NCS uses NVIDIA Riva TTS in Breeze—the driver’s companion app—for voice-guided navigation, live traffic and road condition updates, real-time parking rates, and electronic road pricing rates and operating hours, to help Singapore drivers experience smooth driving journeys. 

Learn more.

breeze.com.sg/

www.ncs.co

Customer Story

RingCentral

RingCentral, a leading provider of global enterprise cloud communications, collaboration, and contact center solutions, serves millions of users. The RingCentral platform empowers collaboration from any location and device, improving business efficiency and customer satisfaction. RingCentral uses NVIDIA Riva for video conferencing transcription for 200,000 concurrent users on their platform.

Learn more.

www.ringcentral.com

Customer Story

GTC Session

Snap

Snap is a camera and social media company that enables multimedia message creation with filters and effects. To create more interactive experiences, Snapchat users play with Lenses—a feature that adds real-time effects into snaps—over 6 billion times per day. 

NVIDIA Riva’s noise- and lingo-optimized speech AI service is integrated into Snap AR Lens Studio, enabling creators—artists and developers—to build gripping augmented reality (AR) experiences.

T-Mobile

T-Mobile, a supercharged Un-carrier, delivers an advanced 4G LTE and transformative 5G network for the best customer experience. To empower contact center agents, T-Mobile implements Expert Assist. This AI-based software uses NVIDIA Riva to transcribe real-time customer conversations that feed recommenders and assist thousands of agents.

With Riva, T-Mobile fine-tunes automatic speech recognition models on custom datasets and interprets customer jargon accurately across noisy environments.

Learn more.

www.t-mobile.com

Customer Story

GTC Session

Contact an NVIDIA AI Enterprise Sales Representative

We'll answer your questions and help with your organization's needs.

Contact Us