# NVIDIA NeMo

A comprehensive toolkit to build, monitor, and optimize AI agents across their lifecycle at enterprise scale.

[Get Started](https://www.nvidia.com/en-us/ai-data-science/products/nemo/get-started.md)

[Documentation](https://docs.nvidia.com/nemo/)

Overview

## What Is NVIDIA NeMo?

NVIDIA NeMo™ is a comprehensive toolkit for managing the AI agent lifecycle. It includes open libraries and microservices for data processing, data generation, model fine-tuning and evaluation, reinforcement learning, speech, safety, and agent observability. Use NeMo to customize NVIDIA Nemotron™ and other open models to build production-grade, specialized agentic systems tailored to your domain needs and data.  

It integrates with existing AI platforms and supports cloud, on-premises, and hybrid deployments.

### NVIDIA Data Flywheel Blueprint

This AI blueprint enables developers to build an automated data flywheel that captures real-world usage data to continuously improve the accuracy and efficiency of generative and agentic AI applications.

[Try Now](https://build.nvidia.com/nvidia/build-an-enterprise-data-flywheel)

### NVIDIA AI-Q Blueprint

A reference workflow for building custom enterprise research agents that securely connect to company data, tools, and workflows to deliver accurate, context-aware insights grounded in organizational knowledge.

[Try Now](https://build.nvidia.com/nvidia/aiq)

### Features

## Tools for Managing the AI Agent Lifecycle

The AI agent lifecycle is an end-to-end process for developing and improving AI agents in production applications. NVIDIA NeMo provides tools that enable each step of this workflow, so enterprises can build specialized agents that are powerful, secure, and continuously learn.

| Build |
| --- |
| **Prepare AI-ready data**    Process existing multimodal datasets into high-quality, AI-ready formats for development pipelines, and generate synthetic data to close critical data gaps. | * [NeMo Curator](https://developer.nvidia.com/nemo-curator)   Clean, filter, and prepare safe multimodal data for agentic use cases and model training. * [NeMo Data Designer](https://build.nvidia.com/nemo/data-designer)    Create domain‑specific synthetic datasets from scratch for building and evaluating specialized agents. |
| **Select the right model**    Pick or build models suited to the use case: selecting from open Nemotron models, other open or proprietary options, or training from scratch. Validate with evaluation runs, and fine-tune as needed. | * [NVIDIA Nemotron](https://www.nvidia.com/en-us/ai-data-science/foundation-models/nemotron.md)    State-of-the-art open NVIDIA models for reasoning, RAG, speech, vision, and safety. * [NeMo Retriever](https://developer.nvidia.com/nemo-retriever)    Build scalable pipelines to extract text, tables, charts, and images from complex documents. * [NeMo Evaluator](https://developer.nvidia.com/nemo-evaluator/)    Benchmark and test models and agents using academic, LLM-as-a-judge, and custom evaluations. |
| **Build your AI agent**    Profile and optimize agentic workflows across frameworks, with built-in performance analysis, bottleneck detection, evaluation-driven RL tuning, and interoperability with LangChain, LlamaIndex, and other agent ecosystems. | * [NeMo Agent Toolkit](https://developer.nvidia.com/nemo-agent-toolkit)   Framework-agnostic, open library to build, profile,and optimize AI agents. |
| Deploy |
| **Deploy your agent with maximum performance**    Optimize your agent for production with high-throughput, low-latency inference, ensuring it can scale to meet enterprise demands and deliver fast, reliable responses. | * [NVIDIA NIM](https://developer.nvidia.com/nim/)    Run AI models in optimized containers, exposed as OpenAI-compatible APIs. |
| **Stay grounded in data and apply guardrails**    Use retrieval-augmented generation (RAG) to anchor agent responses in trusted knowledge while applying safety, compliance, and content moderation guardrails. | * [NeMo Retriever](https://developer.nvidia.com/nemo-retriever)    Build accurate, privacy-preserving RAG pipelines. * [NeMo Guardrails](https://developer.nvidia.com/nemo-guardrails/)    Enhance safety, compliance, and control across AI interactions. |
| Optimize |
| **Monitor and collect feedback**    Track the agent's real-world interactions with users and other systems. Systematically evaluate its performance and accuracy, finding opportunities to continuously improve. | * [NeMo Agent Toolkit](https://developer.nvidia.com/nemo-agent-toolkit)    Framework-agnostic, open library to build, profile, and optimize AI agents. * [NeMo Customizer](https://developer.nvidia.com/nemo-customizer)    Microservice to fine-tune and align models with domain data. * [NeMo Framework](https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html)    Collection of all open-source NeMo libraries for training and aligning LLMs and multimodal models efficiently at scale. |
| **Continuously improve with data flywheels**    Use the feedback and data gathered from monitoring to create a data-driven flywheel, iteratively retraining the agent to continuously optimize and stay effective over time. | * [NeMo RL](https://docs.nvidia.com/nemo/rl/latest/index.html)    Post-train and align models at scale with advanced reinforcement learning (RL) techniques. * [NeMo Gym](https://docs.nvidia.com/nemo/gym/0.1.0/index.html)    Simulated RL training environments to generate high-quality agentic RL training data/rollouts. * [NeMo Evaluator](https://developer.nvidia.com/nemo-evaluator/)    Benchmark and test models and agents using academic, LLM-as-a-judge, and custom evaluations. * [NeMo Customizer](https://developer.nvidia.com/nemo-customizer)    Microservice to fine-tune and align models with domain data. |

Use Cases

## How NeMo Is Being Used

See how NVIDIA NeMo supports industry use cases and jump-starts your AI development.

1. AI Agents
2. SDG for Agentic AI
3. AI Assistant
4. Enterprise Search
5. Content Generation
6. Humanoid Robot

### AI Agents

AI agents are transforming customer service across sectors, helping companies enhance customer conversations, achieve high resolution rates, and improve human representative productivity. AI agents can handle predictive tasks, reason and problem-solve, be trained to understand industry-specific terms, and pull relevant information from an organization’s knowledge bases, wherever that data resides.

[Learn More About AI Agents](https://blogs.nvidia.com/blog/ai-agents-customer-service/)

### Synthetic Data Generation for Agentic AI

Specialized agentic systems need massive, high-quality datasets that are slow and expensive to collect from real-world sources. Synthetic data created through simulations or generative AI models can eliminate this bottleneck by creating unlimited training scenarios without privacy restrictions or quality issues. This enables faster development of reasoning LLMs, multi-step decision-makers, and multimodal AI assistants.

[Learn More About SDG for Agentic AI](https://www.nvidia.com/en-us/use-cases/synthetic-data-generation-for-agentic-ai.md)

### AI Assistant

Businesses are deploying AI assistants to efficiently address the queries of millions of customers and employees around the clock. Powered by customized NVIDIA NIM™ microservices for [LLMs](https://developer.nvidia.com/blog/develop-custom-enterprise-generative-ai-with-nvidia-nemo/), [RAG](https://developer.nvidia.com/blog/how-to-take-a-rag-application-from-pilot-to-production-in-four-steps/), and [speech](https://developer.nvidia.com/blog/an-easy-introduction-to-speech-ai/) and translation AI, these AI teammates deliver immediate and accurate spoken responses, even in the presence of background noise, poor sound quality, and diverse dialects and accents.

[Learn More About AI Assistants](https://www.nvidia.com/en-us/use-cases/ai-for-customer-support.md)

[Try Now](https://build.nvidia.com/nvidia/ai-virtual-assistant-for-customer-service)

### Enterprise Search

Enterprises generate trillions of documents annually—including PDFs, reports, presentations, —each containing text, images, charts, and tables—spread across disconnected systems. AI-powered enterprise search transforms this scattered data into a unified knowledge base, enabling employees to instantly surface insights using natural language and driving faster decisions at lower cost.

[Learn More About Enterprise Search](https://build.nvidia.com/nvidia/aiq)

### Content Generation

Generative AI makes it possible to generate highly relevant, bespoke, and accurate content grounded in the domain expertise and proprietary IP of your enterprise.

[Learn More About Content Generation](https://www.nvidia.com/en-us/use-cases/content-creation-using-generative-ai.md)

### Humanoid Robot

Humanoid robots are built to adapt quickly to existing human-centric urban and industrial work spaces, tackling tedious, repetitive, or physically demanding tasks. Their versatility has them in such varied locations as factory floors to healthcare facilities, where these robots are assisting humans and helping alleviate labor shortages with automation.

[Learn More About Humanoid Robots](https://www.nvidia.com/en-us/use-cases/humanoid-robots.md)

Apptronik

### Benefits

## Explore the Benefits of NVIDIA NeMo for Agentic AI

### Comprehensive, Modular AI Agent Suite

Manage the full agent lifecycle from data curation and post-training to evaluation, guardrails, observability, and continuous optimization using an interoperable, enterprise-grade software suite.

### Accelerate at Scale

Deploy and scale data flywheels using enterprise data, with GPU-accelerated training, inference, multi-node scaling, and cost-efficient optimization for high-throughput agent workloads.

### Increased ROI

Build, customize, and deploy specialized agentic systems faster—shortening time to production and maximizing return on AI investments.

### Secure and Production-Ready

Safeguard sensitive data, enforce policy and prompt guardrails, validate models, and continuously detect vulnerabilities. Deploy securely with enterprise-grade support and stability across cloud, data center, and edge with [NVIDIA AI Enterprise](https://www.nvidia.com/en-us/data-center/products/ai-enterprise.md).

[Get Started](https://www.nvidia.com/en-us/ai-data-science/products/nemo/get-started.md)

### Starting Options

## Ways to Get Started With NVIDIA NeMo

Manage the AI agent lifecycle with tools and technologies for building, monitoring, and optimizing AI agents in production.

### 1

Try NVIDIA-optimized foundation models like NVIDIA Nemotron.

[Try Now](https://build.nvidia.com/explore/discover)

### 2

Build, monitor, and optimize AI agents with NVIDIA NeMo.

[Get Started](https://www.nvidia.com/en-us/ai-data-science/products/nemo/get-started/#get-started)

### 3

Jump-start building your AI solutions with NVIDIA Blueprints.

[Try the Blueprints](https://build.nvidia.com/blueprints)

[Compare Starting Options](https://www.nvidia.com/en-us/ai-data-science/products/nemo/get-started.md)

### Customer Stories

## How Industry Leaders Are Driving Innovation With NeMo

[More Customer Stories](https://www.nvidia.com/en-us/case-studies/?page=1&useCase=Generative%20AI)

### Adopters

## Leading Adopters Across All Industries

1. ### Customers

2. ### Partners

### Resources

## The Latest in NVIDIA NeMo Resources

1. Blogs
2. Sessions
3. Training
4. Videos

Load More

[View All Blogs](https://blogs.nvidia.com/blog/tag/nvidia-nemo/)

[View All Sessions](https://www.nvidia.com/en-us/on-demand/playlist/playList-54f387be-e581-4990-9a37-1285ee0fce3f/)

### Get Started With LLM Customization

In this course, you’ll go beyond prompt-engineering LLMs and learn techniques to efficiently customize pretrained LLMs for your specific use cases. Using NVIDIA NIM microservices, NeMo Curator, and NeMo Framework, you’ll learn various parameter-efficient fine-tuning methods to customize LLM behavior for your organization.

[Get Started](https://www.nvidia.com/gtc/session-catalog/?regcode=no-ncid&ncid=no-ncid&tab.catalogallsessionstab=16566177511100015Kus&search=DLIW73630#/session/1730224902616001qxKv)

### Elevate Your LLM Skills

Take advantage of our comprehensive LLM learning path, covering fundamental to advanced topics featuring hands-on training developed and delivered by NVIDIA experts. You can opt for the flexibility of self-paced courses or enroll in instructor-led workshops to earn a certificate of competency.

[Explore LLM Training](https://www.nvidia.com/en-us/learn/learning-path/generative-ai-llm.md)

### Get Certified by NVIDIA

Showcase your Generative AI skills and advance your career by getting certified by NVIDIA. Our new professional certification program offers two developer exams focusing on proficiency in large language models (LLMs) and multimodal workflow skills.

[Learn About Certification](https://www.nvidia.com/en-us/learn/certification.md)

### Train a Reasoning-Capable LLM in One Weekend

Explore a simple and computationally efficient recipe for training reasoning models with small amounts of training data curated from the Llama Nemotron post-training dataset and NVIDIA NeMo.

[Watch Reasoning Video](https://www.youtube.com/watch?v=hMGikmMFLAU)

### Optimize AI Agents Using a Data Flywheel

Learn how to optimize AI agents in production using the NVIDIA Data Flywheel Blueprint—a continuous loop of distillation, fine-tuning, and evaluation powered by NeMo and NIM microservices.

[Watch Data Flywheel Video](https://www.youtube.com/watch?v=hTYrjYbFHyU)

### Build AI Agents With NeMo Agent Open-Source Toolkit

Learn how to build, integrate, and optimize custom AI agents using the NVIDIA NeMo Agent open-source Python toolkit.

[Watch Custom AI Agent Video](https://www.youtube.com/watch?v=NsogD7UhZ4Q)

[View All Videos](https://www.youtube.com/results?search_query=nvidia+nemo)

### Next Steps

## Ready to Get Started?

Use the right tools and technologies to take your agentic AI applications from development to production.

[Get Started](https://www.nvidia.com/en-us/ai-data-science/products/nemo/get-started.md)

### For Developers

Explore everything you need to start developing with NVIDIA NeMo, including the latest documentation, tutorials, technical blogs, and more.

[Start Developing](https://www.nvidia.com/en-us/ai-data-science/products/nemo/get-started/#nemo-tools-microservices)

### Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the assurance of security, API stability, and support that comes with [NVIDIA AI Enterprise](https://www.nvidia.com/en-us/data-center/products/ai-enterprise.md).

[Contact Us](https://www.nvidia.com/en-us/data-center/products/ai-enterprise/contact-sales.md)

## Shell

**Shell Trains Custom AI Chatbot With NVIDIA NeMo to Uplevel Operations**

Shell, a global leader in the energy industry, has leveraged NVIDIA NeMo to empower its journey toward developing a custom AI chatbot for chemical domain expertise. This innovative solution has the potential to significantly enhance employee productivity by streamlining search processes, improving decision-making, and supporting research and development in production environments.

[Learn More](https://www.nvidia.com/en-us/case-studies/shell-trains-custom-ai-chatbot-with-nemo.md)

## AI Sweden

### **Accelerate Industry Applications With LLMs**

AI Sweden facilitated regional language model applications by providing easy access to a powerful 100 billion-parameter model. They digitized historical records to develop language models for commercial use.

[Learn More](https://blogs.nvidia.com/blog/2022/06/19/ai-sweden-nlp/)

## Amazon

### **How Amazon and NVIDIA Help Sellers Create Better Product Listings With AI**

Amazon doubles inference speeds for new AI capabilities using NVIDIA TensorRT-LLM and GPUs to help sellers optimize product listings faster.

[Learn More](https://blogs.nvidia.com/blog/amazon-product-listings/)

## Amdocs

### **NVIDIA and Amdocs Bring Custom Generative AI to Global Telco Industry**

Amdocs plans to build custom LLMs for $1.7 trillion global telecommunications industry using NVIDIA AI foundry service on Microsoft Azure.

[Learn More](https://blogs.nvidia.com/blog/amdocs-telco-industry/)

## AT&T

### **AT&T Drives Customer Care AI Agents’ Accuracy, Efficiency, and Performance With NVIDIA NeMo**

AT&T, one of the world’s largest telecommunications companies, is reimagining customer care through the power of AI. Facing challenges like model drift, rising computational demands, and the need for real-time data access, AT&T turned to NVIDIA NeMo™ microservices to build a feedback-driven AI platform that continuously improves performance while optimizing cost, speed, and compliance.

[Learn More](https://www.nvidia.com/en-us/case-studies/att-drives-ai-agents-with-nemo.md)

## AWS

### **NVIDIA Powers Training for Some of the Largest Amazon Titan Foundation Models**

Amazon leveraged the NVIDIA NeMo framework, GPUs, and AWS EFAs to train its next-generation LLM, giving some of the largest Amazon Titan foundation models customers a faster, more accessible solution for generative AI.

[Learn More](https://blogs.nvidia.com/blog/nemo-amazon-titan/)

## Accenture

### **Accelerate Generative AI Adoption for Enterprises**

ServiceNow, NVIDIA, and Accenture announced the launch of AI Lighthouse, a first-of-its-kind program designed to fast-track the development and adoption of enterprise generative AI capabilities.

[Learn More](https://nvidianews.nvidia.com/news/servicenow-nvidia-accenture-ai-lighthouse)

## Azure

### **Harnessing the Power of NVIDIA AI Enterprise on Azure Machine Learning**

Get access to a complete ecosystem of tools, libraries, frameworks, and support services tailored for enterprise environments on Microsoft Azure.

[Learn More](https://developer.nvidia.com/blog/harnessing-the-power-of-nvidia-ai-enterprise-on-azure-machine-learning/)

## Bria

### **Bria Builds Responsible Generative AI for Enterprises Using NVIDIA NeMo, Picasso**

Bria, a startup based in Tel Aviv, is helping businesses who are seeking responsible ways to integrate visual generative AI technology into their enterprise products with a generative AI service that emphasizes model transparency alongside fair attribution and copyright protections.

## Cohesity

### **Unlock Your Data Superpower: NVIDIA Microservices Unleash Enterprise-Grade Secure Generative AI for Cohesity**

With NVIDIA NIM and optimized models, Cohesity DataProtect customers can add generative AI intelligence to data backups and archives. This allows Cohesity and NVIDIA to bring the power of generative AI to all Cohesity DataProtect customers. Leveraging the power of NIM and NVIDIA optimized models, Cohesity DataProtect customers obtain the power of data-driven insights from their data backups and archives, unleashing new levels of efficiency, innovation, and growth.

[Learn More](https://www.cohesity.com/press/unlock-gen-ai-capabilities-via-nvidia-collaboration/)

## CrowdStrike

### **Shaping the Future of AI in the Cybersecurity Domain**

CrowdStrike and NVIDIA are leveraging accelerated computing and generative AI to provide customers with an innovative range of AI-powered solutions tailored to efficiently address security threats.

[Learn More](https://www.crowdstrike.com/press-releases/crowdstrike-nvidia-generative-ai-collaboration/)

## Dell

### **Dell Validated Design for Generative AI With NVIDIA**

Dell Technologies and NVIDIA announced an initiative to make it easier for businesses to build and use generative AI models on premises quickly and securely.

[Learn More](https://nvidianews.nvidia.com/news/dell-technologies-and-nvidia-introduce-project-helix-for-secure-on-premises-generative-ai)

## Deloitte

### **Unlock the Value of Generative AI Across Enterprise Software Platforms**

Deloitte will use NVIDIA AI technology and expertise to build high-performing generative AI solutions for enterprise software platforms to help unlock significant business value.

[Learn More](https://www2.deloitte.com/us/en/pages/about-deloitte/articles/press-releases/strategic-alliance-with-nvidia-expanded-to-unlock-generative-ai-value-across-enterprise-software-platforms.html)

## Domino Data Lab

### **Domino Offers Production-Ready Generative AI Powered by NVIDIA**

With NVIDIA NeMo, data scientists can fine-tune LLMs in Domino’s platform for domain-specific use cases based on proprietary data and IP—without needing to start from scratch.

[Learn More](https://domino.ai/blog/beyond-hype-production-ready-generative-ai)

## Dropbox

### **Dropbox and NVIDIA to Bring Personalized Generative AI to Millions of Customers**

Dropbox plans to leverage NVIDIA’s AI foundry to build custom models and improve AI-powered knowledge work with Dropbox Dash universal search tool and Dropbox AI.

[Learn More](https://nvidianews.nvidia.com/news/dropbox-and-nvidia-team-to-bring-personalized-generative-ai-to-millions-of-customers)

## Google Cloud

### **AI Titans Collaborate to Create Generative AI Magic**

At its Next conference, Google Cloud announced the availability of its A3 instances powered by NVIDIA H100 Tensor Core GPUs. Engineering teams from both companies have collaborated to bring NVIDIA NeMo to the A3 instances for faster training and inference.

[Learn More](https://developer.nvidia.com/blog/streamline-generative-ai-development-with-nvidia-nemo-on-gpu-accelerated-google-cloud)

[Google Cloud Console](https://console.cloud.google.com/marketplace/product/nvidia/nvidia-ai-enterprise-vmi?project=nvidia-vgpu-public)

## HuggingFace

### **Leading AI Community to Accelerate Data Curation Pipeline**

Hugging Face, the leading open platform for AI builders, is collaborating with NVIDIA to integrate NeMo Curator and accelerate DataTrove, their data filtering and deduplication library. “We are excited about the GPU acceleration capabilities of NeMo Curator and can’t wait to see them contributed to DataTrove!” says Jeff Boudier, Product Director at Hugging Face.

## KT

### **Creating New Customer Experiences With LLMs**

South Korea’s leading mobile operator builds billion-parameter LLMs trained with the NVIDIA DGX SuperPOD platform and NeMo framework to power smart speakers and customer call centers.

[Learn More](https://blogs.nvidia.com/blog/2022/09/20/kt-large-language-models/)

## Lenovo

### **New Reference Architecture for Generative AI Based on LLMs**

Solution to expedite innovation by empowering global partners and customers to develop, train, and deploy AI at scale across industry verticals with utmost safety and efficiency.

[Learn More](https://nvidianews.nvidia.com/news/lenovo-nvidia-hybrid-ai)

## Quantiphi

### **Enabling Enterprises to Fast-Track Their AI-Driven Journeys**

Quantiphi specializes in training and fine-tuning foundation models using the NVIDIA NeMo framework, as well as optimizing deployments at scale with the NVIDIA AI Enterprise software platform, while adhering to responsible AI principles.

[Learn More](https://quantiphi.com/partners/nvidia/#overview)

## SAP

### **SAP and NVIDIA Accelerate Generative AI Adoption Across Enterprise Applications Powering Global Industries**

Customers can harness their business data in cloud solutions from SAP using customized LLMs deployed with NVIDIA AI foundry services and NVIDIA NIM Microservices.

[Learn More](https://nvidianews.nvidia.com/news/sap-nvidia-generative-ai-enterprise-applications)

## ServiceNow

### **Building Generative AI Across Enterprise IT**

ServiceNow develops custom LLMs on its ServiceNow platform to enable intelligent workflow automation and boost productivity across enterprise IT processes.

[Learn More](https://nvidianews.nvidia.com/news/servicenow-and-nvidia-announce-partnership-to-build-generative-ai-across-enterprise-it)

## Perplexity

### **Enhance Model Performance for AI-Powered Search Engines**

Using NVIDIA NeMo, Perplexity aims to quickly customize frontier models to improve the accuracy and quality of search results and optimize them for lower latency and high throughput for a better user experience.

[Learn More](https://www.nvidia.com/en-us/case-studies/perplexity-enhances-model-performance-with-nemo.md)

## VMware

### **VMware and NVIDIA Unlock Generative AI for Enterprises**

VMware Private AI Foundation with NVIDIA will enable enterprises to customize models and run generative AI applications, including intelligent chatbots, assistants, search, and summarization.

[Learn More](https://nvidianews.nvidia.com/news/vmware-and-nvidia-unlock-generative-ai-for-enterprises)

## Weight & Biases

### **Debug, Optimize, and Monitor LLM Pipelines**

Weights & Biases helps teams working on generative AI use cases or with LLMs track and visualize all prompt-engineering experiments—helping users debug and optimize LLM pipelines—as well as provides monitoring and observability capabilities for LLMs.

[Learn More](https://wandb.ai/wandb/wb-announcements/reports/Our-Growing-Partnership-with-NVIDIA--VmlldzoyMDIwMDI4)

## Writer

### **Startup Pens Generative AI Success Story With NVIDIA NeMo**

Using NVIDIA NeMo, Writer is building LLMs that are helping hundreds of companies create custom content for enterprise use cases across marketing, training, support, and more.

[Learn More](https://blogs.nvidia.com/blog/2023/08/08/writer-nemo-generative-ai/?nvid=nv-int-bnr-397231)

## Arize

### **Arize Powers Self-Improving AI Data Flywheels**

Arize’s LLM engineering and observability platform integrates NVIDIA NeMo microservices to power AI data flywheels, enabling continuous model refinement through real-world feedback. With NeMo Customizer, Evaluator, and Guardrails, Arize ensures agentic systems are performant, safe, and aligned with evolving enterprise needs. This collaboration supports the development of adaptive AI that learns and evolves over time.

[Learn More](https://arize.com/blog/arize-nvidia-nemo-integration/)

## DataRobot

### **Enterprise-Ready, Trustworthy AI Agents With NeMo on DataRobot**

With NVIDIA NeMo embedded into the DataRobot Enterprise AI Suite, enterprises can ensure agentic systems are safe, compliant, and grounded in enterprise-specific data. This integration facilitates the development of AI agents that deliver accurate, context-aware responses while adhering to organizational standards.

[Learn More](https://www.datarobot.com/solutions/partners/nvidia/)

## DataStax

### **DataStax and NVIDIA Build Data and AI Platform**

Over the past year, DataStax has partnered with NVIDIA to adopt NVIDIA NeMo microservices to enhance generative AI, retrieval-augmented generation, and hybrid search across its database and AI offerings. The results have been impressive: 19x better performance in throughput, a significant reduction in costs, and improved latency.

[Learn More](https://www.datastax.com/blog/ai-agent-speed-savings-langflow-nvidia-nemo-microservices)

## Galileo

### **Galileo and NVIDIA NeMo: De-Risking Agentic AI in Production**

Galileo integrates NVIDIA NeMo microservices to build AI data flywheels that strengthen agent performance, reliability, and trust. NeMo adds complementary capabilities to the Galileo platform—enabling continuous domain-specific fine-tuning via NeMo Customizer, advanced evaluation with NeMo Evaluator, and safeguarding user interactions with NeMo Guardrails to empower AI teams to build, evaluate, and monitor agentic AI systems that learn and improve continuously in real-world environments.

[Learn More](https://www.galileo.ai/blog/nvidia-data-flywheel-for-de-risking-agentic-ai)

![YouTube Video](https://img.youtube.com/vi_webp/1V5_wJzTCzc/maxresdefault.webp)

Consent for Optional Cookies

(googleCookiePolicyLink)YouTube sets performance, advertising, and other optional cookies(/googleCookiePolicyLink) when you watch embedded videos. To watch this video, you need to turn on optional cookies for the site. By clicking “Accept and Play Video,” you will automatically turn on advertising and other optional cookies for the site and accept our (nvidiaTermsOfServiceLink)Terms of Service(/nvidiaTermsOfServiceLink) (which contains important waivers). Please see our (nvidiaPrivacyPolicyLink)Privacy Policy(/nvidiaPrivacyPolicyLink) and (nvidiaCookiePolicyLink)Cookie Policy(/nvidiaCookiePolicyLink) for more information.

Cancel

Accept and Play Video

Alternatively, you can (youtubeLink)watch this video on YouTube(/youtubeLink).

![YouTube Video](https://img.youtube.com/vi_webp/Hg2KibOvnLM/maxresdefault.webp)

Consent for Optional Cookies

(googleCookiePolicyLink)YouTube sets performance, advertising, and other optional cookies(/googleCookiePolicyLink) when you watch embedded videos. To watch this video, you need to turn on optional cookies for the site. By clicking “Accept and Play Video,” you will automatically turn on advertising and other optional cookies for the site and accept our (nvidiaTermsOfServiceLink)Terms of Service(/nvidiaTermsOfServiceLink) (which contains important waivers). Please see our (nvidiaPrivacyPolicyLink)Privacy Policy(/nvidiaPrivacyPolicyLink) and (nvidiaCookiePolicyLink)Cookie Policy(/nvidiaCookiePolicyLink) for more information.

Cancel

Accept and Play Video

Alternatively, you can (youtubeLink)watch this video on YouTube(/youtubeLink).

### Building and Deploying Generative AI Models

Enterprises are turning to generative AI to revolutionize the way they innovate, optimize operations, and build a competitive advantage. NeMo is an end-to-end platform for curating data; training, customizing, and evaluating multimodal models; and running inference at scale. It supports text, image, video, and speech generation.

[Watch Now](https://resources.nvidia.com/en-us-ai-large-language-models/watch-78?lx=Hh_Qnz)

### Unlocking Synthetic Data Generation with Llama 3.1

Learn how to use the Meta Llama 3.1 405B model to generate tailored synthetic data for your specific domain and explore how to evaluate this data using the Nemotron-4 340B Reward model and ensure alignment with human preferences through NVIDIA NeMo.

[Watch Now](https://www.youtube.com/watch?v=lqbYP6Cysgw)

### Build World-Class AI Virtual Assistants for Customer Service with RAG

Learn how companies can use the AI virtual assistant for customer service NVIDIA AI Blueprint to improve the operational efficiency of existing contact center solutions or build new customer service-centric systems.

[Watch Now](https://www.youtube.com/watch?v=0nQ3lawmIN0)