NVIDIA AI Foundation Models and Endpoints

Optimized for enterprise generative AI.

Introduction
Benefits
AI Models
Success Stories
Partners
Get Started

Introduction
Benefits
AI Models
Success Stories
Partners
Get Started

What Are NVIDIA AI Foundation Models and Endpoints?

NVIDIA AI Foundation models are community and NVIDIA-built models and are NVIDIA-optimized to deliver the best performance on NVIDIA accelerated infrastructure. Enterprises can customize and deploy these models with NVIDIA microservices and streamline the transition to production AI.

Explore the NVIDIA API catalog and experience the models directly from a browser or connect to NVIDIA-hosted endpoints and start POC for free.

Accelerate Time to Production AI

Deploy the NVIDIA AI Foundation models at scale with NVIDIA NIM—a set of easy-to-use microservices that ensures seamless, scalable inference, on-premises or in the cloud, leveraging industry-standard APIs.

Explore NVIDIA AI Platform

Build Custom Generative AI Models for Enterprise Applications

The NVIDIA AI foundry service—a collection of NVIDIA AI Foundation Models, NVIDIA NeMo™ framework and tools, and NVIDIA DGX™ Cloud gives enterprises an end-to-end solution for creating custom generative AI models.

Start with State-of-the-Art Generative AI Models

Try leading foundation models, including Llama 2, Stable Diffusion, and NVIDIA’s Nemotron-3 8B family, optimized for the highest performance efficiency.

Experience NVIDIA AI Foundation Models

Customize the Foundation Models

Tune and test the models with proprietary data using NVIDIA NeMo.

Customize With NVIDIA NeMo

Build Models Faster in the Cloud

Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.

Train on NVIDIA DGX Cloud

Run Models in Production

Deploy custom and NVIDIA AI Foundation Models anywhere with enterprise-grade NVIDIA NIM.

Scale With NVIDIA NIM

Start with State-of-the-Art Generative AI Models

Try leading foundation models, including Llama 2, Stable Diffusion, and NVIDIA’s Nemotron-3 8B family, optimized for the highest performance efficiency.

Experience NVIDIA AI Foundation Models

Customize the Foundation Models

Tune and test the models with proprietary data using NVIDIA NeMo.

Customize With NVIDIA NeMo

Build Models Faster in the Cloud

Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.

Train on NVIDIA DGX Cloud

Run Models in Production

Deploy custom and NVIDIA AI Foundation Models anywhere with enterprise-grade NVIDIA NIM.

Scale With NVIDIA NIM

Benefits of NVIDIA AI Foundation Models and Endpoints

Performance Optimized

Lower your TCO and increase energy efficiency by running inference up to 4x faster.

Enterprise-Grade

Use lean, high-performing large language models (LLMs) built from responsibly sourced datasets.

Try Models on the Fly

Experience a models’ peak performance directly from a browser with a GUI or API.

Ready-to-Integrate APIs

Connect your applications to API endpoints and test their real-world performance running on a fully-accelerated stack.

Deploy Your Models Anywhere

Run the model anywhere, from cloud to data center to workstations, with NVIDIA AI Enterprise.

Experience-Optimized Generative AI Models

NVIDIA AI Foundation Models include leading community- and NVIDIA-built models to support various use cases, including content generation, image creation, drug discovery, and IT service automation.

Llama 2

Llama 2 is a large language AI model capable of generating text and code in response to prompts.

Try Llama 2

Stable Diffusion XL

Stable Diffusion XL (SDXL) generates expressive images with shorter prompts and inserts words inside images.

Try SDXL

Nemotron-3-8B-QA

Nemotron-3 8B is an enterprise-grade Question-Answering LLM that enterprises can customize for their domains.

Try Nemotron-3-8B-QA

View All Models

Power Your Enterprise Applications With Retrieval-Augmented Generation (RAG)

Build AI chatbots that connect with your custom LLMs and knowledge bases to accurately and naturally answer domain-specific questions in real time.

Explore the RAG AI Workflow

Success Stories

Generative AI is impacting every industry today—from IT services and telecommunications to finance and retail. Putting generative AI into practice requires enterprises to have access to an AI foundry to build custom models using proprietary data and deploy them at scale. See how the world’s leading organizations are serving their customers with NVIDIA AI.

ServiceNow

ServiceNow is bringing intelligent workflow automation to their Now Platform with custom LLMs using NVIDIA AI Foundation Models and NVIDIA NeMo on NVIDIA DGX.

Learn More

Amdocs

Amdocs is building custom LLMs for the $1.7 trillion global telecommunications industry using the NVIDIA AI foundry service on Microsoft Azure.

Learn More

cont-1
cont-2

Ecosystem Partners

Let’s Get Started

Try the latest, fully optimized NVIDIA AI Foundation Models today from the NGC catalog, Azure ML model catalog, or Hugging Face.

Experience the Models

Notify me as new models are optimized and added to NVIDIA’s collection of AI foundation models.

Notify Me

Explore additional generative AI resources and tools.

Learn More