How to Deploy NVIDIA NIM in 5 Minutes

  • Published 10 Sep 2024
  • NVIDIA NIM is a set of microservices for deploying AI models. Tap into the latest AI foundation models, such as Stable Diffusion, ESMFold, and Llama 3, with downloadable NIM microservices for your application deployment.
    Join Neal Vaidya, developer advocate at NVIDIA, for a demo on how to quickly deploy NVIDIA NIM microservices locally with Python or programmatically through Docker. This tutorial focuses on deploying Llama 3.
    0:22 - Overview of NIM microservices (nvda.ws/4bZLY9E)
    0:36 - Test the Llama 3 model on a web browser with a hosted API
    0:51 - Generate an API key and get sample code snippets
    0:59 - Test the Llama 3 model in a self-hosted environment
    1:08 - Get access to the API catalog to begin self-hosted deployment
    1:22 - Pre-install Docker engine and Docker CLI tool
    1:50 - Authenticate your container
    1:55 - Set an environment variable holding the NGC API key
    2:05 - Input a single Docker run command
    2:19 - Expose all GPUs to the running container
    2:28 - Expose the API environment variable
    2:35 - Mount the cache to download and store model weights
    2:48 - Specify that the NIM should run as the local user
    2:53 - Expose the main port to interact with the running NIM
    3:03 - Add the model name to the image path
    3:30 - Confirm the service is ready in another terminal using curl
    3:41 - Send a request to the container
    Developer resources:
    ▫️ Learn more about NIM: nvda.ws/3yqsuNw
    ▫️ Join the NVIDIA Developer Program: nvda.ws/3OhiXfl
    ▫️ Access downloadable NIM microservices on the API catalog: nvda.ws/4bZLY9E
    ▫️ Read the Mastering LLM Techniques series to learn about inference optimization, LLM training, and more: resources.nvid...
    #inferencemicroservices #inferenceoptimization #api #selfhosting #modeldeployment #aimodel #LLM #generativeai #aimicroservices #nvidianim #generativeaideployment #aiinference #productiongenai #enterprisegenerativeai #acceleratedinference #nvidiaai #apicatalog
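
The self-hosted steps above (1:22 through 3:03: authenticate, set the key, and issue a single `docker run` command) can be sketched as a shell session. The image tag `nvcr.io/nim/meta/llama3-8b-instruct` and the cache path are assumptions based on NVIDIA's published NIM examples; check the API catalog page for your model to get the exact image name.

```shell
# Authenticate with NVIDIA's container registry (nvcr.io) using your NGC API key.
export NGC_API_KEY="<paste your key from the API catalog>"
echo "$NGC_API_KEY" | docker login nvcr.io --username '$oauthtoken' --password-stdin

# Create a local cache directory so model weights are downloaded and stored only once.
export LOCAL_NIM_CACHE=~/.cache/nim
mkdir -p "$LOCAL_NIM_CACHE"

# Single docker run command:
#   --gpus all      expose all GPUs to the running container
#   -e NGC_API_KEY  pass the API key environment variable into the container
#   -v ...          mount the cache used to download and store model weights
#   -u "$(id -u)"   run the NIM as the local user
#   -p 8000:8000    expose the main port used to interact with the running NIM
docker run -it --rm \
  --gpus all \
  -e NGC_API_KEY \
  -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" \
  -u "$(id -u)" \
  -p 8000:8000 \
  nvcr.io/nim/meta/llama3-8b-instruct:latest
```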
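
Once the container is up, the readiness check and first request (3:30 and 3:41 above) look roughly like this from another terminal. The `/v1/health/ready` path and the OpenAI-compatible `/v1/chat/completions` endpoint follow NVIDIA's NIM examples, and the model name is an assumption matching the image sketched above.

```shell
# Confirm the service is ready (returns HTTP 200 once the model is loaded).
curl http://localhost:8000/v1/health/ready

# Send the running NIM a chat completion request (OpenAI-compatible API).
curl -X POST http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "meta/llama3-8b-instruct",
        "messages": [{"role": "user", "content": "Write a limerick about GPUs."}],
        "max_tokens": 64
      }'
```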

Comments • 9

  • @TristanVash38 • 1 month ago +1

    Awesome! Thanks, NVIDIA Team!

  • @infraia • 14 days ago

    Things are moving fast! Exciting times!

  • @cho7official55 • 1 month ago +1

    Very nice tutorial, thanks a lot

  • @JayMatth • 1 month ago

    Very nice! I will surely have a look at this :)

  • @MeownaMeow • 1 month ago

    Nice, thanks 😁

  • @saitaro • 1 month ago +1

    Where is the notebook in the description?

    • @StiekemeHenk • 1 month ago

      Rip, I think it's this article? Build a RAG using a locally hosted NIM

  • @wyattx008
    @wyattx008 Před měsícem +1

    So... can you make me some money? 😇