
NVIDIA Nemotron-4 340B Q8_0 running on AMD Epyc 9374F - real time generation speed

  • uploaded 11 Jul 2024
  • I was a bit disappointed that no one had yet run NVIDIA Nemotron-4 340B on a CPU, so I took up the challenge myself and here are the initial results after 3 days of work.

Comments • 6

  • @rehanahmed1939 · 16 days ago

    Can you please create a full video of how you did it, or point me to any other source where I can learn?

    • @dreamingfairy8804 · 15 days ago

      You want to run Nemotron-4 340B on your PC? The following steps are needed:
      1) Download the model from huggingface.co/nvidia/Nemotron-4-340B-Instruct
      2) Convert the model to safetensors format with this script: github.com/fairydreaming/export-nemo-to-safetensors (install the script's requirements first)
      3) Download my branch of llama.cpp: github.com/fairydreaming/llama.cpp/tree/nemotron
      4) Compile llama.cpp from source code
      5) Convert the safetensors model to GGUF with llama.cpp's convert_hf_to_gguf.py script (install the conversion script's requirements first)
      6) Quantize the model with llama-quantize so that it fits in your RAM
      7) Run the quantized model as shown in the video (see the command sketch below)
      Good luck!
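      A rough sketch of the commands for steps 3-7 (not from the video; directory names, output file names, and the thread count are placeholders, and it assumes the nemotron branch builds and converts like mainline llama.cpp):
      # steps 3-4: get the nemotron branch and build it
      git clone --branch nemotron https://github.com/fairydreaming/llama.cpp
      cd llama.cpp
      cmake -B build
      cmake --build build --config Release -j
      # step 5: convert the safetensors checkpoint to GGUF (install the Python requirements first)
      pip install -r requirements.txt
      python convert_hf_to_gguf.py /path/to/nemotron-safetensors --outfile nemotron-340b-f16.gguf --outtype f16
      # step 6: quantize to Q8_0 (as in the video) so the weights fit in RAM
      ./build/bin/llama-quantize nemotron-340b-f16.gguf nemotron-340b-q8_0.gguf Q8_0
      # step 7: run the quantized model
      ./build/bin/llama-cli -m nemotron-340b-q8_0.gguf -p "Hello" -n 128 -t 32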

  • @johann09 · 24 days ago

    How much RAM? This is insane

    • @dreamingfairy8804 · 24 days ago · +4

      12 channels x 32 GB = 384 GB
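      (Rough back-of-the-envelope, not a figure from the video: Q8_0 stores about 8.5 bits per weight, so the 340B weights alone take roughly 340e9 x 8.5 / 8 bytes ≈ 360 GB, leaving a few dozen GB of headroom in 384 GB for the KV cache and the OS.)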

    • @Philip8888888 · 24 days ago

      @dreamingfairy8804 Dare I ask, how much does such a server cost to buy, and how many watts does it burn during inference and at idle?