Settings

Theme

Nvidia Blackwell Ultra (GB300): -97% INT8/FP64,+50% FP4 Dense,+55% VRAM,+114% At

resources.nvidia.com

8 points by Sweepi 4 months ago · 3 comments

Reader

nabla9 4 months ago

96% TCO and energy savings for 65 racks eight-way HGX H100 air-cooled versus 1 rack GB200 NLV72 liquid-cooled with equivalent performance on GPT-MoE-1.8T real-time inference throughput.

Big if true. Energy and cooling costs can represent up to 30-40% of the total cost of setting up and running an AI data center.

SweepiOP 4 months ago

[+114% Attention acceleration] Any idea how they got +50% FP4 from the same silicon? "Firmware" improvements? Or did they found a way to disable the INT8 and FP64 units and re-use them e.g. as overspill registers? Any other ideas why INT8/FP64 is down -97% on the same chip? QA/certification issues?

In case you you want to compare the complete specs, I would post them here, but since hn supports less formatting than early 2000s bb-forums, check it here: https://www.forum-3dcenter.org/vbulletin/showpost.php?p=1380...

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection