Elon Musk xAI Colossus AI supercomputer with 100,000 NVIDIA H100 AI GPUs gets in-depth look

Take a look inside of the world's largestr AI supercluster, with Elon Musk's xAI supercomputer powered with 100,000 x NVIDIA H100 AI GPUs.

Elon Musk xAI Colossus AI supercomputer with 100,000 NVIDIA H100 AI GPUs gets in-depth look
Comment IconFacebook IconX IconReddit Icon
Gaming Editor
Published
3 minutes read time

As an Amazon Associate, we earn from qualifying purchases. TweakTown may also earn commissions from other affiliate partners at no extra cost to you.

TL;DR: Elon Musk's xAI startup has built the Colossus AI supercomputer, powered by 100,000 NVIDIA H100 AI GPUs, in just 122 days. This engineering feat, praised as "absolutely amazing," uses Supermicro liquid-cooled racks, each containing 64 GPUs, to form mini clusters.

Elon Musk's gigantic Colossus AI supercomputer from his xAI startup is powered by 100,000 x NVIDIA H100 AI GPUs, and has just had an awesome walkthrough by our friends at ServeTheHome. Check it out:

Patrick from ServeTheHome notes that the engineering accomplishment from Elon and his team at xAI is "absolutely amazing". xAI only took 122 days to build out the insane 100,000 NVIDIA H100 AI GPU-powered Colossus AI supercomputer, something that normally takes many years... but not for Elon and xAI, which is why NVIDIA CEO Jensen Huang recently called the SpaceX and Tesla boss "superhuman". More on that below:

Back to the xAI supercluster, with STH noting that the basic building block for Colossus is the Supermicro liquid-cooled rack, which features 8 x 4U servers each with 8 x NVIDIA H100 AI GPUs for a total of 64 x H100 AI GPUs per rack. 8 of these GPU servers + Supermicro Coolant Distribution Unit (CDU) and required hardware make up one of these GPU compute racks.

The racks are arranged in groups of 8 for a total of 512 AI GPUs (64 x 8) + networking to create mini clusters within a much larger system.

Elon Musk xAI Colossus AI supercomputer with 100,000 NVIDIA H100 AI GPUs gets in-depth look 805

In the image above, we've got xAI using the Supermicro 4U Universal GPU system, which ServeTheHome notes are the "most advanced AI servers on the market right now, for a few reasons". One of these reasons is the degree of liquid cooling, the other is how servicable they are, adds STH.

STH explained in its YouTube description: "We FINALLY get to show the largest AI supercomputer in the world, xAI Colossus. This is the 100,000 (at the time we filmed this) GPU cluster in Memphis Tennessee that has been on the news a lot. This video has been five months in the making, and finally Elon Musk gave us the green light to not just film, but also show everyone the Supermicro side of the cluster".

Elon Musk xAI Colossus AI supercomputer with 100,000 NVIDIA H100 AI GPUs gets in-depth look 807

"This video is being sponsored by Supermicro and that is why we are only showing the Supermicro side, which is the more advanced side. Unlike our normal content creation, this video had to be reviewed by Elon and his team before going live and we were asked to blur out portions at their request".

You can read the written form of STH's awesome walkthrough of xAI's new Colossus AI supercomputer here.

NEWS SOURCE:servethehome.com
Follow TweakTown on Google News

Gaming Editor

Email IconX IconLinkedIn Icon

Anthony joined the TweakTown team in 2010 and has since reviewed 100s of graphics cards. Anthony is a long time PC enthusiast with a passion of hate for games built around consoles. FPS gaming since the pre-Quake days, where you were insulted if you used a mouse to aim, he has been addicted to gaming and hardware ever since. Working in IT retail for 10 years gave him great experience with custom-built PCs. His addiction to GPU tech is unwavering and has recently taken a keen interest in artificial intelligence (AI) hardware.

Related Topics

Newsletter Subscription