Server Leftovers
-
Testing The Mettle Of The Top Supercomputing Iron On Earth
There are 179 supercomputers on the Top500 list that use one or another form of accelerator to do the bulk of their calculations, and of these, there are 84 machines that use Nvidia’s “Volta” V100 GPUs, 64 that use Nvidia’s “Ampere” A100 GPUs. There is a single machine using Nvidia’s “Hopper” H100 GPUs, and that is the “Henri” system at the Flatiron Institute in New York City. The Henri machine is built by Lenovo using its ThinkSystem SR670 V2 servers with a pair of 32-core Ice Lake Xeon SP-8362 Platinum processors and a quad of H100 GPUs with 80 GB of HBM3 memory. The Henri nodes are linked by 200 Gb/sec HDR InfiniBand from Nvidia, and the machine has a total of 5,920 CPUs and SMs and a peak performance of 5.42 petaflops and a sustained performance of 2.04 petaflops. That is only a 37.6 percent computation efficiency, which is raising our eyebrows a bit, but the machine only used 31.3 kilowatts and thus delivers 65.1 gigaflops per watt. That’s a smidgen better than the Frontier development system (which has lower Slingshot overhead than the actual Frontier machine) and quite a bit better than Frontier itself.
[...]
For years and years, we have been complaining that the middle and some of the bottom of the Top500 list is dominated by telcos, service providers, and clouds who run Linpack tests on portions of their infrastructure to give their OEMs and nationalities a boost on the list. We do not expect this to change, and it warps the list in ways that distort the true HPC market.
-
Canonical at AWS re:Invent 2022 | Ubuntu
AWS re:Invent is a learning conference hosted by Amazon Web Services for the global cloud computing community. The in-person event features keynote announcements, training and certification opportunities, access to technical sessions, and so much more.
-
Can Rancher Help You Better Manage Kubernetes? - Earthly Blog
Recently, multi-cloud and hybrid cloud deployments have gained significant traction as they let you optimize costs, increase scalability, improve agility, and achieve greater operational resilience. However, with these deployment strategies, managing different Kubernetes clusters with multiple tools and dashboards can be a challenge; Rancher can help you seamlessly manage such deployments at scale.
This article will explore the features and capabilities of Rancher, an open source Cloud Native Computing Foundation (CNCF) certified Kubernetes distribution designed to make it easy to deploy, manage, and monitor multi-cluster environments from a centralized UI. Here, you’ll learn about Rancher and the different deployment options it provides, and understand the aspects that make it unique.