Accelerating Science with AI in HPC

By Louis Vistola, Technology Evangelist

October 16, 2023

High-performance computing (HPC) has played a major role in advancing scientific research for decades, using extremely large datasets and sophisticated modeling that mimics the physical world. The ability to complement the power and capabilities of HPC with artificial intelligence (AI) is advancing rapidly, accelerating innovation and delivering faster outcomes.

I had the opportunity to talk with Radhika Rao, Senior Director of Data Center GPU Product Management at Intel, to discuss this AI transformation and its impact on the HPC landscape.

Q: With the rise of AI, what do HPC leaders and developers need to consider around AI? Why now?

Rao: In the last year alone, we’ve seen explosive growth in the use of AI across all industries. In HPC, we have reached a pivotal moment where we’re seeing a true HPC and AI convergence. We have been talking about it for a long time, and now we can see it happening. AI is helping advance models and codes in physics, weather, manufacturing, and many more areas.

What is making this so relevant now is that AI has become mainstream due to the popularity and wide-scale use of ChatGPT (large language models) and generative AI. This trend is making it more important to view HPC and AI as a converged space to drive advances in science.

Q: What evolving requirements should HPC leaders consider when looking to invest in next-gen environments for accelerating HPC and AI?

Rao: HPC workloads have traditionally had a very specific CPU-to-GPU ratio and compute profile. In the last couple of years, we’ve seen models change and become far more dynamic with increasing compute and scale requirements. As a result, the need for architecture flexibility to run these data-intensive workloads across heterogeneous environments has become critical, as well as the need for increased memory bandwidth and memory capacity. Another area to consider is sustainability requirements with respect to power and environmental impact. When building out large clusters to solve such problems as climate change and sustainability, you don’t want to be part of the problem. We must consider the sustainability footprint of the data center and new technology investments to ensure they are not creating a negative impact on the environment (see June article on Top Considerations for HPC, AI and Sustainability (hpcwire.com)).

Q: How has Intel’s portfolio advanced to address this HPC and AI convergence?

Rao: CPUs, and particularly Intel x86 processors, have been the backbone of HPC systems for decades. And now we are seeing powerful AI capabilities being infused into every aspect of compute, including in the HPC space. Intel’s CPUs are now complemented with a variety of built-in and discrete accelerators and GPUs. For example, the Advanced Matrix Extensions (AMX) built into the 4th Gen Intel® Xeon® Scalable processors deliver 10x higher inference and training performance.[1] Intel® Data Center GPU Max Series delivers up to 2x performance gain on HPC and AI workloads over competition.[2] Recent MLPerf AI inference results spotlight the Intel® Gaudi®2 accelerator as the only viable alternative on the market for dedicated AI compute needs. Additionally, Intel is the only vendor to submit public CPU results on 4th Gen Intel Xeon and Intel Xeon Max Series with industry-standard, deep-learning ecosystem software.

Our portfolio is supported by a full suite of AI and HPC software development tools. Developers have traditionally been required to use proprietary software to code and run AI and HPC models specific to each platform. With a new suite of open-source software, such as the Intel oneAPI toolkit, developers now have freedom of choice. They can program once and then run the code on different hardware, even shifting the underlying hardware mix over time to suit the needs of a particular workload. The oneAPI programming model supports Intel’s full hardware portfolio, as well as solutions from competitors.

Q: What are some examples of how Intel is working across the ecosystem on this HPC and AI convergence?

Rao: Technology adoption is the key to converging HPC and AI into one system to advance scientific research. One example is the work we are doing with the Aurora Exascale Supercomputer at the Argonne Leadership Computing Facility (ALCF), a Department of Energy Office of Science User Facility at Argonne National Laboratory, and Hewlett Packard Enterprise. Aurora, which is being built on Intel® Max Series CPUs and GPUs, will offer researchers high computing speed and artificial intelligence capabilities to enable science that is not possible today. Earlier this year, Intel and Argonne National Lab announced the full Aurora specifications and efforts (with partners) to bring the power of generative AI and large language models (LLMs) to science and society.

Beyond Aurora, there is much work being done to bring HPC and AI together. We have several software partners that are using oneAPI on Intel hardware to bring AI into some of the places that are very specific to HPC use cases. One example is Ansys, which is combining the power of both the Intel Max Series GPUs and 4th Gen Intel Xeon processors to add AI capabilities to its applications. We are also deeply engaged with the AI and HPC software ecosystem, optimizing popular developer tools like PyTorch and TensorFlow.

Q: What’s one piece of advice you have for HPC leaders looking to invest in AI?

Rao: When adding AI capabilities to an HPC environment, the last thing anyone wants is to incur more costs or delays because complex codes have to be ported from one programming model to another. Intel has made significant investments in both the hardware and software needed to run, scale and protect investments in modern HPC centers. The convergence of HPC and AI is making it even more important to adopt open standards, like oneAPI, so researchers can focus on delivering scientific breakthroughs faster and with greater precision.

One of the newest ways HPC technologists and developers can build, test and optimize AI and HPC applications is on the newly launched Intel® Developer Cloud. The Intel Developer Cloud provides developers access to the latest Intel HPC and AI technologies, including Intel Gaudi2 processors for deep learning, and the latest Intel hardware platforms, such as the 5th Gen Intel® Xeon® Scalable processors and Intel® Data Center GPU Max Series 1100 and 1550.

Learn more about how Intel’s HPC and AI portfolio is helping customers achieve outstanding results for demanding workloads and the complex problems they solve here.


[1] See [A16] and [A17] at intel.com/processorclaims: 4th Gen Intel® Xeon® Scalable processors. Results may vary.
[2] Visit intel.com/performanceindex (Events: Supercomputing 22) for workloads and configurations. Results may vary.
