AWS ParallelCluster | AWS HPC Blog

Recent improvement to Open MPI AllReduce and the impact to application performance

Our team engineered some Open MPI optimizations for EFA to enhance performance of HPC codes running in the cloud. By improving MPI_AllReduce they improved scaling – matching commercial MPIs. Tests show gains for apps including Code Saturne and OpenFOAM on both Arm64 and x86 instances. Check out how these tweaks can speed up your HPC workloads in the cloud.

Securing HPC on AWS – isolated clusters

In this post, we’ll share two ways customers can operate HPC workloads using AWS ParallelCluster while completely isolated from the Internet. ParallelCluster supports many different network configurations to support a range of uses. When referring to isolation we mean situations where your HPC cluster is completely self-contained inside AWS, or where you have a private […]

Electronic design at the speed of Lightmatter: transforming EDA workloads with RES

Check out this post to learn how Lightmatter leveraged AWS tools like Research and Engineering Studio (RES) and AWS ParallelCluster to meet demanding computational requirements for electronic design.

HPC Ops: DevOps for HPC workloads in the cloud

Interested in boosting productivity for HPC users and admins? Read this post to learn how HPC Ops (infrastructure as code + CI/CD) delivers faster, more consistent infrastructure deployments and pain free experiences for your users.

Improve HPC workloads on AWS for environmental sustainability

Need to cut your carbon footprint without sacrificing productivity? Migrating HPC workloads to the cloud allowed Baker Hughes to reduce emissions by 99%! Get tips for optimizing compute, storage, networking so you can do better.

A library of HPC Applications Best Practices on AWS

Want insights on running HPC codes efficiently on AWS? Our HPC specialists compiled their know-how into a new public GitHub repo. Get best practices, templates, scripts and more to optimize your workloads.

Call for participation: HPC tutorial series from the HPCIC

Interested in getting hands-on experience with cutting-edge HPC tools? Check out this blog post on an upcoming virtual training series from @LLNL and @AWSCloud. Learn emerging technologies from the experts this August.

Securing HPC on AWS: implementing STIGs in AWS ParallelCluster

Want to accelerate creating compliant Amazon EC2 images? Learn how HPC users can leverage cloud-native methods for applying STIG security standards.

Large scale training with NeMo Megatron on AWS ParallelCluster using P5 instances

Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances

Launching distributed GPT training? See how AWS ParallelCluster sets up a fast shared filesystem, SSH keys, host files, and more between nodes. Our guide has the details for creating a Slurm-managed cluster to train NeMo Megatron at scale.

Best practices for running molecular dynamics simulations on AWS Graviton3E

If you run molecular dynamics simulations, you need to read this. We walk through running benchmarks of popular apps like GROMACS and LAMMPS on new Hpc7g instances and Graviton3E processors. The results – up to 35% better vector performance versus Graviton3! Learn how to optimize your own workflows.

AWS HPC Blog

Category: AWS ParallelCluster