AWS HPC Blog
Category: AWS ParallelCluster
Advancing research in the cloud: AWS announces expanded training resources
AWS is investing in researcher training with new learning plans for HPC, quantum, stats, AI/ML & generative AI. Check out the details!
Cross-account HPC cluster monitoring using Amazon EventBridge
Managing extensive HPC workflows? This post details how to monitor resource consumption without compromising security. Check it out for a customizable reference architecture that sends only relevant data to your monitoring account.
Simulating complex systems with LLM-driven agents: leveraging AWS ParallelCluster for scalable AI experiments
How might AI change the rules of the energy game? A new post explores using large language models to power smarter, more adaptive agents in an energy supply chain simulation. Learn how LLMs could enable more nuanced decision-making behaviors.
A guide to identity management in Research and Engineering Studio on AWS
Check out this new post to learn about identity options for Research and Engineering Studio on AWS. Understanding choices for SAML IdPs and Active Directory will help you plan secure VDI access.
Recent improvement to Open MPI AllReduce and the impact to application performance
Our team engineered some Open MPI optimizations for EFA to enhance performance of HPC codes running in the cloud. By improving MPI_AllReduce they improved scaling – matching commercial MPIs. Tests show gains for apps including Code Saturne and OpenFOAM on both Arm64 and x86 instances. Check out how these tweaks can speed up your HPC workloads in the cloud.
Securing HPC on AWS – isolated clusters
In this post, we’ll share two ways customers can operate HPC workloads using AWS ParallelCluster while completely isolated from the Internet. ParallelCluster supports many different network configurations to support a range of uses. When referring to isolation we mean situations where your HPC cluster is completely self-contained inside AWS, or where you have a private […]
Electronic design at the speed of Lightmatter: transforming EDA workloads with RES
Check out this post to learn how Lightmatter leveraged AWS tools like Research and Engineering Studio (RES) and AWS ParallelCluster to meet demanding computational requirements for electronic design.
HPC Ops: DevOps for HPC workloads in the cloud
Interested in boosting productivity for HPC users and admins? Read this post to learn how HPC Ops (infrastructure as code + CI/CD) delivers faster, more consistent infrastructure deployments and pain free experiences for your users.
Improve HPC workloads on AWS for environmental sustainability
Need to cut your carbon footprint without sacrificing productivity? Migrating HPC workloads to the cloud allowed Baker Hughes to reduce emissions by 99%! Get tips for optimizing compute, storage, networking so you can do better.
A library of HPC Applications Best Practices on AWS
Want insights on running HPC codes efficiently on AWS? Our HPC specialists compiled their know-how into a new public GitHub repo. Get best practices, templates, scripts and more to optimize your workloads.