AWS HPC Blog
Tag: EC2 Spot
Electronic design at the speed of Lightmatter: transforming EDA workloads with RES
Check out this post to learn how Lightmatter leveraged AWS tools like Research and Engineering Studio (RES) and AWS ParallelCluster to meet demanding computational requirements for electronic design.
Running FSI workloads on AWS with YellowDog
Financial services firms: we stress-tested YellowDog’s HPC environment to see if it could handle a 10m task batch at 3,000 tasks per second. Check out the results.
Job queue snapshots: see what’s at the head of your queues in AWS Batch
AWS Batch just grew a neat new feature: Job queue snapshots. This gives you the visibility you need for managing throughput in a dynamic environment – with competing priorities – and across multiple queues and workloads. Today we give you the inside scoop on how this feature works – especially when you’re using fair share scheduling.
Save up to 90% using EC2 Spot, even for long-running HPC jobs
New OS-level checkpointing tools can let you run existing HPC codes on EC2 Spot instances with minimal impact from interruptions. Read on for the details.
Support for Instance Allocation Flexibility in AWS ParallelCluster 3.3
AWS ParallelCluster 3.3.0 now lets you define a list of Amazon EC2 instance types for resourcing a compute queue. This gives you more flexibility to optimize the cost and total time to solution of your HPC jobs, especially when capacity is limited or you’re using Spot Instances.
Running cost-effective GROMACS simulations using Amazon EC2 Spot Instances with AWS ParallelCluster
In this blog post, we cover how to run GROMACS – a popular open source designed for simulations of proteins, lipids, and nucleic acids – cost effectively by leveraging EC2 Spot Instances within AWS ParallelCluster. We also show how to checkpoint GROMACS to recover gracefully from possible Spot Instance interruptions.
Using Spot Instances with AWS ParallelCluster and Amazon FSx for Lustre
Processing large amounts of complex data often requires leveraging a mix of different Amazon EC2 instance types. These types of computations also benefit from shared, high performance, scalable storage like Amazon FSx for Lustre. A way to save costs on your analysis is to use Amazon EC2 Spot Instances, which can help to reduce EC2 costs up to 90% compared to On-Demand Instance pricing. This post will guide you in the creation of a fault-tolerant cluster using AWS ParallelCluster. We will explain how to configure ParallelCluster to automatically unmount the Amazon FSx for Lustre filesystem and resubmit the interrupted jobs back into the queue in the case of Spot interruption events.
Cost-optimization on Spot Instances using checkpoint for Ansys LS-DYNA
A major portion of the costs incurred for running Finite Element Analyses (FEA) workloads on AWS comes from the usage of Amazon EC2 instances. Amazon EC2 Spot Instances offer a cost-effective architectural choice, allowing you to take advantage of unused EC2 capacity for up to a 90% discount compared to On-Demand Instance prices. In this post, we describe how you 0can run fault-tolerant FEA workloads on Spot Instances using Ansys LS-DYNA’s checkpointing and auto-restart utility.