Compute | AWS HPC Blog

EFA: how fixing one thing, lead to an improvement for … everyone

EFA: how fixing one thing, led to an improvement for … everyone

Today, we’re diving deep into the open-source frameworks that move MPI messages around, and showing you how work we did in the Open MPI and libfabrics community lead to an improvement for EFA users – and everyone else, too.

Why you should use Fargate with AWS Batch for your serverless batch architectures

AWS Batch recently added support for Graviton and Windows containers on Fargate. Read about how these and other features like large task sizes and configurable local storage make AWS Batch on Fargate a fantastic serverless solution for your batch workloads.

Introducing login nodes in AWS ParallelCluster

AWS ParallelCluster 3.7 now supports adding login nodes to your cluster, out of the box. Here, we’ll show you how to set this up, and highlight some important tunable options for tweaking the experience.

Financial services industry HPC migrations using AWS ParallelCluster with Slurm

In this post, we’ll walk you through how banks and other financial services firms migrate or burst their grid workloads onto AWS using AWS ParallelCluster and the Slurm scheduler.

Conceptual design using generative AI and CFD simulations on AWS

In this post we’ll show how generative AI, combined with conventional physics-based CFD can create a rapid design process to explore new design concepts in automotive and aerospace from just a single image.

Implementing AWS ParallelCluster in a Shared VPC

In this post we’ll show you how to deploy ParallelCluster in a shared VPC environment so you can separate infrastructure management, cluster operations, and help segregate costs, too.

Introducing a community recipe library for HPC infrastructure on AWS

Today we’re showing you our community library of HPC Recipes for AWS. It’s a public repo @github that will help you achieve feature-rich, reliable HPC deployments ready to run your workloads no matter where you’re starting from.

Real-time quant trading on AWS

In this post, we’ll show you an open-source solution for a real-time quant trading system that you can deploy on AWS. We’ll go over the challenges brought on by monitoring portfolios, the solution, and its components. We’ll finish with the installation and configuration process and show you how to use it.

How Maxar builds short duration ‘bursty’ HPC workloads on AWS at scale

In this post, we hear from Maxar’s WeatherDesk team on how they deploy their HPC workloads using a “fail fast” software development technique so they can be sure of meeting customer deadlines for their business.

How Amazon’s Search M5 team optimizes compute resources and cost with fair-share scheduling on AWS Batch

In this post, we share how Amazon Search optimizes their use of accelerated compute resources using AWS Batch fair-share scheduling to schedule distributed deep learning workloads.

Category: Compute