AWS Cloud Operations Blog

Category: Intermediate (200)

How Rackspace uses AWS Systems Manager for instance patching across multi-cloud and hybrid environments

How Rackspace uses AWS Systems Manager for instance patching across multi-cloud and hybrid environments

This post was written in collaboration with Ryan Walker, a Principal Engineer with the Solutions and Services Engineering team at Rackspace. Now that cloud technology has become ubiquitous, companies have many options for hosting servers and building out solutions. From the use of multiple cloud providers to bare metal devices to private cloud, or even […]

Enhance CloudWatch metrics with metric math functions

Enhance CloudWatch metrics with metric math functions

In June 2021, the Amazon CloudWatch team launched 14 new metric math functions. In this blog post, I’ll describe these new functions and show how you can use them to enhance your existing CloudWatch metrics, dashboards, and alarms. Metrics are an important part of observability and monitoring. A numerical representation of data measured over time, […]

Create fine-grained CloudWatch canary schedules with cron expressions

Create fine-grained CloudWatch canary schedules with cron expressions

In this post, I’ll explain how to create fine-grained canary schedules to meet your business requirements using built-in cron expression scheduling in Amazon CloudWatch Synthetics. You can use CloudWatch Synthetics to create canaries, configurable scripts that run on a schedule, to monitor your endpoints and APIs. Because canaries follow the same routes and perform the […]

Implement operations observability in landing zone environments

Implement operations observability in landing zone environments

In an earlier blog post, Automate customized deployment of cross-account/cross-region CloudWatch dashboards using tags, we showed you how to implement Amazon CloudWatch dashboards for specific events with automation. This solution is great for seasonal events, holidays, important releases, and other use cases. In this blog post, we will review a landing zone environment and share a […]

Use AWS Systems Manager Session Manager for port forwarding to Amazon ElastiCache for Redis inside a private subnet

Use AWS Systems Manager Session Manager for port forwarding to Amazon ElastiCache for Redis inside a private subnet

With the increasing adoption of the public cloud, customers must minimize the attack surface of their infrastructure. When it comes to optimizing the response time of read-intensive applications, data caching is one of the first steps to consider. Amazon ElastiCache for Redis is versatile in-memory storage that offers highly available, highly scalable, and extremely fast […]

Scaling Well-Architected reviews with the AWS Well- Architected Tool

Scaling Well-Architected reviews with the AWS Well-Architected Tool

The AWS Well-Architected Framework describes key concepts and architectural best practices that help cloud architects build secure, high-performing, resilient, and efficient infrastructure for their applications and workloads in the cloud. An AWS Well-Architected Review includes questions around the AWS Well-Architected Framework that can help application owners ensure that their workloads are following best practices. The […]

How to easily configure DevOps Guru across your organization with Systems Manager Quick Setup

How to easily configure DevOps Guru across your organization with Systems Manager Quick Setup

Amazon DevOps Guru is a machine learning (ML) powered service that helps developers and operators automatically detect anomalies and improve application availability. DevOps Guru utilizes machine learning models informed by years of Amazon.com and AWS operational excellence in order to identify anomalous application behavior (e.g., increased latency, error rates, resource constraints) and surface critical issues that […]

Operational insights in Systems Manager OpsCenter help you identify duplicate issues and noisy event sources

Operational insights in Systems Manager OpsCenter help you identify duplicate issues and noisy event sources

If you use AWS Systems Manager OpsCenter, you might be familiar with the challenges of large numbers of OpsItems. When the same problem causes the creation of a significant number of OpsItems, it can be hard to see that these OpsItems are in fact the result of a single issue. It can also be difficult […]

Using AWS Migration Hub network visualization to overcome application and server dependency challenges

Using AWS Migration Hub network visualization to overcome application and server dependency challenges

When you embark on a cloud migration, it can be challenging to identify application and server dependencies. In some organizations, IT assets are maintained in a spreadsheet that is updated infrequently. Some organizations have configuration management database (CMDB) software, but it tends to be outdated or unusable. It lacks the infrastructure data that is essential […]